GenAIExamples

Author	SHA1	Message	Date
bjzhjing	c8c6fa2e3e	Provide unified scalable deployment and benchmarking support for exam… (#1315 ) Signed-off-by: Cathy Zhang <cathy.zhang@intel.com> Signed-off-by: letonghan <letong.han@intel.com> Co-authored-by: letonghan <letong.han@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> (cherry picked from commit `ed163087ba`) v1.2	2025-01-24 22:55:38 +08:00
NeuralChatBot	905a5100f9	Freeze OPEA images tag Signed-off-by: NeuralChatBot <grp_neural_chat_bot@intel.com>	2025-01-24 08:31:22 +00:00
chen, suyue	259099d19f	Remove kubernetes manifest related code and tests (#1466 ) Remove deprecated kubernetes manifest related code and tests. k8s implementation for those examples based on helm charts will target for next release. Signed-off-by: chensuyue <suyue.chen@intel.com>	2025-01-24 15:23:12 +08:00
chen, suyue	9a1118730b	Freeze the triton version in vllm-gaudi image to 3.1.0 (#1463 ) The new triton version 3.2.0 can't work with vllm-gaudi. Freeze the triton version in vllm-gaudi image to 3.1.0. Issue create for vllm-fork: HabanaAI/vllm-fork#732 Signed-off-by: chensuyue <suyue.chen@intel.com>	2025-01-24 09:50:59 +08:00
chen, suyue	ffce7068aa	Fix image on push action due to manifest test remove (#1460 ) 1. Fix image on push action due to manifest test remove. 2. Fix helm test cd workflow get test matrix step. Signed-off-by: chensuyue <suyue.chen@intel.com>	2025-01-23 14:30:09 +08:00
dolpher	9b0f98be8b	Update ChatQnA helm chart README. (#1459 ) Signed-off-by: Dolpher Du <dolpher.du@intel.com>	2025-01-23 10:54:39 +08:00
XinyuYe-Intel	f0fea7b706	Add docker compose yaml for text2image example (#1418 ) Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>	2025-01-23 09:57:54 +08:00
Melanie Hart Buehler	1864fac978	Fixes MultimodalQnA dataprep endpoint and port in the UI (#1457 ) Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2025-01-22 17:11:09 -08:00
Lianhao Lu	94f71f2322	Update top level readme (#1458 ) Add helm support of SeachQnA and Text2Image in top level readme. Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>	2025-01-23 09:07:33 +08:00
chen, suyue	6600c32a9b	remove image build condition (#1456 ) Test compose cd workflow depend on image build, so if we want to run both compose and helm charts deployment in cd workflow, this condition should be removed. Signed-off-by: chensuyue <suyue.chen@intel.com>	2025-01-23 00:17:04 +08:00
Liang Lv	d953332f43	Fix multimodal docker image issue for MutimodalQnA on Gaudi (#1455 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2025-01-23 00:12:06 +08:00
chyundunovDatamonsters	cbe5805f47	AgentQnA - add README file for deploy on ROCm (#1379 ) Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com>	2025-01-22 21:57:15 +08:00
Ervin Castelino	27fdbcab58	[chore/chatqna] Missing protocol in curl command (#1447 ) This PR fixes the missing protocol for in the curl command mentioned in chatqna readme for tei-embedding-service.	2025-01-22 21:41:47 +08:00
lkk	f07cf1dad2	Fix wrong vllm repo. (#1454 ) Use vllm-fork for gaudi. fix the issue #1451	2025-01-22 21:22:56 +08:00
dolpher	ee0e5cc8d9	Sync value files from GenAIInfra (#1428 ) All gaudi values updated with extra flags. Added helm support for 2 new examples Text2Image and SearchQnA. Minor fix for llm-uservice. Signed-off-by: Dolpher Du <dolpher.du@intel.com>	2025-01-22 17:44:11 +08:00
chen, suyue	5c36443b11	Use local hub cache for AgentQnA test (#1450 ) Use local hub cache for AgentQnA test to save workspace. Signed-off-by: chensuyue <suyue.chen@intel.com>	2025-01-22 13:23:00 +08:00
Lianhao Lu	62cea74a23	CI: improve helm CI (#1452 ) Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>	2025-01-22 09:18:35 +08:00
WenjiaoYue	b721c256f9	Fix Domain Access Issue in Latest Vite Version (#1444 ) Fix the restriction on using domain names when users are using the latest version of Vite When users use the new version of Vite, the UI cannot be accessed via domain names due to Vite's new rules. This fix adds the corresponding parameters according to Vite's new rules, ensuring that users can access the frontend via domain names when building the UI. Fixes #1441 Co-authored-by: WenjiaoYue <wenjiao.yue@intel.com>	2025-01-21 23:28:37 +08:00
chen, suyue	927698e23e	Simplify git clone code in CI test (#1434 ) 1. Simplify git clone code in CI test. 2. Replace git clone branch in Dockerfile. Signed-off-by: chensuyue <suyue.chen@intel.com>	2025-01-21 23:00:08 +08:00
ZePan110	c3e84b5ffa	Fix test matrix for helm charts (#1449 ) Signed-off-by: ZePan110 <ze.pan@intel.com>	2025-01-21 22:28:31 +08:00
ZePan110	6b2a041f25	Fix Helm-chart workflow issues. (#1448 ) Fix matrix error issues and CD test files cannot be obtained. Signed-off-by: ZePan110 <ze.pan@intel.com>	2025-01-21 21:48:57 +08:00
ZePan110	842f46326b	Switch helm-chart test runs-on label. (#1446 ) Switch helm-chart test runs-on label from ${{ inputs.hardware }} to k8s-${{ inputs.hardware }}. Signed-off-by: ZePan110 <ze.pan@intel.com>	2025-01-21 18:07:03 +08:00
Wang, Kai Lawrence	284db982be	[ROCm] Fix the hf-token setting for TGI and TEI in ChatQnA (#1432 ) This PR is to correct the env variable names in chatqna example on ROCm platform passing to the docker container of TGI and TEI. For tgi, either HF_TOKEN and HUGGING_FACE_HUB_TOKEN could be parsed in TGI while HF_API_TOKEN can be parsed in TEI. TGI: https://github.com/huggingface/text-generation-inference/blob/main/router/src/server.rs#L1700C1-L1702C15 TEI: https://github.com/huggingface/text-embeddings-inference/blob/main/router/src/main.rs#L112 Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>	2025-01-21 14:22:39 +08:00
ZePan110	fc96fe83e2	Fix CD workflow issue (#1443 ) Fix the issue of CD workflow values_files errors. Signed-off-by: ZePan110 <ze.pan@intel.com>	2025-01-21 11:54:12 +08:00
Hoong Tee, Yeoh	0316114c4b	ProductivitySuite: Fix FaqGen Microservice CI test fail (#1437 ) Change in FAQGen microservice for content-type header result in CI failure. #1431 Signed-off-by: Yeoh, Hoong Tee <hoong.tee.yeoh@intel.com>	2025-01-21 10:23:35 +08:00
chen, suyue	0408453fa2	Unify the yaml name to fix the CD workflow (#1435 ) Fix the issue in #1372 Signed-off-by: chensuyue <suyue.chen@intel.com>	2025-01-21 01:10:41 +08:00
XinyaoWa	d0cd0aaf53	Update GraphRAG to be compatible with latest component changes (#1427 ) - Updated ENV VARS to align with recent changes in neo4j dataprep and retriever. - upgraded tgi-gaudi image version Related to GenAIComps repo issue #1025 (opea-project/GenAIComps#1025) Original PR #1384 Original contributor is @rbrugaro Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> Co-authored-by: Liang Lv <liang1.lv@intel.com>	2025-01-21 00:18:01 +08:00
chen, suyue	0ba3decb6b	Simplify git clone code in CI test (#1422 ) 1. Simplify git clone code in CI test. 2. Replace git clone branch in Dockerfile. Signed-off-by: chensuyue <suyue.chen@intel.com>	2025-01-20 23:55:20 +08:00
Wang, Kai Lawrence	3d3ac59bfb	[ChatQnA] Update the default LLM to llama3-8B on cpu/gpu/hpu (#1430 ) Update the default LLM to llama3-8B on cpu/nvgpu/amdgpu/gaudi for docker-compose deployment to avoid the potential model serving issue or the missing chat-template issue using neural-chat-7b. Slow serving issue of neural-chat-7b on ICX: #1420 Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>	2025-01-20 22:47:56 +08:00
Melanie Hart Buehler	f11ab458d8	MultimodalQnA image query, pdf, dynamic ports, and UI updates (#1381 ) Per the proposed changes in this [RFC](https://github.com/opea-project/docs/blob/main/community/rfcs/24-10-02-GenAIExamples-001-Image_and_Audio_Support_in_MultimodalQnA.md)'s Phase 2 plan, this PR adds support for image queries, PDF ingestion and display, and dynamic ports. There are also some bug fixes. This PR goes with [this one in GenAIComps](https://github.com/opea-project/GenAIComps/pull/1134). Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com> Co-authored-by: Liang Lv <liang1.lv@intel.com>	2025-01-20 22:41:52 +08:00
ZePan110	f3562bef36	Add helm e2e test workflow (#1372 ) Add both CICD workflow for helm charts values test. Signed-off-by: ZePan110 <ze.pan@intel.com>	2025-01-20 21:04:11 +08:00
chen, suyue	7a54064d65	remove Dockerfile.wrapper (#1429 ) Remove Dockerfile.wrapper, it's not used anymore and no test cover this Dockerfile. So remove this Dockerfile to avoid regression. Signed-off-by: chensuyue <suyue.chen@intel.com>	2025-01-20 20:49:18 +08:00
Liang Lv	0f7e5a37ac	Adapt code for dataprep microservice refactor (#1408 ) https://github.com/opea-project/GenAIComps/pull/1153 Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2025-01-20 20:37:03 +08:00
xiguiw	2d5898244c	Enchance health check in GenAIExample docker-compose (#1410 ) Fix service launch issue 1. Update Gaudi TGI image from 2.0.6 to 2.3.1 2. Change the hpu-gaudi TGI health check condition. Signed-off-by: Wang, Xigui <xigui.wang@intel.com>	2025-01-20 20:13:13 +08:00
Neo Zhang Jianyu	59722d2bc9	[Bug] Enhance the template (#1396 ) Enhance the bug & feature template according to the issue #1002. Co-authored-by: ZhangJianyu <zhang.jianyu@outlook.com>	2025-01-20 17:56:14 +08:00
chen, suyue	6bfd156573	Clean up test scripts and enhance git clone (#1417 ) 1. Clean up test code in scripts. 2. Simplify git clone code. 3. Replace git clone branch in Dockerfile. Signed-off-by: chensuyue <suyue.chen@intel.com>	2025-01-20 16:34:28 +08:00
XinyuYe-Intel	528770a8d7	Add UT for Text2Image on Gaudi (#1424 ) Add UT for Text2Image on Gaudi. #1421 Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>	2025-01-20 16:01:35 +08:00
chen, suyue	239995da16	Update DocIndexRetriever CI test scripts (#1416 ) 1. Add image build condition. 2. Update single branch clone. Signed-off-by: chensuyue <suyue.chen@intel.com>	2025-01-20 11:16:38 +08:00
chen, suyue	f65e8d8668	Add port 5000 checking and warning (#1414 ) Port 5000 is used by local docker registry, please DO NOT use it in docker compose deployment!!! Signed-off-by: chensuyue <suyue.chen@intel.com>	2025-01-20 09:09:31 +08:00
chen, suyue	a49a36cebc	Add secrets OPENAI_API_KEY (#1412 ) Add secrets OPENAI_API_KEY for AMD GPU CI test. Signed-off-by: chensuyue <suyue.chen@intel.com>	2025-01-19 19:39:45 +08:00
Wang, Kai Lawrence	742cb6ddd3	[ChatQnA] Switch to vLLM as default llm backend on Xeon (#1403 ) Switching from TGI to vLLM as the default LLM serving backend on Xeon for the ChatQnA example to enhance the perf. https://github.com/opea-project/GenAIExamples/issues/1213 Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>	2025-01-17 20:48:19 +08:00
Wang, Kai Lawrence	00e9da9ced	[ChatQnA] Switch to vLLM as default llm backend on Gaudi (#1404 ) Switching from TGI to vLLM as the default LLM serving backend on Gaudi for the ChatQnA example to enhance the perf. https://github.com/opea-project/GenAIExamples/issues/1213 Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>	2025-01-17 20:46:38 +08:00
chyundunovDatamonsters	277222a922	General README.md - add deploy on AMD info (#1409 ) Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com> Co-authored-by: Chingis Yundunov <YundunovCN@sibedge.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2025-01-17 20:26:59 +08:00
lkk	5c68effc9f	update agent example for the GenAIComps changes. (#1407 ) Update build.yaml and compose_vllm.yaml because of refactoring of GenAIComps. Fix issue left by https://github.com/opea-project/GenAIExamples/pull/1353	2025-01-17 11:29:11 +08:00
XinyaoWa	39409d7f61	Align OpenAI API for FaqGen, DocSum (#1401 ) Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>	2025-01-17 11:19:35 +08:00
XinyaoWa	71e3c57366	Standardize name for LLM comps (#1402 ) Update all the names for classes and files in llm comps to follow the standard format, related GenAIComps PR opea-project/GenAIComps#1162 Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>	2025-01-16 23:10:27 +08:00
Letong Han	5ad24af2ee	Fix Vectorestores Path Issue of Refactor (#1399 ) Fix vectorestores path issue caused by refactor in PR opea-project/GenAIComps#1159. Modify docker image name and file path in docker_images_list.md. Signed-off-by: letonghan <letong.han@intel.com>	2025-01-16 19:50:59 +08:00
WenjiaoYue	3a9a24a51a	Agent ui (#1389 ) Signed-off-by: WenjiaoYue <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> Co-authored-by: WenjiaoYue <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> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2025-01-16 18:47:46 +08:00
XinyaoWa	301b5e9a69	Fix vllm hpu to a stable release (#1398 ) Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>	2025-01-16 16:35:32 +08:00
Yao Qing	b4269d6c4f	Modify the corresponding path based on the refactor of chathistory in GenAIComps. (#1397 ) GenAIComps has refactored chathistory based on E-RAG code structure. Related path in GenAIExample have been modified. Fix GenAIComps Issue https://github.com/opea-project/GenAIComps/issues/989 Signed-off-by: Yao, Qing <qing.yao@intel.com>	2025-01-16 14:26:17 +08:00

1 2 3 4 5 ...

870 Commits