Liang Lv
d953332f43
Fix multimodal docker image issue for MutimodalQnA on Gaudi ( #1455 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2025-01-23 00:12:06 +08:00
chyundunovDatamonsters
cbe5805f47
AgentQnA - add README file for deploy on ROCm ( #1379 )
...
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com >
2025-01-22 21:57:15 +08:00
Ervin Castelino
27fdbcab58
[chore/chatqna] Missing protocol in curl command ( #1447 )
...
This PR fixes the missing protocol for in the curl command mentioned in chatqna readme for tei-embedding-service.
2025-01-22 21:41:47 +08:00
lkk
f07cf1dad2
Fix wrong vllm repo. ( #1454 )
...
Use vllm-fork for gaudi.
fix the issue #1451
2025-01-22 21:22:56 +08:00
dolpher
ee0e5cc8d9
Sync value files from GenAIInfra ( #1428 )
...
All gaudi values updated with extra flags.
Added helm support for 2 new examples Text2Image and SearchQnA. Minor fix for llm-uservice.
Signed-off-by: Dolpher Du <dolpher.du@intel.com >
2025-01-22 17:44:11 +08:00
chen, suyue
5c36443b11
Use local hub cache for AgentQnA test ( #1450 )
...
Use local hub cache for AgentQnA test to save workspace.
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-01-22 13:23:00 +08:00
Lianhao Lu
62cea74a23
CI: improve helm CI ( #1452 )
...
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com >
2025-01-22 09:18:35 +08:00
WenjiaoYue
b721c256f9
Fix Domain Access Issue in Latest Vite Version ( #1444 )
...
Fix the restriction on using domain names when users are using the latest version of Vite
When users use the new version of Vite, the UI cannot be accessed via domain names due to Vite's new rules. This fix adds the corresponding parameters according to Vite's new rules, ensuring that users can access the frontend via domain names when building the UI.
Fixes #1441
Co-authored-by: WenjiaoYue <wenjiao.yue@intel.com >
2025-01-21 23:28:37 +08:00
chen, suyue
927698e23e
Simplify git clone code in CI test ( #1434 )
...
1. Simplify git clone code in CI test.
2. Replace git clone branch in Dockerfile.
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-01-21 23:00:08 +08:00
ZePan110
c3e84b5ffa
Fix test matrix for helm charts ( #1449 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-01-21 22:28:31 +08:00
ZePan110
6b2a041f25
Fix Helm-chart workflow issues. ( #1448 )
...
Fix matrix error issues and CD test files cannot be obtained.
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-01-21 21:48:57 +08:00
ZePan110
842f46326b
Switch helm-chart test runs-on label. ( #1446 )
...
Switch helm-chart test runs-on label from ${{ inputs.hardware }} to k8s-${{ inputs.hardware }}.
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-01-21 18:07:03 +08:00
Wang, Kai Lawrence
284db982be
[ROCm] Fix the hf-token setting for TGI and TEI in ChatQnA ( #1432 )
...
This PR is to correct the env variable names in chatqna example on ROCm platform passing to the docker container of TGI and TEI. For tgi, either HF_TOKEN and HUGGING_FACE_HUB_TOKEN could be parsed in TGI while HF_API_TOKEN can be parsed in TEI.
TGI: https://github.com/huggingface/text-generation-inference/blob/main/router/src/server.rs#L1700C1-L1702C15
TEI: https://github.com/huggingface/text-embeddings-inference/blob/main/router/src/main.rs#L112
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com >
2025-01-21 14:22:39 +08:00
ZePan110
fc96fe83e2
Fix CD workflow issue ( #1443 )
...
Fix the issue of CD workflow values_files errors.
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-01-21 11:54:12 +08:00
Hoong Tee, Yeoh
0316114c4b
ProductivitySuite: Fix FaqGen Microservice CI test fail ( #1437 )
...
Change in FAQGen microservice for content-type header result in CI failure.
#1431
Signed-off-by: Yeoh, Hoong Tee <hoong.tee.yeoh@intel.com >
2025-01-21 10:23:35 +08:00
chen, suyue
0408453fa2
Unify the yaml name to fix the CD workflow ( #1435 )
...
Fix the issue in #1372
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-01-21 01:10:41 +08:00
XinyaoWa
d0cd0aaf53
Update GraphRAG to be compatible with latest component changes ( #1427 )
...
- Updated ENV VARS to align with recent changes in neo4j dataprep and retriever.
- upgraded tgi-gaudi image version
Related to GenAIComps repo issue #1025 (opea-project/GenAIComps#1025 )
Original PR #1384
Original contributor is @rbrugaro
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
Co-authored-by: Liang Lv <liang1.lv@intel.com >
2025-01-21 00:18:01 +08:00
chen, suyue
0ba3decb6b
Simplify git clone code in CI test ( #1422 )
...
1. Simplify git clone code in CI test.
2. Replace git clone branch in Dockerfile.
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-01-20 23:55:20 +08:00
Wang, Kai Lawrence
3d3ac59bfb
[ChatQnA] Update the default LLM to llama3-8B on cpu/gpu/hpu ( #1430 )
...
Update the default LLM to llama3-8B on cpu/nvgpu/amdgpu/gaudi for docker-compose deployment to avoid the potential model serving issue or the missing chat-template issue using neural-chat-7b.
Slow serving issue of neural-chat-7b on ICX: #1420
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com >
2025-01-20 22:47:56 +08:00
Melanie Hart Buehler
f11ab458d8
MultimodalQnA image query, pdf, dynamic ports, and UI updates ( #1381 )
...
Per the proposed changes in this [RFC](https://github.com/opea-project/docs/blob/main/community/rfcs/24-10-02-GenAIExamples-001-Image_and_Audio_Support_in_MultimodalQnA.md )'s Phase 2 plan, this PR adds support for image queries, PDF ingestion and display, and dynamic ports. There are also some bug fixes. This PR goes with [this one in GenAIComps](https://github.com/opea-project/GenAIComps/pull/1134 ).
Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com >
Co-authored-by: Liang Lv <liang1.lv@intel.com >
2025-01-20 22:41:52 +08:00
ZePan110
f3562bef36
Add helm e2e test workflow ( #1372 )
...
Add both CICD workflow for helm charts values test.
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-01-20 21:04:11 +08:00
chen, suyue
7a54064d65
remove Dockerfile.wrapper ( #1429 )
...
Remove Dockerfile.wrapper, it's not used anymore and no test cover this Dockerfile. So remove this Dockerfile to avoid regression.
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-01-20 20:49:18 +08:00
Liang Lv
0f7e5a37ac
Adapt code for dataprep microservice refactor ( #1408 )
...
https://github.com/opea-project/GenAIComps/pull/1153
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2025-01-20 20:37:03 +08:00
xiguiw
2d5898244c
Enchance health check in GenAIExample docker-compose ( #1410 )
...
Fix service launch issue
1. Update Gaudi TGI image from 2.0.6 to 2.3.1
2. Change the hpu-gaudi TGI health check condition.
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
2025-01-20 20:13:13 +08:00
Neo Zhang Jianyu
59722d2bc9
[Bug] Enhance the template ( #1396 )
...
Enhance the bug & feature template according to the issue #1002 .
Co-authored-by: ZhangJianyu <zhang.jianyu@outlook.com >
2025-01-20 17:56:14 +08:00
chen, suyue
6bfd156573
Clean up test scripts and enhance git clone ( #1417 )
...
1. Clean up test code in scripts.
2. Simplify git clone code.
3. Replace git clone branch in Dockerfile.
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-01-20 16:34:28 +08:00
XinyuYe-Intel
528770a8d7
Add UT for Text2Image on Gaudi ( #1424 )
...
Add UT for Text2Image on Gaudi.
#1421
Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com >
2025-01-20 16:01:35 +08:00
chen, suyue
239995da16
Update DocIndexRetriever CI test scripts ( #1416 )
...
1. Add image build condition.
2. Update single branch clone.
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-01-20 11:16:38 +08:00
chen, suyue
f65e8d8668
Add port 5000 checking and warning ( #1414 )
...
Port 5000 is used by local docker registry, please DO NOT use it in docker compose deployment!!!
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-01-20 09:09:31 +08:00
chen, suyue
a49a36cebc
Add secrets OPENAI_API_KEY ( #1412 )
...
Add secrets OPENAI_API_KEY for AMD GPU CI test.
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-01-19 19:39:45 +08:00
Wang, Kai Lawrence
742cb6ddd3
[ChatQnA] Switch to vLLM as default llm backend on Xeon ( #1403 )
...
Switching from TGI to vLLM as the default LLM serving backend on Xeon for the ChatQnA example to enhance the perf.
https://github.com/opea-project/GenAIExamples/issues/1213
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com >
2025-01-17 20:48:19 +08:00
Wang, Kai Lawrence
00e9da9ced
[ChatQnA] Switch to vLLM as default llm backend on Gaudi ( #1404 )
...
Switching from TGI to vLLM as the default LLM serving backend on Gaudi for the ChatQnA example to enhance the perf.
https://github.com/opea-project/GenAIExamples/issues/1213
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com >
2025-01-17 20:46:38 +08:00
chyundunovDatamonsters
277222a922
General README.md - add deploy on AMD info ( #1409 )
...
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com >
Co-authored-by: Chingis Yundunov <YundunovCN@sibedge.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-01-17 20:26:59 +08:00
lkk
5c68effc9f
update agent example for the GenAIComps changes. ( #1407 )
...
Update build.yaml and compose_vllm.yaml because of refactoring of GenAIComps.
Fix issue left by https://github.com/opea-project/GenAIExamples/pull/1353
2025-01-17 11:29:11 +08:00
XinyaoWa
39409d7f61
Align OpenAI API for FaqGen, DocSum ( #1401 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
2025-01-17 11:19:35 +08:00
XinyaoWa
71e3c57366
Standardize name for LLM comps ( #1402 )
...
Update all the names for classes and files in llm comps to follow the standard format, related GenAIComps PR opea-project/GenAIComps#1162
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
2025-01-16 23:10:27 +08:00
Letong Han
5ad24af2ee
Fix Vectorestores Path Issue of Refactor ( #1399 )
...
Fix vectorestores path issue caused by refactor in PR opea-project/GenAIComps#1159 .
Modify docker image name and file path in docker_images_list.md.
Signed-off-by: letonghan <letong.han@intel.com >
2025-01-16 19:50:59 +08:00
WenjiaoYue
3a9a24a51a
Agent ui ( #1389 )
...
Signed-off-by: WenjiaoYue <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>
Co-authored-by: WenjiaoYue <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>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-01-16 18:47:46 +08:00
XinyaoWa
301b5e9a69
Fix vllm hpu to a stable release ( #1398 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
2025-01-16 16:35:32 +08:00
Yao Qing
b4269d6c4f
Modify the corresponding path based on the refactor of chathistory in GenAIComps. ( #1397 )
...
GenAIComps has refactored chathistory based on E-RAG code structure. Related path in GenAIExample have been modified.
Fix GenAIComps Issue https://github.com/opea-project/GenAIComps/issues/989
Signed-off-by: Yao, Qing <qing.yao@intel.com >
2025-01-16 14:26:17 +08:00
Letong Han
4cabd55778
Refactor Retrievers related Examples ( #1387 )
...
Delete redundant retrievers docker image in docker_images_list.md.
Refactor Retrievers related Examples READMEs.
Change all of the comps/retrievers/xxx/xxx/Dockerfile path into comps/retrievers/src/Dockerfile.
Fix the Examples CI issues of PR opea-project/GenAIComps#1138 .
Signed-off-by: letonghan <letong.han@intel.com >
2025-01-16 14:21:48 +08:00
xiguiw
698a06edbf
[DOC] Fix document issue ( #1395 )
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
2025-01-16 11:30:07 +08:00
Eero Tamminen
0eae391fda
Use staged builds to minimize final image sizes ( #1031 )
...
Staged image builds so that final images do not have redundant things like:
- Git tool and its deps
- Git repo history
- Test directories
Fixes : #225
Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com >
2025-01-16 11:14:47 +08:00
XinyaoWa
23d885bf60
Refactor vllm openvino to third parties ( #1388 )
...
vllm-openvino is a dependency for text generation comps, in GenAIComps PR opea-project/GenAIComps#1141 we move it to third-parties folder, update the path accordingly.
#998
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
2025-01-16 10:07:56 +08:00
minmin-intel
287f03a834
Add SQL agent to AgentQnA ( #1370 )
...
Signed-off-by: minmin-intel <minmin.hou@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com >
2025-01-15 09:31:13 -08:00
ZePan110
a65a1e5598
Fix CI filter issue ( #1393 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-01-15 11:39:51 +08:00
Neo Zhang Jianyu
9812c2fb45
Update check-online-doc-build.yml ( #1390 )
2025-01-15 09:07:02 +08:00
XinyaoWa
7d218b9f36
Remove vllm hpu commit id limit ( #1386 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-01-14 11:05:32 +08:00
Zhu Yongbo
ba9892f8ee
minor bug fix for EC-RAG ( #1378 )
...
Signed-off-by: Zhu, Yongbo <yongbo.zhu@intel.com >
2025-01-14 10:45:15 +08:00
XinyaoWa
ff1310b11a
Refactor docsum ( #1336 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
2025-01-13 15:49:48 +08:00