Wang, Kai Lawrence
742cb6ddd3
[ChatQnA] Switch to vLLM as default llm backend on Xeon ( #1403 )
...
Switching from TGI to vLLM as the default LLM serving backend on Xeon for the ChatQnA example to enhance the perf.
https://github.com/opea-project/GenAIExamples/issues/1213
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com >
2025-01-17 20:48:19 +08:00
Wang, Kai Lawrence
00e9da9ced
[ChatQnA] Switch to vLLM as default llm backend on Gaudi ( #1404 )
...
Switching from TGI to vLLM as the default LLM serving backend on Gaudi for the ChatQnA example to enhance the perf.
https://github.com/opea-project/GenAIExamples/issues/1213
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com >
2025-01-17 20:46:38 +08:00
chyundunovDatamonsters
277222a922
General README.md - add deploy on AMD info ( #1409 )
...
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com >
Co-authored-by: Chingis Yundunov <YundunovCN@sibedge.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-01-17 20:26:59 +08:00
lkk
5c68effc9f
update agent example for the GenAIComps changes. ( #1407 )
...
Update build.yaml and compose_vllm.yaml because of refactoring of GenAIComps.
Fix issue left by https://github.com/opea-project/GenAIExamples/pull/1353
2025-01-17 11:29:11 +08:00
XinyaoWa
39409d7f61
Align OpenAI API for FaqGen, DocSum ( #1401 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
2025-01-17 11:19:35 +08:00
XinyaoWa
71e3c57366
Standardize name for LLM comps ( #1402 )
...
Update all the names for classes and files in llm comps to follow the standard format, related GenAIComps PR opea-project/GenAIComps#1162
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
2025-01-16 23:10:27 +08:00
Letong Han
5ad24af2ee
Fix Vectorestores Path Issue of Refactor ( #1399 )
...
Fix vectorestores path issue caused by refactor in PR opea-project/GenAIComps#1159 .
Modify docker image name and file path in docker_images_list.md.
Signed-off-by: letonghan <letong.han@intel.com >
2025-01-16 19:50:59 +08:00
WenjiaoYue
3a9a24a51a
Agent ui ( #1389 )
...
Signed-off-by: WenjiaoYue <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>
Co-authored-by: WenjiaoYue <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>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-01-16 18:47:46 +08:00
XinyaoWa
301b5e9a69
Fix vllm hpu to a stable release ( #1398 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
2025-01-16 16:35:32 +08:00
Yao Qing
b4269d6c4f
Modify the corresponding path based on the refactor of chathistory in GenAIComps. ( #1397 )
...
GenAIComps has refactored chathistory based on E-RAG code structure. Related path in GenAIExample have been modified.
Fix GenAIComps Issue https://github.com/opea-project/GenAIComps/issues/989
Signed-off-by: Yao, Qing <qing.yao@intel.com >
2025-01-16 14:26:17 +08:00
Letong Han
4cabd55778
Refactor Retrievers related Examples ( #1387 )
...
Delete redundant retrievers docker image in docker_images_list.md.
Refactor Retrievers related Examples READMEs.
Change all of the comps/retrievers/xxx/xxx/Dockerfile path into comps/retrievers/src/Dockerfile.
Fix the Examples CI issues of PR opea-project/GenAIComps#1138 .
Signed-off-by: letonghan <letong.han@intel.com >
2025-01-16 14:21:48 +08:00
xiguiw
698a06edbf
[DOC] Fix document issue ( #1395 )
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
2025-01-16 11:30:07 +08:00
Eero Tamminen
0eae391fda
Use staged builds to minimize final image sizes ( #1031 )
...
Staged image builds so that final images do not have redundant things like:
- Git tool and its deps
- Git repo history
- Test directories
Fixes : #225
Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com >
2025-01-16 11:14:47 +08:00
XinyaoWa
23d885bf60
Refactor vllm openvino to third parties ( #1388 )
...
vllm-openvino is a dependency for text generation comps, in GenAIComps PR opea-project/GenAIComps#1141 we move it to third-parties folder, update the path accordingly.
#998
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
2025-01-16 10:07:56 +08:00
minmin-intel
287f03a834
Add SQL agent to AgentQnA ( #1370 )
...
Signed-off-by: minmin-intel <minmin.hou@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com >
2025-01-15 09:31:13 -08:00
ZePan110
a65a1e5598
Fix CI filter issue ( #1393 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-01-15 11:39:51 +08:00
Neo Zhang Jianyu
9812c2fb45
Update check-online-doc-build.yml ( #1390 )
2025-01-15 09:07:02 +08:00
XinyaoWa
7d218b9f36
Remove vllm hpu commit id limit ( #1386 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-01-14 11:05:32 +08:00
Zhu Yongbo
ba9892f8ee
minor bug fix for EC-RAG ( #1378 )
...
Signed-off-by: Zhu, Yongbo <yongbo.zhu@intel.com >
2025-01-14 10:45:15 +08:00
XinyaoWa
ff1310b11a
Refactor docsum ( #1336 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
2025-01-13 15:49:48 +08:00
Sihan Chen
ca15fe9bdb
Refactor lvm related examples ( #1333 )
2025-01-13 13:42:06 +08:00
XinyaoWa
f48bd8e74f
Refactor Faqgen ( #1323 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
2025-01-13 13:01:04 +08:00
Ying Hu
91ff520baa
Update README.md for add K8S cluster link for Gaudi ( #1380 )
2025-01-13 09:33:58 +08:00
Liang Lv
3ca78867eb
Update example code for embedding dependency moving to 3rd_party ( #1368 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2025-01-10 15:36:58 +08:00
Yao Qing
7a3dfa90ca
Fix for animation dockerfile path. ( #1371 )
...
Signed-off-by: Yao, Qing <qing.yao@intel.com >
Co-authored-by: chen, suyue <suyue.chen@intel.com >
2025-01-10 11:44:57 +08:00
dolpher
c795ef2203
Add helm deployment instructions for GenAIExamples ( #1373 )
...
Add helm deployment instructions for ChatQnA, AgentQnA, AudioQnA, CodeTrans, DocSum, FaqGen and VisualQnA
Signed-off-by: Dolpher Du <dolpher.du@intel.com >
2025-01-10 09:55:31 +08:00
chen, suyue
99120f4cd2
Update action token for CI ( #1374 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-01-09 17:19:07 +08:00
XinyuYe-Intel
9fe480b010
Update dockerfile path for text2image ( #1307 )
...
Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com >
2025-01-09 12:03:27 +08:00
XinyuYe-Intel
113281d073
Update path for finetuning ( #1306 )
...
Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com >
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-01-09 12:01:59 +08:00
Liang Lv
370d6928c1
Update example code for prompt registry refactor ( #1362 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2025-01-09 11:59:32 +08:00
Liang Lv
2b26450bb9
Update docker file path for feedback management refactor ( #1364 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-01-09 11:21:25 +08:00
Louie Tsai
81022355a7
Enable OpenTelemetry Tracing for ChatQnA TGI serving on Gaudi ( #1316 )
...
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
2025-01-08 17:20:13 -08:00
Jaswanth Karani
ddacb7e86d
fixed build issue ( #1367 )
2025-01-08 22:19:23 +08:00
Sihan Chen
5128c2d650
Refactor web retrievers links ( #1338 )
2025-01-08 16:19:50 +08:00
Liang Lv
b3c405a5f6
Adapt example code for guardrails refactor ( #1360 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-01-08 14:35:23 +08:00
dolpher
5638075d65
Add helm deployment instructions for codegen ( #1351 )
...
Signed-off-by: Dolpher Du <dolpher.du@intel.com >
2025-01-08 13:20:32 +08:00
chen, suyue
23117871c2
remove chatqna-conversation-ui build in CI test ( #1361 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-01-08 12:09:33 +08:00
WenjiaoYue
9970605460
Adapt refactor comps ( #1340 )
...
Signed-off-by: WenjiaoYue
2025-01-08 10:36:24 +08:00
dolpher
28206311fd
Disable GMC CI temporarily ( #1359 )
...
Signed-off-by: Dolpher Du <dolpher.du@intel.com >
2025-01-08 09:55:53 +08:00
ZePan110
589bfb2b7a
Change license template from 2024 to 2025 ( #1358 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-01-07 19:29:55 +08:00
Pranav Singh
d2b49bbc82
[ChatQNA] Fix K8s Deployment for CPU/HPU ( #1274 )
...
Signed-off-by: Pranav Singh <pranav.singh@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-01-07 13:45:09 +08:00
Ying Hu
41374d865b
Update README.md for support matrix ( #983 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: xiguiw <111278656+xiguiw@users.noreply.github.com >
2025-01-07 11:45:42 +08:00
pre-commit-ci[bot]
2c624e1f5f
[pre-commit.ci] pre-commit autoupdate ( #1356 )
...
Signed-off-by: Sun, Xuehao <xuehao.sun@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-01-07 11:13:07 +08:00
Ying Hu
00241d01d2
Update README.md for quick start guide ( #1355 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-01-07 10:08:16 +08:00
ZePan110
ed2b8ed983
Exclude dockerfile under tests and exclude check Dockerfile under tests. ( #1354 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-01-07 09:05:01 +08:00
lkk
a6e702e4d5
refine agent directories. ( #1353 )
2025-01-06 17:40:24 +08:00
ZePan110
aa5c91d7ee
Check duplicated dockerfile ( #1289 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-01-06 17:30:12 +08:00
chen, suyue
b88d09e23f
Fix code owner list ( #1352 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-01-06 14:00:13 +08:00
XinyaoWa
464e2d3125
Rename streaming to stream to align with OpenAI API ( #1332 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
2025-01-06 13:25:55 +08:00
chen, suyue
1f29eca288
fix chatqna benchmark without rerank config issue ( #1341 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-01-06 09:16:20 +08:00