Wang, Kai Lawrence
8fe19291c8
[AudioQnA] Enable vLLM and set it as default LLM serving ( #1657 )
...
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-03-14 09:56:33 +08:00
ZePan110
6723395e31
Update compose.yaml ( #1620 )
...
Update compose.yaml for AudioQnA, DBQnA, DocIndexRetriever, FaqGen, Translation and VisualQnA.
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-03-07 09:20:08 +08:00
Wang, Kai Lawrence
2dfcfa0436
[AudioQnA] Fix the LLM model field for inputs alignment ( #1611 )
...
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com >
2025-03-05 22:15:07 +08:00
ZePan110
e4de76da78
Use model cache for docker compose test ( #1582 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-03-04 09:48:27 +08:00
xiguiw
2d5898244c
Enchance health check in GenAIExample docker-compose ( #1410 )
...
Fix service launch issue
1. Update Gaudi TGI image from 2.0.6 to 2.3.1
2. Change the hpu-gaudi TGI health check condition.
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
2025-01-20 20:13:13 +08:00
Sihan Chen
cc1d97f816
Refactor AudioQnA/MultiModalQnA/AvatarChatbot ( #1310 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: chensuyue <suyue.chen@intel.com >
2024-12-31 12:47:30 +08:00
WenjiaoYue
f5c08d4fbb
Update audioQnA compose ( #1227 )
...
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
2024-12-05 16:23:47 +08:00
lvliang-intel
0306c620b5
Update TGI CPU image to latest official release 2.4.0 ( #1035 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-04 11:28:43 +08:00
XinyaoWa
d73129cbf0
Refactor folder to support different vendors ( #743 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
Signed-off-by: chensuyue <suyue.chen@intel.com >
2024-09-10 23:27:19 +08:00