ZePan110
05011ebaac
Integrate AudioQnA set_env to ut scripts. (#1897)
Signed-off-by: ZePan110 <ze.pan@intel.com>
2025-05-08 09:14:44 +08:00
Spycsh
608dc963c9
Refine readme of AudioQnA (#1804)
2025-04-21 17:30:14 +08:00
Spycsh
27813b3bf9
add AudioQnA key parameters to comply with the image size reduction (#1833)
2025-04-20 16:34:19 +08:00
ZePan110
5f4b3a6d12
Adaptation to vllm v0.8.3 build paths (#1761)
Signed-off-by: ZePan110 <ze.pan@intel.com>
2025-04-09 13:20:02 +08:00
Spycsh
d4952d1e7c
Refine third parties links (#1764)
Signed-off-by: Spycsh <sihan.chen@intel.com>
2025-04-08 18:39:13 +08:00
ZePan110
42735d0d7d
Fix vllm and vllm-fork tags (#1766)
Signed-off-by: ZePan110 <ze.pan@intel.com>
2025-04-07 22:58:50 +08:00
chyundunovDatamonsters
60591d8d56
Adding files to deploy AudioQnA application on ROCm vLLM (#1655)
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com>
Signed-off-by: Artem Astafev <a.astafev@datamonsters.com>
Co-authored-by: Chingis Yundunov <YundunovCN@sibedge.com>
Co-authored-by: Artem Astafev <a.astafev@datamonsters.com>
2025-03-24 10:03:37 +08:00
Wang, Kai Lawrence
8fe19291c8
[AudioQnA] Enable vLLM and set it as default LLM serving (#1657)
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-03-14 09:56:33 +08:00
ZePan110
6723395e31
Update compose.yaml (#1620)
Update compose.yaml for AudioQnA, DBQnA, DocIndexRetriever, FaqGen, Translation and VisualQnA.
Signed-off-by: ZePan110 <ze.pan@intel.com>
2025-03-07 09:20:08 +08:00
Wang, Kai Lawrence
2dfcfa0436
[AudioQnA] Fix the LLM model field for inputs alignment (#1611)
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
2025-03-05 22:15:07 +08:00
ZePan110
e4de76da78
Use model cache for docker compose test (#1582)
Signed-off-by: ZePan110 <ze.pan@intel.com>
2025-03-04 09:48:27 +08:00
Spycsh
9f36e84c1c
Refactor AudioQnA README (#1508)
Signed-off-by: Spycsh <sihan.chen@intel.com>
2025-02-15 11:30:16 +08:00
chen, suyue
81b02bb947
Revert "HUGGINGFACEHUB_API_TOKEN environment is change to HF_TOKEN (#… (#1521)
Revert this PR since the test is not triggered properly due to the false merge of a WIP CI PR, 44a689b0bf, which blocks the CI test.
This change will be submitted in another PR.
2025-02-11 18:36:12 +08:00
xiguiw
45d5da2ddd
HUGGINGFACEHUB_API_TOKEN environment is change to HF_TOKEN (#1503)
Signed-off-by: Wang, Xigui <xigui.wang@intel.com>
2025-02-09 20:33:06 +08:00
xiguiw
2d5898244c
Enhance health check in GenAIExample docker-compose (#1410)
Fix service launch issue
1. Update Gaudi TGI image from 2.0.6 to 2.3.1
2. Change the hpu-gaudi TGI health check condition.
Signed-off-by: Wang, Xigui <xigui.wang@intel.com>
2025-01-20 20:13:13 +08:00
Sihan Chen
cc1d97f816
Refactor AudioQnA/MultiModalQnA/AvatarChatbot (#1310)
Signed-off-by: chensuyue <suyue.chen@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: chensuyue <suyue.chen@intel.com>
2024-12-31 12:47:30 +08:00
WenjiaoYue
f5c08d4fbb
Update audioQnA compose (#1227)
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
2024-12-05 16:23:47 +08:00
Louie Tsai
152adf8012
maintain a version info for docker_compose yaml files among release (#1141)
Signed-off-by: Tsai, Louie <louie.tsai@intel.com>
2024-11-17 22:39:41 -08:00
lvliang-intel
0306c620b5
Update TGI CPU image to latest official release 2.4.0 (#1035)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-04 11:28:43 +08:00
Dina Suehiro Jones
184e9a43b8
Update AudioQnA README to add a couple usage details (#948)
Signed-off-by: Dina Suehiro Jones <dina.s.jones@intel.com>
Co-authored-by: Sihan Chen <39623753+Spycsh@users.noreply.github.com>
2024-10-21 10:22:22 +08:00
Sihan Chen
658867fce4
Add multi-language AudioQnA on Xeon (#982)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-21 09:58:14 +08:00
XinyaoWa
2f03a3a894
Align parameters for "max_token, repetition_penalty,presence_penalty,frequency_penalty" (#726)
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-19 14:15:25 +08:00
XinyaoWa
d73129cbf0
Refactor folder to support different vendors (#743)
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Signed-off-by: chensuyue <suyue.chen@intel.com>
2024-09-10 23:27:19 +08:00