Ed Lee @ Intel
e9153b82bb
Updated SearchQnA to use nginx like ChatQnA ( #1769 )
...
Signed-off-by: Ed Lee <16417837+edlee123@users.noreply.github.com >
2025-05-20 14:15:46 +08:00
ZePan110
11b04b38db
Integrate SearchQnA set_env to ut scripts. ( #1950 )
...
Integrate SearchQnA set_env to ut scripts.
Add README.md for UT scripts.
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-05-16 15:09:07 +08:00
Artem Astafev
ccc145ea1a
Refine README.MD for SearchQnA on AMD ROCm platform ( #1876 )
...
Signed-off-by: Artem Astafev <a.astafev@datamonsters.com >
2025-04-25 10:16:03 +08:00
WenjiaoYue
52c4db2fc6
[ SearchQnA ] Refine documents ( #1803 )
...
Signed-off-by: WenjiaoYue <wenjiao.yue@intel.com >
2025-04-21 17:16:41 +08:00
xiguiw
4fc19c7d73
Update TEI docker images to CPU-1.6 ( #1791 )
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
2025-04-17 15:00:06 +08:00
Xiaotian Chen
1bd56af994
Update TGI image versions ( #1625 )
...
Signed-off-by: xiaotia3 <xiaotian.chen@intel.com >
2025-04-01 11:27:51 +08:00
chyundunovDatamonsters
853f1302af
Adding files to deploy SearchQnA application on ROCm vLLM ( #1649 )
...
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com >
Signed-off-by: Artem Astafev <a.astafev@datamonsters.com >
2025-03-31 17:51:51 +08:00
xiguiw
87baeb833d
Update TEI docker image to 1.6 ( #1650 )
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
2025-03-27 09:40:22 +08:00
ZePan110
428ba481b2
Update compose.yaml for SearchQnA ( #1622 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-03-07 08:38:59 +08:00
ZePan110
24cacaaa48
Enable SearchQnA model cache for docker compose test. ( #1606 )
...
Enable SearchQnA model cache for docker compose test.
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-03-05 17:13:24 +08:00
chen, suyue
81b02bb947
Revert "HUGGINGFACEHUB_API_TOKEN environment is change to HF_TOKEN (#… ( #1521 )
...
Revert this PR since the test is not triggered properly due to the false merge of a WIP CI PR, 44a689b0bf , which block the CI test.
This change will be submitted in another PR.
2025-02-11 18:36:12 +08:00
xiguiw
45d5da2ddd
HUGGINGFACEHUB_API_TOKEN environment is change to HF_TOKEN ( #1503 )
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
2025-02-09 20:33:06 +08:00
chyundunovDatamonsters
39abef8be8
SearchQnA App - Adding files to deploy SearchQnA application on AMD GPU ( #1193 )
...
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com >
2025-01-27 10:58:55 +08:00
xiguiw
2d5898244c
Enchance health check in GenAIExample docker-compose ( #1410 )
...
Fix service launch issue
1. Update Gaudi TGI image from 2.0.6 to 2.3.1
2. Change the hpu-gaudi TGI health check condition.
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
2025-01-20 20:13:13 +08:00
Sihan Chen
5128c2d650
Refactor web retrievers links ( #1338 )
2025-01-08 16:19:50 +08:00
WenjiaoYue
9970605460
Adapt refactor comps ( #1340 )
...
Signed-off-by: WenjiaoYue
2025-01-08 10:36:24 +08:00
ZePan110
ed2b8ed983
Exclude dockerfile under tests and exclude check Dockerfile under tests. ( #1354 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-01-07 09:05:01 +08:00
ZePan110
aa5c91d7ee
Check duplicated dockerfile ( #1289 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-01-06 17:30:12 +08:00
XinyaoWa
464e2d3125
Rename streaming to stream to align with OpenAI API ( #1332 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
2025-01-06 13:25:55 +08:00
chen, suyue
5c7a5bd850
Update Code and README for GenAIComps Refactor ( #1285 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Signed-off-by: chensuyue <suyue.chen@intel.com >
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
Signed-off-by: letonghan <letong.han@intel.com >
Signed-off-by: ZePan110 <ze.pan@intel.com >
Signed-off-by: WenjiaoYue <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>
2025-01-02 20:03:26 +08:00
Louie Tsai
152adf8012
maintain a version info for docker_compose yaml files among release ( #1141 )
...
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
2024-11-17 22:39:41 -08:00
Letong Han
39f68d5d6b
Fix SearchQnA CI Issue ( #1134 )
...
Signed-off-by: letonghan <letong.han@intel.com >
2024-11-15 10:01:27 +08:00
lvliang-intel
9ff7df9202
Use fixed version of TEI Gaudi for stability ( #1101 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: Malini Bhandaru <malini.bhandaru@intel.com >
2024-11-13 10:45:50 -08:00
lvliang-intel
1ff85f6a85
Upgrade TGI Gaudi version to v2.0.6 ( #1088 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: chen, suyue <suyue.chen@intel.com >
2024-11-12 14:38:22 +08:00
xiguiw
a0921f127f
[Doc] Fix broken build instruction ( #1063 )
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
2024-11-05 13:35:12 +08:00
lvliang-intel
0306c620b5
Update TGI CPU image to latest official release 2.4.0 ( #1035 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-04 11:28:43 +08:00
lvliang-intel
3fb60608b3
Use official tei gaudi image and update tgi gaudi version ( #810 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-23 17:52:56 +08:00
XinyaoWa
2f03a3a894
Align parameters for "max_token, repetition_penalty,presence_penalty,frequency_penalty" ( #726 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-19 14:15:25 +08:00
lvliang-intel
bceacdc804
Fix README issues ( #817 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-18 09:50:17 +08:00
XinyaoWa
264759d85a
fix path bug for reorg ( #801 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
2024-09-12 17:52:06 +08:00
xiguiw
5c67204734
Update SearchQnA document and compose.yaml ( #774 )
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-11 15:39:07 +08:00
XinyaoWa
d73129cbf0
Refactor folder to support different vendors ( #743 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
Signed-off-by: chensuyue <suyue.chen@intel.com >
2024-09-10 23:27:19 +08:00