Eze Lanza (Eze)
fba0de45d2
ChatQnA Docker compose file for Milvus as vdb ( #1548 )
...
Signed-off-by: Ezequiel Lanza <ezequiel.lanza@gmail.com >
Signed-off-by: Kendall González León <kendall.gonzalez.leon@intel.com >
Signed-off-by: chensuyue <suyue.chen@intel.com >
Signed-off-by: Spycsh <sihan.chen@intel.com >
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
Signed-off-by: ZePan110 <ze.pan@intel.com >
Signed-off-by: dependabot[bot] <support@github.com >
Signed-off-by: minmin-intel <minmin.hou@intel.com >
Signed-off-by: Artem Astafev <a.astafev@datamonsters.com >
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
Signed-off-by: Cathy Zhang <cathy.zhang@intel.com >
Signed-off-by: letonghan <letong.han@intel.com >
Signed-off-by: alexsin368 <alex.sin@intel.com >
Signed-off-by: WenjiaoYue <wenjiao.yue@intel.com >
Co-authored-by: Ezequiel Lanza <emlanza@CDQ242RKJDmac.local >
Co-authored-by: Kendall González León <kendallgonzalez@hotmail.es >
Co-authored-by: chen, suyue <suyue.chen@intel.com >
Co-authored-by: Spycsh <39623753+Spycsh@users.noreply.github.com >
Co-authored-by: xiguiw <111278656+xiguiw@users.noreply.github.com >
Co-authored-by: jotpalch <49465120+jotpalch@users.noreply.github.com >
Co-authored-by: ZePan110 <ze.pan@intel.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: minmin-intel <minmin.hou@intel.com >
Co-authored-by: Ying Hu <ying.hu@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Eero Tamminen <eero.t.tamminen@intel.com >
Co-authored-by: Liang Lv <liang1.lv@intel.com >
Co-authored-by: Artem Astafev <a.astafev@datamonsters.com >
Co-authored-by: XinyaoWa <xinyao.wang@intel.com >
Co-authored-by: alexsin368 <109180236+alexsin368@users.noreply.github.com >
Co-authored-by: WenjiaoYue <wenjiao.yue@intel.com >
2025-02-28 22:40:31 +08:00
WenjiaoYue
f2a5644d9c
fix click example button issue ( #1586 )
...
Signed-off-by: WenjiaoYue <wenjiao.yue@intel.com >
2025-02-28 16:10:58 +08:00
alexsin368
6cd7827365
Top level README: add link to github.io documentation ( #1584 )
...
Signed-off-by: alexsin368 <alex.sin@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-02-28 13:43:43 +08:00
chen, suyue
3d8009aa91
Fix benchmark scripts ( #1517 )
...
- Align benchmark default config:
1. Update default helm charts version.
2. Add `# mandatory` comment.
3. Update default model ID for LLM.
- Fix deploy issue:
1. Support different `replicaCount` for w/ w/o rerank test.
2. Add `max_num_seqs` for vllm.
3. Add resource setting for tune mode.
- Fix Benchmark issue:
1. Update `user_queries` and `concurrency` setting.
2. Remove invalid parameters.
3. Fix `dataset` and `prompt` setting. And dataset ingest into db.
5. Fix the benchmark hang issue with large user queries. Update `"processes": 16` will fix this issue.
6. Update the eval_path setting logical.
- Optimize benchmark readme.
- Optimize the log path to make the logs more readable.
Signed-off-by: chensuyue <suyue.chen@intel.com >
Signed-off-by: Cathy Zhang <cathy.zhang@intel.com >
Signed-off-by: letonghan <letong.han@intel.com >
2025-02-28 10:30:54 +08:00
XinyaoWa
78f8ae524d
Fix async in chatqna bug ( #1589 )
...
Algin async with comps: related PR: opea-project/GenAIComps#1300
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
2025-02-27 23:32:29 +08:00
Artem Astafev
6abf7652e8
Fix ChatQnA ROCm compose Readme file and absolute path for ROCM CI test ( #1159 )
...
Signed-off-by: Artem Astafev <a.astafev@datamonsters.com >
2025-02-27 15:26:45 +08:00
Spycsh
25c1aefc27
Align mongo related image names with comps ( #1543 )
...
- chathistory-mongo-server -> chathistory-mongo (except container names)
- feedbackmanagement -> feedbackmanagement-mongo
- promptregistry-server/promptregistry-mongo-server -> promptregistry-mongo (except container names)
Signed-off-by: Spycsh <sihan.chen@intel.com >
2025-02-27 09:25:49 +08:00
dependabot[bot]
d46df4331d
Bump gradio from 5.5.0 to 5.11.0 in /DocSum/ui/gradio ( #1576 )
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Liang Lv <liang1.lv@intel.com >
2025-02-25 14:32:03 +08:00
Eero Tamminen
23a77df302
Fix "OpenAI" & "response" spelling ( #1561 )
2025-02-25 12:45:21 +08:00
Ying Hu
852bc7027c
Update README.md of AIPC quick start ( #1578 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-02-23 17:38:27 +08:00
minmin-intel
a7eced4161
Update AgentQnA and DocIndexRetriever ( #1564 )
...
Signed-off-by: minmin-intel <minmin.hou@intel.com >
2025-02-22 09:51:26 +08:00
ZePan110
caec354324
Fix trivy issue ( #1569 )
...
Fix docker image security issue
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-02-20 14:41:52 +08:00
xiguiw
d482554a6b
Fix mismatched environment variable ( #1575 )
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
2025-02-19 19:24:10 +08:00
xiguiw
2ae6871fc5
Simplify ChatQnA AIPC user setting ( #1573 )
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
2025-02-19 16:30:02 +08:00
dependabot[bot]
2ac5be9921
Bump gradio from 5.5.0 to 5.11.0 in /MultimodalQnA/ui/gradio ( #1391 )
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com >
2025-02-19 15:58:46 +08:00
ZePan110
799881a3fa
Remove perf test code from test scripts. ( #1510 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-02-18 16:23:49 +08:00
jotpalch
e5c6418c81
Fix minor typo in README ( #1559 )
...
Change Docker Compost<br/>Deployment on ROCm to Docker Compose<br/>Deployment on ROCm
2025-02-17 12:07:31 +08:00
xiguiw
0c0edffc5b
update vLLM CPU to the latest stable version ( #1546 )
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
Co-authored-by: chen, suyue <suyue.chen@intel.com >
2025-02-17 08:26:25 +08:00
Spycsh
9f36e84c1c
Refactor AudioQnA README ( #1508 )
...
Signed-off-by: Spycsh <sihan.chen@intel.com >
2025-02-15 11:30:16 +08:00
chen, suyue
8c547c2ba5
Expand CI test scope for common test scripts ( #1554 )
...
Expand CI test scope, trigger all hw test when the common test scripts changed.
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-02-14 18:17:03 +08:00
Kendall González León
80dd86f122
Make a fix in the main README.md of the ChatQnA. ( #1551 )
...
Signed-off-by: Kendall González León <kendall.gonzalez.leon@intel.com >
2025-02-14 17:00:44 +08:00
ZePan110
6d781f7b2b
Fix CICD workflow strategy running condition ( #1533 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-02-13 16:10:00 +08:00
WenjiaoYue
abafd5de20
Update UI of the three demos: faqGen, VisualQnA, and DocSum. ( #1528 )
...
Signed-off-by: WenjiaoYue <wenjiao.yue@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-02-12 15:57:51 +08:00
Louie Tsai
970b869838
Add a new section to change LLM model such as deepseek based on validated model table in LLM microservice ( #1501 )
...
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
Co-authored-by: Wang, Kai Lawrence <109344418+wangkl2@users.noreply.github.com >
Co-authored-by: xiguiw <111278656+xiguiw@users.noreply.github.com >
2025-02-12 09:34:56 +08:00
XinyaoWa
87ff149f61
Remove vllm hpu triton version fix ( #1515 )
...
vllm-fork has fix triton version issue, remove duplicated code https://github.com/HabanaAI/vllm-fork/blob/habana_main/requirements-hpu.txt
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
Co-authored-by: chen, suyue <suyue.chen@intel.com >
2025-02-12 09:24:38 +08:00
chen, suyue
c39a569ab2
Update workflow condition and env ( #1522 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-02-12 09:08:22 +08:00
chen, suyue
81b02bb947
Revert "HUGGINGFACEHUB_API_TOKEN environment is change to HF_TOKEN (#… ( #1521 )
...
Revert this PR since the test is not triggered properly due to the false merge of a WIP CI PR, 44a689b0bf , which block the CI test.
This change will be submitted in another PR.
2025-02-11 18:36:12 +08:00
Louie Tsai
47069ac70c
fix a test script issue due to name change for telemetry yaml files ( #1516 )
...
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
2025-02-11 17:58:42 +08:00
chen, suyue
6ce7730863
Update CI/CD workflow ( #1520 )
...
1. Update auto commit account.
2. Fix test condition.
Signed-off-by: chensuyue <suyue.chen@intel.com >
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-02-11 17:48:37 +08:00
Louie Tsai
ad5523bac7
Enable OpenTelemtry Tracing for ChatQnA on Xeon and Gaudi by docker compose merge feature ( #1488 )
...
Signed-off-by: Louie, Tsai <louie.tsai@intel.com >
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
2025-02-10 22:58:50 -08:00
Louie Tsai
88a8235f21
Update README.md for Agent UI ( #1495 )
...
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
2025-02-10 22:22:55 -08:00
ZePan110
63ad850052
Update docker image list ( #1513 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-02-11 13:18:22 +08:00
ZePan110
9a0c547112
Fix publish issue ( #1514 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-02-11 11:43:00 +08:00
ZePan110
26a6da4123
Fix nightly triggered exceptions ( #1505 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-02-10 16:51:34 +08:00
xiguiw
45d5da2ddd
HUGGINGFACEHUB_API_TOKEN environment is change to HF_TOKEN ( #1503 )
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
2025-02-09 20:33:06 +08:00
xiguiw
1b3291a1c8
Fix docker compose.yaml error ( #1496 )
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
2025-02-07 09:53:20 +08:00
ZePan110
7ac8cf517a
Restore test code. ( #1502 )
...
Remove nightly test code.
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-02-07 09:50:21 +08:00
ZePan110
44a689b0bf
Fix null value_file judgment ( #1470 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
Co-authored-by: Malini Bhandaru <malini.bhandaru@intel.com >
2025-02-06 17:09:01 +08:00
xiguiw
388d3eb5c5
[Doc] Clean empty document ( #1497 )
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-02-06 10:53:25 +08:00
chyundunovDatamonsters
ef9ad61440
DBQnA - Adding files to deploy DBQnA application on AMD GPU ( #1273 )
...
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com >
Co-authored-by: Chingis Yundunov <YundunovCN@sibedge.com >
Co-authored-by: Malini Bhandaru <malini.bhandaru@intel.com >
2025-02-06 09:41:59 +08:00
Louie Tsai
4c41a5db83
Update README.md for OPEA OTLP tracing ( #1406 )
...
Signed-off-by: louie-tsai <louie.tsai@intel.com >
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
Co-authored-by: Eero Tamminen <eero.t.tamminen@intel.com >
2025-02-05 13:03:15 -08:00
Liang Lv
9adf7a6af0
Add support for latest deepseek models on Gaudi ( #1491 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2025-02-05 08:30:04 +08:00
chen, suyue
a4d028e8ea
update image release workflow ( #1303 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
Co-authored-by: Malini Bhandaru <malini.bhandaru@intel.com >
2025-02-03 17:07:07 -08:00
Omar Khleif
32d4f714fd
Fix for NLTK related import failure ( #1487 )
...
Signed-off-by: okhleif-IL <omar.khleif@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-02-01 10:04:37 +08:00
chyundunovDatamonsters
fdbc27a9b5
AvatarChatbot - Adding files to deploy AvatarChatbot application on AMD GPU ( #1288 )
...
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com >
2025-01-27 11:30:52 +08:00
XinyuYe-Intel
5f4b1828a5
Added UT for rerank finetuning on Gaudi ( #1472 )
...
Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com >
2025-01-27 11:24:05 +08:00
chyundunovDatamonsters
39abef8be8
SearchQnA App - Adding files to deploy SearchQnA application on AMD GPU ( #1193 )
...
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com >
2025-01-27 10:58:55 +08:00
bjzhjing
ed163087ba
Provide unified scalable deployment and benchmarking support for exam… ( #1315 )
...
Signed-off-by: Cathy Zhang <cathy.zhang@intel.com >
Signed-off-by: letonghan <letong.han@intel.com >
Co-authored-by: letonghan <letong.han@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-01-24 22:27:49 +08:00
chen, suyue
259099d19f
Remove kubernetes manifest related code and tests ( #1466 )
...
Remove deprecated kubernetes manifest related code and tests.
k8s implementation for those examples based on helm charts will target for next release.
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-01-24 15:23:12 +08:00
chen, suyue
9a1118730b
Freeze the triton version in vllm-gaudi image to 3.1.0 ( #1463 )
...
The new triton version 3.2.0 can't work with vllm-gaudi. Freeze the triton version in vllm-gaudi image to 3.1.0.
Issue create for vllm-fork: HabanaAI/vllm-fork#732
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-01-24 09:50:59 +08:00