Letong Han
9180f1066d
Enable vllm for CodeTrans ( #1626 )
...
Set vllm as default llm serving, and add related docker compose files, readmes, and test scripts.
Issue: https://github.com/opea-project/GenAIExamples/issues/1436
Signed-off-by: letonghan <letong.han@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-03-07 10:56:21 +08:00
ZePan110
5aecea8e47
Update compose.yaml ( #1619 )
...
Update compose.yaml for CodeGen, CodeTrans and DocSum
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-03-07 09:20:28 +08:00
ZePan110
6723395e31
Update compose.yaml ( #1620 )
...
Update compose.yaml for AudioQnA, DBQnA, DocIndexRetriever, FaqGen, Translation and VisualQnA.
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-03-07 09:20:08 +08:00
ZePan110
785ffb9a1e
Update compose.yaml for ChatQnA ( #1621 )
...
Update compose.yaml for ChatQnA
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-03-07 09:19:39 +08:00
ZePan110
428ba481b2
Update compose.yaml for SearchQnA ( #1622 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-03-07 08:38:59 +08:00
Wang, Kai Lawrence
2dfcfa0436
[AudioQnA] Fix the LLM model field for inputs alignment ( #1611 )
...
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com >
2025-03-05 22:15:07 +08:00
Zhu Yongbo
8a5ad1fc72
Fix docker image opea/edgecraftrag security issue #1577 ( #1617 )
...
Signed-off-by: Zhu, Yongbo <yongbo.zhu@intel.com >
2025-03-05 22:13:53 +08:00
ZePan110
24cacaaa48
Enable SearchQnA model cache for docker compose test. ( #1606 )
...
Enable SearchQnA model cache for docker compose test.
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-03-05 17:13:24 +08:00
ZePan110
6ead1b12db
Enable ChatQnA model cache for docker compose test. ( #1605 )
...
Enable ChatQnA model cache for docker compose test.
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-03-05 11:30:04 +08:00
rbrugaro
8dac9d1035
bugfix GraphRAG updated docker compose and env settings to fix issues post refactor ( #1567 )
...
Signed-off-by: rbrugaro <rita.brugarolas.brufau@intel.com >
Signed-off-by: Rita Brugarolas Brufau <rita.brugarolas.brufau@intel.com >
Co-authored-by: chen, suyue <suyue.chen@intel.com >
Co-authored-by: WenjiaoYue <wenjiao.yue@intel.com >
2025-03-04 09:44:13 -08:00
ZePan110
c1b5ba281f
Enable CodeGen,CodeTrans and DocSum model cache for docker compose test. ( #1599 )
...
1. Add cache path check.
2. Enable CodeGen, CodeTrans and DocSum model cache for docker compose test.
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-03-04 16:10:20 +08:00
chen, suyue
8f8d3af7c3
open chatqna frontend test ( #1594 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-03-04 10:41:22 +08:00
ZePan110
e4de76da78
Use model cache for docker compose test ( #1582 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-03-04 09:48:27 +08:00
Spycsh
ce38a84372
Revert chatqna async and enhance tests ( #1598 )
...
align with opea-project/GenAIComps#1354
2025-03-03 23:03:44 +08:00
Ying Hu
e8b07c28ec
Update DBQnA tgi docker image to latest tgi 2.4.0 ( #1593 )
2025-03-03 16:17:19 +08:00
chen, suyue
7b3a125bdf
Fix cd workflow condition ( #1588 )
...
Fix cd workflow condition
Signed-off-by: chensuyue <suyue.chen@intel.com >
Co-authored-by: ZePan110 <ze.pan@intel.com >
2025-03-03 08:45:10 +08:00
Eze Lanza (Eze)
fba0de45d2
ChatQnA Docker compose file for Milvus as vdb ( #1548 )
...
Signed-off-by: Ezequiel Lanza <ezequiel.lanza@gmail.com >
Signed-off-by: Kendall González León <kendall.gonzalez.leon@intel.com >
Signed-off-by: chensuyue <suyue.chen@intel.com >
Signed-off-by: Spycsh <sihan.chen@intel.com >
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
Signed-off-by: ZePan110 <ze.pan@intel.com >
Signed-off-by: dependabot[bot] <support@github.com >
Signed-off-by: minmin-intel <minmin.hou@intel.com >
Signed-off-by: Artem Astafev <a.astafev@datamonsters.com >
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
Signed-off-by: Cathy Zhang <cathy.zhang@intel.com >
Signed-off-by: letonghan <letong.han@intel.com >
Signed-off-by: alexsin368 <alex.sin@intel.com >
Signed-off-by: WenjiaoYue <wenjiao.yue@intel.com >
Co-authored-by: Ezequiel Lanza <emlanza@CDQ242RKJDmac.local >
Co-authored-by: Kendall González León <kendallgonzalez@hotmail.es >
Co-authored-by: chen, suyue <suyue.chen@intel.com >
Co-authored-by: Spycsh <39623753+Spycsh@users.noreply.github.com >
Co-authored-by: xiguiw <111278656+xiguiw@users.noreply.github.com >
Co-authored-by: jotpalch <49465120+jotpalch@users.noreply.github.com >
Co-authored-by: ZePan110 <ze.pan@intel.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: minmin-intel <minmin.hou@intel.com >
Co-authored-by: Ying Hu <ying.hu@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Eero Tamminen <eero.t.tamminen@intel.com >
Co-authored-by: Liang Lv <liang1.lv@intel.com >
Co-authored-by: Artem Astafev <a.astafev@datamonsters.com >
Co-authored-by: XinyaoWa <xinyao.wang@intel.com >
Co-authored-by: alexsin368 <109180236+alexsin368@users.noreply.github.com >
Co-authored-by: WenjiaoYue <wenjiao.yue@intel.com >
2025-02-28 22:40:31 +08:00
WenjiaoYue
f2a5644d9c
fix click example button issue ( #1586 )
...
Signed-off-by: WenjiaoYue <wenjiao.yue@intel.com >
2025-02-28 16:10:58 +08:00
alexsin368
6cd7827365
Top level README: add link to github.io documentation ( #1584 )
...
Signed-off-by: alexsin368 <alex.sin@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-02-28 13:43:43 +08:00
chen, suyue
3d8009aa91
Fix benchmark scripts ( #1517 )
...
- Align benchmark default config:
1. Update default helm charts version.
2. Add `# mandatory` comment.
3. Update default model ID for LLM.
- Fix deploy issue:
1. Support different `replicaCount` values for tests with and without rerank.
2. Add `max_num_seqs` for vllm.
3. Add resource setting for tune mode.
- Fix Benchmark issue:
1. Update `user_queries` and `concurrency` setting.
2. Remove invalid parameters.
3. Fix the `dataset` and `prompt` settings, and dataset ingestion into the db.
5. Fix the benchmark hang with large user queries by setting `"processes": 16`.
6. Update the eval_path setting logic.
- Optimize benchmark readme.
- Optimize the log path to make the logs more readable.
Signed-off-by: chensuyue <suyue.chen@intel.com >
Signed-off-by: Cathy Zhang <cathy.zhang@intel.com >
Signed-off-by: letonghan <letong.han@intel.com >
2025-02-28 10:30:54 +08:00
XinyaoWa
78f8ae524d
Fix async in chatqna bug ( #1589 )
...
Align async with comps; related PR: opea-project/GenAIComps#1300
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
2025-02-27 23:32:29 +08:00
Artem Astafev
6abf7652e8
Fix ChatQnA ROCm compose Readme file and absolute path for ROCM CI test ( #1159 )
...
Signed-off-by: Artem Astafev <a.astafev@datamonsters.com >
2025-02-27 15:26:45 +08:00
Spycsh
25c1aefc27
Align mongo related image names with comps ( #1543 )
...
- chathistory-mongo-server -> chathistory-mongo (except container names)
- feedbackmanagement -> feedbackmanagement-mongo
- promptregistry-server/promptregistry-mongo-server -> promptregistry-mongo (except container names)
Signed-off-by: Spycsh <sihan.chen@intel.com >
2025-02-27 09:25:49 +08:00
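The image renames listed in 25c1aefc27 can be sketched as a small lookup, for anyone updating their own compose files by hand. This is only an illustration, not code from the repository: the `IMAGE_RENAMES` table and the `rename_image` helper are hypothetical names, and the `opea/` prefix in the usage note is an assumed example registry path. Container names, which the commit explicitly leaves alone, are not handled.

```python
# Old -> new image names, copied from the commit message of 25c1aefc27.
# Hypothetical helper for illustration; container names are out of scope.
IMAGE_RENAMES = {
    "chathistory-mongo-server": "chathistory-mongo",
    "feedbackmanagement": "feedbackmanagement-mongo",
    "promptregistry-server": "promptregistry-mongo",
    "promptregistry-mongo-server": "promptregistry-mongo",
}


def rename_image(ref: str) -> str:
    """Rename the image in a reference such as 'registry/name:tag'.

    Only an exact match on the image name is renamed, so references that
    already use a new name (or any unrelated image) are returned unchanged.
    """
    prefix, sep, rest = ref.rpartition("/")  # split off an optional registry path
    name, colon, tag = rest.partition(":")   # split off an optional tag
    new_name = IMAGE_RENAMES.get(name, name)
    return f"{prefix}{sep}{new_name}{colon}{tag}"
```

For example, under the assumed `opea/` prefix, `rename_image("opea/chathistory-mongo-server:latest")` yields `opea/chathistory-mongo:latest`, and an already-renamed reference passes through untouched.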
dependabot[bot]
d46df4331d
Bump gradio from 5.5.0 to 5.11.0 in /DocSum/ui/gradio ( #1576 )
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Liang Lv <liang1.lv@intel.com >
2025-02-25 14:32:03 +08:00
Eero Tamminen
23a77df302
Fix "OpenAI" & "response" spelling ( #1561 )
2025-02-25 12:45:21 +08:00
Ying Hu
852bc7027c
Update README.md of AIPC quick start ( #1578 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-02-23 17:38:27 +08:00
minmin-intel
a7eced4161
Update AgentQnA and DocIndexRetriever ( #1564 )
...
Signed-off-by: minmin-intel <minmin.hou@intel.com >
2025-02-22 09:51:26 +08:00
ZePan110
caec354324
Fix trivy issue ( #1569 )
...
Fix docker image security issue
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-02-20 14:41:52 +08:00
xiguiw
d482554a6b
Fix mismatched environment variable ( #1575 )
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
2025-02-19 19:24:10 +08:00
xiguiw
2ae6871fc5
Simplify ChatQnA AIPC user setting ( #1573 )
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
2025-02-19 16:30:02 +08:00
dependabot[bot]
2ac5be9921
Bump gradio from 5.5.0 to 5.11.0 in /MultimodalQnA/ui/gradio ( #1391 )
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com >
2025-02-19 15:58:46 +08:00
ZePan110
799881a3fa
Remove perf test code from test scripts. ( #1510 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-02-18 16:23:49 +08:00
jotpalch
e5c6418c81
Fix minor typo in README ( #1559 )
...
Change "Docker Compost<br/>Deployment on ROCm" to "Docker Compose<br/>Deployment on ROCm"
2025-02-17 12:07:31 +08:00
xiguiw
0c0edffc5b
update vLLM CPU to the latest stable version ( #1546 )
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
Co-authored-by: chen, suyue <suyue.chen@intel.com >
2025-02-17 08:26:25 +08:00
Spycsh
9f36e84c1c
Refactor AudioQnA README ( #1508 )
...
Signed-off-by: Spycsh <sihan.chen@intel.com >
2025-02-15 11:30:16 +08:00
chen, suyue
8c547c2ba5
Expand CI test scope for common test scripts ( #1554 )
...
Expand CI test scope: trigger all hw tests when the common test scripts change.
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-02-14 18:17:03 +08:00
Kendall González León
80dd86f122
Make a fix in the main README.md of the ChatQnA. ( #1551 )
...
Signed-off-by: Kendall González León <kendall.gonzalez.leon@intel.com >
2025-02-14 17:00:44 +08:00
ZePan110
6d781f7b2b
Fix CICD workflow strategy running condition ( #1533 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-02-13 16:10:00 +08:00
WenjiaoYue
abafd5de20
Update UI of the three demos: faqGen, VisualQnA, and DocSum. ( #1528 )
...
Signed-off-by: WenjiaoYue <wenjiao.yue@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-02-12 15:57:51 +08:00
Louie Tsai
970b869838
Add a new section to change LLM model such as deepseek based on validated model table in LLM microservice ( #1501 )
...
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
Co-authored-by: Wang, Kai Lawrence <109344418+wangkl2@users.noreply.github.com >
Co-authored-by: xiguiw <111278656+xiguiw@users.noreply.github.com >
2025-02-12 09:34:56 +08:00
XinyaoWa
87ff149f61
Remove vllm hpu triton version fix ( #1515 )
...
vllm-fork has fixed the triton version issue, so remove the duplicated code. See https://github.com/HabanaAI/vllm-fork/blob/habana_main/requirements-hpu.txt
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
Co-authored-by: chen, suyue <suyue.chen@intel.com >
2025-02-12 09:24:38 +08:00
chen, suyue
c39a569ab2
Update workflow condition and env ( #1522 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-02-12 09:08:22 +08:00
chen, suyue
81b02bb947
Revert "HUGGINGFACEHUB_API_TOKEN environment is change to HF_TOKEN (#… ( #1521 )
...
Revert this PR since the test was not triggered properly due to the mistaken merge of a WIP CI PR, 44a689b0bf, which blocked the CI test.
This change will be submitted in another PR.
2025-02-11 18:36:12 +08:00
Louie Tsai
47069ac70c
fix a test script issue due to name change for telemetry yaml files ( #1516 )
...
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
2025-02-11 17:58:42 +08:00
chen, suyue
6ce7730863
Update CI/CD workflow ( #1520 )
...
1. Update auto commit account.
2. Fix test condition.
Signed-off-by: chensuyue <suyue.chen@intel.com >
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-02-11 17:48:37 +08:00
Louie Tsai
ad5523bac7
Enable OpenTelemetry Tracing for ChatQnA on Xeon and Gaudi by docker compose merge feature ( #1488 )
...
Signed-off-by: Louie, Tsai <louie.tsai@intel.com >
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
2025-02-10 22:58:50 -08:00
Louie Tsai
88a8235f21
Update README.md for Agent UI ( #1495 )
...
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
2025-02-10 22:22:55 -08:00
ZePan110
63ad850052
Update docker image list ( #1513 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-02-11 13:18:22 +08:00
ZePan110
9a0c547112
Fix publish issue ( #1514 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-02-11 11:43:00 +08:00
ZePan110
26a6da4123
Fix nightly triggered exceptions ( #1505 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-02-10 16:51:34 +08:00