jotpalch
e5c6418c81
Fix minor typo in README ( #1559 )
...
Change Docker Compost<br/>Deployment on ROCm to Docker Compose<br/>Deployment on ROCm
2025-02-17 12:07:31 +08:00
xiguiw
0c0edffc5b
update vLLM CPU to the latest stable version ( #1546 )
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
Co-authored-by: chen, suyue <suyue.chen@intel.com >
2025-02-17 08:26:25 +08:00
Spycsh
9f36e84c1c
Refactor AudioQnA README ( #1508 )
...
Signed-off-by: Spycsh <sihan.chen@intel.com >
2025-02-15 11:30:16 +08:00
chen, suyue
8c547c2ba5
Expand CI test scope for common test scripts ( #1554 )
...
Expand CI test scope, trigger all hw test when the common test scripts changed.
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-02-14 18:17:03 +08:00
Kendall González León
80dd86f122
Make a fix in the main README.md of the ChatQnA. ( #1551 )
...
Signed-off-by: Kendall González León <kendall.gonzalez.leon@intel.com >
2025-02-14 17:00:44 +08:00
ZePan110
6d781f7b2b
Fix CICD workflow strategy running condition ( #1533 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-02-13 16:10:00 +08:00
WenjiaoYue
abafd5de20
Update UI of the three demos: faqGen, VisualQnA, and DocSum. ( #1528 )
...
Signed-off-by: WenjiaoYue <wenjiao.yue@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-02-12 15:57:51 +08:00
Louie Tsai
970b869838
Add a new section to change LLM model such as deepseek based on validated model table in LLM microservice ( #1501 )
...
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
Co-authored-by: Wang, Kai Lawrence <109344418+wangkl2@users.noreply.github.com >
Co-authored-by: xiguiw <111278656+xiguiw@users.noreply.github.com >
2025-02-12 09:34:56 +08:00
XinyaoWa
87ff149f61
Remove vllm hpu triton version fix ( #1515 )
...
vllm-fork has fix triton version issue, remove duplicated code https://github.com/HabanaAI/vllm-fork/blob/habana_main/requirements-hpu.txt
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
Co-authored-by: chen, suyue <suyue.chen@intel.com >
2025-02-12 09:24:38 +08:00
chen, suyue
c39a569ab2
Update workflow condition and env ( #1522 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-02-12 09:08:22 +08:00
chen, suyue
81b02bb947
Revert "HUGGINGFACEHUB_API_TOKEN environment is change to HF_TOKEN (#… ( #1521 )
...
Revert this PR since the test is not triggered properly due to the false merge of a WIP CI PR, 44a689b0bf , which block the CI test.
This change will be submitted in another PR.
2025-02-11 18:36:12 +08:00
Louie Tsai
47069ac70c
fix a test script issue due to name change for telemetry yaml files ( #1516 )
...
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
2025-02-11 17:58:42 +08:00
chen, suyue
6ce7730863
Update CI/CD workflow ( #1520 )
...
1. Update auto commit account.
2. Fix test condition.
Signed-off-by: chensuyue <suyue.chen@intel.com >
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-02-11 17:48:37 +08:00
Louie Tsai
ad5523bac7
Enable OpenTelemtry Tracing for ChatQnA on Xeon and Gaudi by docker compose merge feature ( #1488 )
...
Signed-off-by: Louie, Tsai <louie.tsai@intel.com >
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
2025-02-10 22:58:50 -08:00
Louie Tsai
88a8235f21
Update README.md for Agent UI ( #1495 )
...
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
2025-02-10 22:22:55 -08:00
ZePan110
63ad850052
Update docker image list ( #1513 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-02-11 13:18:22 +08:00
ZePan110
9a0c547112
Fix publish issue ( #1514 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-02-11 11:43:00 +08:00
ZePan110
26a6da4123
Fix nightly triggered exceptions ( #1505 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-02-10 16:51:34 +08:00
xiguiw
45d5da2ddd
HUGGINGFACEHUB_API_TOKEN environment is change to HF_TOKEN ( #1503 )
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
2025-02-09 20:33:06 +08:00
xiguiw
1b3291a1c8
Fix docker compose.yaml error ( #1496 )
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
2025-02-07 09:53:20 +08:00
ZePan110
7ac8cf517a
Restore test code. ( #1502 )
...
Remove nightly test code.
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-02-07 09:50:21 +08:00
ZePan110
44a689b0bf
Fix null value_file judgment ( #1470 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
Co-authored-by: Malini Bhandaru <malini.bhandaru@intel.com >
2025-02-06 17:09:01 +08:00
xiguiw
388d3eb5c5
[Doc] Clean empty document ( #1497 )
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-02-06 10:53:25 +08:00
chyundunovDatamonsters
ef9ad61440
DBQnA - Adding files to deploy DBQnA application on AMD GPU ( #1273 )
...
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com >
Co-authored-by: Chingis Yundunov <YundunovCN@sibedge.com >
Co-authored-by: Malini Bhandaru <malini.bhandaru@intel.com >
2025-02-06 09:41:59 +08:00
Louie Tsai
4c41a5db83
Update README.md for OPEA OTLP tracing ( #1406 )
...
Signed-off-by: louie-tsai <louie.tsai@intel.com >
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
Co-authored-by: Eero Tamminen <eero.t.tamminen@intel.com >
2025-02-05 13:03:15 -08:00
Liang Lv
9adf7a6af0
Add support for latest deepseek models on Gaudi ( #1491 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2025-02-05 08:30:04 +08:00
chen, suyue
a4d028e8ea
update image release workflow ( #1303 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
Co-authored-by: Malini Bhandaru <malini.bhandaru@intel.com >
2025-02-03 17:07:07 -08:00
Omar Khleif
32d4f714fd
Fix for NLTK related import failure ( #1487 )
...
Signed-off-by: okhleif-IL <omar.khleif@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-02-01 10:04:37 +08:00
chyundunovDatamonsters
fdbc27a9b5
AvatarChatbot - Adding files to deploy AvatarChatbot application on AMD GPU ( #1288 )
...
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com >
2025-01-27 11:30:52 +08:00
XinyuYe-Intel
5f4b1828a5
Added UT for rerank finetuning on Gaudi ( #1472 )
...
Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com >
2025-01-27 11:24:05 +08:00
chyundunovDatamonsters
39abef8be8
SearchQnA App - Adding files to deploy SearchQnA application on AMD GPU ( #1193 )
...
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com >
2025-01-27 10:58:55 +08:00
bjzhjing
ed163087ba
Provide unified scalable deployment and benchmarking support for exam… ( #1315 )
...
Signed-off-by: Cathy Zhang <cathy.zhang@intel.com >
Signed-off-by: letonghan <letong.han@intel.com >
Co-authored-by: letonghan <letong.han@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-01-24 22:27:49 +08:00
chen, suyue
259099d19f
Remove kubernetes manifest related code and tests ( #1466 )
...
Remove deprecated kubernetes manifest related code and tests.
k8s implementation for those examples based on helm charts will target for next release.
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-01-24 15:23:12 +08:00
chen, suyue
9a1118730b
Freeze the triton version in vllm-gaudi image to 3.1.0 ( #1463 )
...
The new triton version 3.2.0 can't work with vllm-gaudi. Freeze the triton version in vllm-gaudi image to 3.1.0.
Issue create for vllm-fork: HabanaAI/vllm-fork#732
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-01-24 09:50:59 +08:00
chen, suyue
ffce7068aa
Fix image on push action due to manifest test remove ( #1460 )
...
1. Fix image on push action due to manifest test remove.
2. Fix helm test cd workflow get test matrix step.
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-01-23 14:30:09 +08:00
dolpher
9b0f98be8b
Update ChatQnA helm chart README. ( #1459 )
...
Signed-off-by: Dolpher Du <dolpher.du@intel.com >
2025-01-23 10:54:39 +08:00
XinyuYe-Intel
f0fea7b706
Add docker compose yaml for text2image example ( #1418 )
...
Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com >
2025-01-23 09:57:54 +08:00
Melanie Hart Buehler
1864fac978
Fixes MultimodalQnA dataprep endpoint and port in the UI ( #1457 )
...
Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-01-22 17:11:09 -08:00
Lianhao Lu
94f71f2322
Update top level readme ( #1458 )
...
Add helm support of SeachQnA and Text2Image in top level readme.
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com >
2025-01-23 09:07:33 +08:00
chen, suyue
6600c32a9b
remove image build condition ( #1456 )
...
Test compose cd workflow depend on image build, so if we want to run both compose and helm charts deployment in cd workflow, this condition should be removed.
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-01-23 00:17:04 +08:00
Liang Lv
d953332f43
Fix multimodal docker image issue for MutimodalQnA on Gaudi ( #1455 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2025-01-23 00:12:06 +08:00
chyundunovDatamonsters
cbe5805f47
AgentQnA - add README file for deploy on ROCm ( #1379 )
...
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com >
2025-01-22 21:57:15 +08:00
Ervin Castelino
27fdbcab58
[chore/chatqna] Missing protocol in curl command ( #1447 )
...
This PR fixes the missing protocol for in the curl command mentioned in chatqna readme for tei-embedding-service.
2025-01-22 21:41:47 +08:00
lkk
f07cf1dad2
Fix wrong vllm repo. ( #1454 )
...
Use vllm-fork for gaudi.
fix the issue #1451
2025-01-22 21:22:56 +08:00
dolpher
ee0e5cc8d9
Sync value files from GenAIInfra ( #1428 )
...
All gaudi values updated with extra flags.
Added helm support for 2 new examples Text2Image and SearchQnA. Minor fix for llm-uservice.
Signed-off-by: Dolpher Du <dolpher.du@intel.com >
2025-01-22 17:44:11 +08:00
chen, suyue
5c36443b11
Use local hub cache for AgentQnA test ( #1450 )
...
Use local hub cache for AgentQnA test to save workspace.
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-01-22 13:23:00 +08:00
Lianhao Lu
62cea74a23
CI: improve helm CI ( #1452 )
...
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com >
2025-01-22 09:18:35 +08:00
WenjiaoYue
b721c256f9
Fix Domain Access Issue in Latest Vite Version ( #1444 )
...
Fix the restriction on using domain names when users are using the latest version of Vite
When users use the new version of Vite, the UI cannot be accessed via domain names due to Vite's new rules. This fix adds the corresponding parameters according to Vite's new rules, ensuring that users can access the frontend via domain names when building the UI.
Fixes #1441
Co-authored-by: WenjiaoYue <wenjiao.yue@intel.com >
2025-01-21 23:28:37 +08:00
chen, suyue
927698e23e
Simplify git clone code in CI test ( #1434 )
...
1. Simplify git clone code in CI test.
2. Replace git clone branch in Dockerfile.
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-01-21 23:00:08 +08:00
ZePan110
c3e84b5ffa
Fix test matrix for helm charts ( #1449 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-01-21 22:28:31 +08:00