GenAIExamples

Author	SHA1	Message	Date
WenjiaoYue	abafd5de20	Update UI of the three demos: faqGen, VisualQnA, and DocSum. (#1528 ) Signed-off-by: WenjiaoYue <wenjiao.yue@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2025-02-12 15:57:51 +08:00
Louie Tsai	970b869838	Add a new section to change LLM model such as deepseek based on validated model table in LLM microservice (#1501 ) Signed-off-by: Tsai, Louie <louie.tsai@intel.com> Co-authored-by: Wang, Kai Lawrence <109344418+wangkl2@users.noreply.github.com> Co-authored-by: xiguiw <111278656+xiguiw@users.noreply.github.com>	2025-02-12 09:34:56 +08:00
XinyaoWa	87ff149f61	Remove vllm hpu triton version fix (#1515 ) vllm-fork has fix triton version issue, remove duplicated code https://github.com/HabanaAI/vllm-fork/blob/habana_main/requirements-hpu.txt Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> Co-authored-by: chen, suyue <suyue.chen@intel.com>	2025-02-12 09:24:38 +08:00
chen, suyue	c39a569ab2	Update workflow condition and env (#1522 ) Signed-off-by: chensuyue <suyue.chen@intel.com>	2025-02-12 09:08:22 +08:00
chen, suyue	81b02bb947	Revert "HUGGINGFACEHUB_API_TOKEN environment is change to HF_TOKEN (#… (#1521 ) Revert this PR since the test is not triggered properly due to the false merge of a WIP CI PR, `44a689b0bf`, which block the CI test. This change will be submitted in another PR.	2025-02-11 18:36:12 +08:00
Louie Tsai	47069ac70c	fix a test script issue due to name change for telemetry yaml files (#1516 ) Signed-off-by: Tsai, Louie <louie.tsai@intel.com>	2025-02-11 17:58:42 +08:00
chen, suyue	6ce7730863	Update CI/CD workflow (#1520 ) 1. Update auto commit account. 2. Fix test condition. Signed-off-by: chensuyue <suyue.chen@intel.com> Signed-off-by: ZePan110 <ze.pan@intel.com>	2025-02-11 17:48:37 +08:00
Louie Tsai	ad5523bac7	Enable OpenTelemtry Tracing for ChatQnA on Xeon and Gaudi by docker compose merge feature (#1488 ) Signed-off-by: Louie, Tsai <louie.tsai@intel.com> Signed-off-by: Tsai, Louie <louie.tsai@intel.com>	2025-02-10 22:58:50 -08:00
Louie Tsai	88a8235f21	Update README.md for Agent UI (#1495 ) Signed-off-by: Tsai, Louie <louie.tsai@intel.com>	2025-02-10 22:22:55 -08:00
ZePan110	63ad850052	Update docker image list (#1513 ) Signed-off-by: ZePan110 <ze.pan@intel.com>	2025-02-11 13:18:22 +08:00
ZePan110	9a0c547112	Fix publish issue (#1514 ) Signed-off-by: ZePan110 <ze.pan@intel.com>	2025-02-11 11:43:00 +08:00
ZePan110	26a6da4123	Fix nightly triggered exceptions (#1505 ) Signed-off-by: ZePan110 <ze.pan@intel.com>	2025-02-10 16:51:34 +08:00
xiguiw	45d5da2ddd	HUGGINGFACEHUB_API_TOKEN environment is change to HF_TOKEN (#1503 ) Signed-off-by: Wang, Xigui <xigui.wang@intel.com>	2025-02-09 20:33:06 +08:00
xiguiw	1b3291a1c8	Fix docker compose.yaml error (#1496 ) Signed-off-by: Wang, Xigui <xigui.wang@intel.com>	2025-02-07 09:53:20 +08:00
ZePan110	7ac8cf517a	Restore test code. (#1502 ) Remove nightly test code. Signed-off-by: ZePan110 <ze.pan@intel.com>	2025-02-07 09:50:21 +08:00
ZePan110	44a689b0bf	Fix null value_file judgment (#1470 ) Signed-off-by: ZePan110 <ze.pan@intel.com> Co-authored-by: Malini Bhandaru <malini.bhandaru@intel.com>	2025-02-06 17:09:01 +08:00
xiguiw	388d3eb5c5	[Doc] Clean empty document (#1497 ) Signed-off-by: Wang, Xigui <xigui.wang@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2025-02-06 10:53:25 +08:00
chyundunovDatamonsters	ef9ad61440	DBQnA - Adding files to deploy DBQnA application on AMD GPU (#1273 ) Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com> Co-authored-by: Chingis Yundunov <YundunovCN@sibedge.com> Co-authored-by: Malini Bhandaru <malini.bhandaru@intel.com>	2025-02-06 09:41:59 +08:00
Louie Tsai	4c41a5db83	Update README.md for OPEA OTLP tracing (#1406 ) Signed-off-by: louie-tsai <louie.tsai@intel.com> Signed-off-by: Tsai, Louie <louie.tsai@intel.com> Co-authored-by: Eero Tamminen <eero.t.tamminen@intel.com>	2025-02-05 13:03:15 -08:00
Liang Lv	9adf7a6af0	Add support for latest deepseek models on Gaudi (#1491 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2025-02-05 08:30:04 +08:00
chen, suyue	a4d028e8ea	update image release workflow (#1303 ) Signed-off-by: chensuyue <suyue.chen@intel.com> Co-authored-by: Malini Bhandaru <malini.bhandaru@intel.com>	2025-02-03 17:07:07 -08:00
Omar Khleif	32d4f714fd	Fix for NLTK related import failure (#1487 ) Signed-off-by: okhleif-IL <omar.khleif@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2025-02-01 10:04:37 +08:00
chyundunovDatamonsters	fdbc27a9b5	AvatarChatbot - Adding files to deploy AvatarChatbot application on AMD GPU (#1288 ) Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com>	2025-01-27 11:30:52 +08:00
XinyuYe-Intel	5f4b1828a5	Added UT for rerank finetuning on Gaudi (#1472 ) Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>	2025-01-27 11:24:05 +08:00
chyundunovDatamonsters	39abef8be8	SearchQnA App - Adding files to deploy SearchQnA application on AMD GPU (#1193 ) Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com>	2025-01-27 10:58:55 +08:00
bjzhjing	ed163087ba	Provide unified scalable deployment and benchmarking support for exam… (#1315 ) Signed-off-by: Cathy Zhang <cathy.zhang@intel.com> Signed-off-by: letonghan <letong.han@intel.com> Co-authored-by: letonghan <letong.han@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2025-01-24 22:27:49 +08:00
chen, suyue	259099d19f	Remove kubernetes manifest related code and tests (#1466 ) Remove deprecated kubernetes manifest related code and tests. k8s implementation for those examples based on helm charts will target for next release. Signed-off-by: chensuyue <suyue.chen@intel.com>	2025-01-24 15:23:12 +08:00
chen, suyue	9a1118730b	Freeze the triton version in vllm-gaudi image to 3.1.0 (#1463 ) The new triton version 3.2.0 can't work with vllm-gaudi. Freeze the triton version in vllm-gaudi image to 3.1.0. Issue create for vllm-fork: HabanaAI/vllm-fork#732 Signed-off-by: chensuyue <suyue.chen@intel.com>	2025-01-24 09:50:59 +08:00
chen, suyue	ffce7068aa	Fix image on push action due to manifest test remove (#1460 ) 1. Fix image on push action due to manifest test remove. 2. Fix helm test cd workflow get test matrix step. Signed-off-by: chensuyue <suyue.chen@intel.com>	2025-01-23 14:30:09 +08:00
dolpher	9b0f98be8b	Update ChatQnA helm chart README. (#1459 ) Signed-off-by: Dolpher Du <dolpher.du@intel.com>	2025-01-23 10:54:39 +08:00
XinyuYe-Intel	f0fea7b706	Add docker compose yaml for text2image example (#1418 ) Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>	2025-01-23 09:57:54 +08:00
Melanie Hart Buehler	1864fac978	Fixes MultimodalQnA dataprep endpoint and port in the UI (#1457 ) Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2025-01-22 17:11:09 -08:00
Lianhao Lu	94f71f2322	Update top level readme (#1458 ) Add helm support of SeachQnA and Text2Image in top level readme. Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>	2025-01-23 09:07:33 +08:00
chen, suyue	6600c32a9b	remove image build condition (#1456 ) Test compose cd workflow depend on image build, so if we want to run both compose and helm charts deployment in cd workflow, this condition should be removed. Signed-off-by: chensuyue <suyue.chen@intel.com>	2025-01-23 00:17:04 +08:00
Liang Lv	d953332f43	Fix multimodal docker image issue for MutimodalQnA on Gaudi (#1455 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2025-01-23 00:12:06 +08:00
chyundunovDatamonsters	cbe5805f47	AgentQnA - add README file for deploy on ROCm (#1379 ) Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com>	2025-01-22 21:57:15 +08:00
Ervin Castelino	27fdbcab58	[chore/chatqna] Missing protocol in curl command (#1447 ) This PR fixes the missing protocol for in the curl command mentioned in chatqna readme for tei-embedding-service.	2025-01-22 21:41:47 +08:00
lkk	f07cf1dad2	Fix wrong vllm repo. (#1454 ) Use vllm-fork for gaudi. fix the issue #1451	2025-01-22 21:22:56 +08:00
dolpher	ee0e5cc8d9	Sync value files from GenAIInfra (#1428 ) All gaudi values updated with extra flags. Added helm support for 2 new examples Text2Image and SearchQnA. Minor fix for llm-uservice. Signed-off-by: Dolpher Du <dolpher.du@intel.com>	2025-01-22 17:44:11 +08:00
chen, suyue	5c36443b11	Use local hub cache for AgentQnA test (#1450 ) Use local hub cache for AgentQnA test to save workspace. Signed-off-by: chensuyue <suyue.chen@intel.com>	2025-01-22 13:23:00 +08:00
Lianhao Lu	62cea74a23	CI: improve helm CI (#1452 ) Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>	2025-01-22 09:18:35 +08:00
WenjiaoYue	b721c256f9	Fix Domain Access Issue in Latest Vite Version (#1444 ) Fix the restriction on using domain names when users are using the latest version of Vite When users use the new version of Vite, the UI cannot be accessed via domain names due to Vite's new rules. This fix adds the corresponding parameters according to Vite's new rules, ensuring that users can access the frontend via domain names when building the UI. Fixes #1441 Co-authored-by: WenjiaoYue <wenjiao.yue@intel.com>	2025-01-21 23:28:37 +08:00
chen, suyue	927698e23e	Simplify git clone code in CI test (#1434 ) 1. Simplify git clone code in CI test. 2. Replace git clone branch in Dockerfile. Signed-off-by: chensuyue <suyue.chen@intel.com>	2025-01-21 23:00:08 +08:00
ZePan110	c3e84b5ffa	Fix test matrix for helm charts (#1449 ) Signed-off-by: ZePan110 <ze.pan@intel.com>	2025-01-21 22:28:31 +08:00
ZePan110	6b2a041f25	Fix Helm-chart workflow issues. (#1448 ) Fix matrix error issues and CD test files cannot be obtained. Signed-off-by: ZePan110 <ze.pan@intel.com>	2025-01-21 21:48:57 +08:00
ZePan110	842f46326b	Switch helm-chart test runs-on label. (#1446 ) Switch helm-chart test runs-on label from ${{ inputs.hardware }} to k8s-${{ inputs.hardware }}. Signed-off-by: ZePan110 <ze.pan@intel.com>	2025-01-21 18:07:03 +08:00
Wang, Kai Lawrence	284db982be	[ROCm] Fix the hf-token setting for TGI and TEI in ChatQnA (#1432 ) This PR is to correct the env variable names in chatqna example on ROCm platform passing to the docker container of TGI and TEI. For tgi, either HF_TOKEN and HUGGING_FACE_HUB_TOKEN could be parsed in TGI while HF_API_TOKEN can be parsed in TEI. TGI: https://github.com/huggingface/text-generation-inference/blob/main/router/src/server.rs#L1700C1-L1702C15 TEI: https://github.com/huggingface/text-embeddings-inference/blob/main/router/src/main.rs#L112 Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>	2025-01-21 14:22:39 +08:00
ZePan110	fc96fe83e2	Fix CD workflow issue (#1443 ) Fix the issue of CD workflow values_files errors. Signed-off-by: ZePan110 <ze.pan@intel.com>	2025-01-21 11:54:12 +08:00
Hoong Tee, Yeoh	0316114c4b	ProductivitySuite: Fix FaqGen Microservice CI test fail (#1437 ) Change in FAQGen microservice for content-type header result in CI failure. #1431 Signed-off-by: Yeoh, Hoong Tee <hoong.tee.yeoh@intel.com>	2025-01-21 10:23:35 +08:00
chen, suyue	0408453fa2	Unify the yaml name to fix the CD workflow (#1435 ) Fix the issue in #1372 Signed-off-by: chensuyue <suyue.chen@intel.com>	2025-01-21 01:10:41 +08:00

1 2 3 4 5 ...

894 Commits