GenAIExamples

Author	SHA1	Message	Date
Louie Tsai	e8cdf7d668	[ChatQnA] update to the latest Grafana Dashboard (#1728 ) Signed-off-by: Tsai, Louie <louie.tsai@intel.com>	2025-04-03 12:14:55 -07:00
chyundunovDatamonsters	c50dfb2510	Adding files to deploy ChatQnA application on ROCm vLLM (#1560 ) Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com>	2025-04-03 17:19:26 +08:00
Xiaotian Chen	1bd56af994	Update TGI image versions (#1625 ) Signed-off-by: xiaotia3 <xiaotian.chen@intel.com>	2025-04-01 11:27:51 +08:00
xiguiw	87baeb833d	Update TEI docker image to 1.6 (#1650 ) Signed-off-by: Wang, Xigui <xigui.wang@intel.com>	2025-03-27 09:40:22 +08:00
Louie Tsai	0736912c69	change gaudi node exporter from default one to 41612 (#1702 ) Signed-off-by: Louie Tsai <louie.tsai@intel.com> Signed-off-by: Tsai, Louie <louie.tsai@intel.com>	2025-03-20 21:38:24 -07:00
XinyaoWa	6d24c1c77a	Merge FaqGen into ChatQnA (#1654 ) 1. Delete FaqGen 2. Refactor FaqGen into ChatQnA, serve as a LLM selection. 3. Combine all ChatQnA related Dockerfile into one Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>	2025-03-20 17:40:00 +08:00
James Edwards	527b146a80	Add final README.md and set_env.sh script for quickstart review. Previous pull request was 1595. (#1662 ) Signed-off-by: Edwards, James A <jaedwards@habana.ai> Co-authored-by: Edwards, James A <jaedwards@habana.ai>	2025-03-14 16:05:01 -07:00
Louie Tsai	671dff7f51	[ChatQnA] Enable Prometheus and Grafana with telemetry docker compose file. (#1623 ) Signed-off-by: Tsai, Louie <louie.tsai@intel.com>	2025-03-13 23:18:29 -07:00
Li Gang	0701b8cfff	[ChatQnA][docker]Check healthy of redis to avoid dataprep failure (#1591 ) Signed-off-by: Li Gang <gang.g.li@intel.com>	2025-03-13 10:52:33 +08:00
chen, suyue	43d0a18270	Enhance ChatQnA test scripts (#1643 ) Signed-off-by: chensuyue <suyue.chen@intel.com>	2025-03-10 17:36:26 +08:00
Wang, Kai Lawrence	5362321d3a	Fix vllm model cache directory (#1642 ) Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>	2025-03-10 13:40:42 +08:00
ZePan110	785ffb9a1e	Update compose.yaml for ChatQnA (#1621 ) Update compose.yaml for ChatQnA Signed-off-by: ZePan110 <ze.pan@intel.com>	2025-03-07 09:19:39 +08:00
ZePan110	6ead1b12db	Enable ChatQnA model cache for docker compose test. (#1605 ) Enable ChatQnA model cache for docker compose test. Signed-off-by: ZePan110 <ze.pan@intel.com>	2025-03-05 11:30:04 +08:00
Eze Lanza (Eze)	fba0de45d2	ChatQnA Docker compose file for Milvus as vdb (#1548 ) Signed-off-by: Ezequiel Lanza <ezequiel.lanza@gmail.com> Signed-off-by: Kendall González León <kendall.gonzalez.leon@intel.com> Signed-off-by: chensuyue <suyue.chen@intel.com> Signed-off-by: Spycsh <sihan.chen@intel.com> Signed-off-by: Wang, Xigui <xigui.wang@intel.com> Signed-off-by: ZePan110 <ze.pan@intel.com> Signed-off-by: dependabot[bot] <support@github.com> Signed-off-by: minmin-intel <minmin.hou@intel.com> Signed-off-by: Artem Astafev <a.astafev@datamonsters.com> Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> Signed-off-by: Cathy Zhang <cathy.zhang@intel.com> Signed-off-by: letonghan <letong.han@intel.com> Signed-off-by: alexsin368 <alex.sin@intel.com> Signed-off-by: WenjiaoYue <wenjiao.yue@intel.com> Co-authored-by: Ezequiel Lanza <emlanza@CDQ242RKJDmac.local> Co-authored-by: Kendall González León <kendallgonzalez@hotmail.es> Co-authored-by: chen, suyue <suyue.chen@intel.com> Co-authored-by: Spycsh <39623753+Spycsh@users.noreply.github.com> Co-authored-by: xiguiw <111278656+xiguiw@users.noreply.github.com> Co-authored-by: jotpalch <49465120+jotpalch@users.noreply.github.com> Co-authored-by: ZePan110 <ze.pan@intel.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: minmin-intel <minmin.hou@intel.com> Co-authored-by: Ying Hu <ying.hu@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Eero Tamminen <eero.t.tamminen@intel.com> Co-authored-by: Liang Lv <liang1.lv@intel.com> Co-authored-by: Artem Astafev <a.astafev@datamonsters.com> Co-authored-by: XinyaoWa <xinyao.wang@intel.com> Co-authored-by: alexsin368 <109180236+alexsin368@users.noreply.github.com> Co-authored-by: WenjiaoYue <wenjiao.yue@intel.com>	2025-02-28 22:40:31 +08:00
Artem Astafev	6abf7652e8	Fix ChatQnA ROCm compose Readme file and absolute path for ROCM CI test (#1159 ) Signed-off-by: Artem Astafev <a.astafev@datamonsters.com>	2025-02-27 15:26:45 +08:00
Ying Hu	852bc7027c	Update README.md of AIPC quick start (#1578 ) Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2025-02-23 17:38:27 +08:00
xiguiw	d482554a6b	Fix mismatched environment variable (#1575 ) Signed-off-by: Wang, Xigui <xigui.wang@intel.com>	2025-02-19 19:24:10 +08:00
xiguiw	2ae6871fc5	Simplify ChatQnA AIPC user setting (#1573 ) Signed-off-by: Wang, Xigui <xigui.wang@intel.com>	2025-02-19 16:30:02 +08:00
Louie Tsai	970b869838	Add a new section to change LLM model such as deepseek based on validated model table in LLM microservice (#1501 ) Signed-off-by: Tsai, Louie <louie.tsai@intel.com> Co-authored-by: Wang, Kai Lawrence <109344418+wangkl2@users.noreply.github.com> Co-authored-by: xiguiw <111278656+xiguiw@users.noreply.github.com>	2025-02-12 09:34:56 +08:00
chen, suyue	81b02bb947	Revert "HUGGINGFACEHUB_API_TOKEN environment is change to HF_TOKEN (#… (#1521 ) Revert this PR since the test is not triggered properly due to the false merge of a WIP CI PR, `44a689b0bf`, which block the CI test. This change will be submitted in another PR.	2025-02-11 18:36:12 +08:00
Louie Tsai	ad5523bac7	Enable OpenTelemtry Tracing for ChatQnA on Xeon and Gaudi by docker compose merge feature (#1488 ) Signed-off-by: Louie, Tsai <louie.tsai@intel.com> Signed-off-by: Tsai, Louie <louie.tsai@intel.com>	2025-02-10 22:58:50 -08:00
xiguiw	45d5da2ddd	HUGGINGFACEHUB_API_TOKEN environment is change to HF_TOKEN (#1503 ) Signed-off-by: Wang, Xigui <xigui.wang@intel.com>	2025-02-09 20:33:06 +08:00
Liang Lv	9adf7a6af0	Add support for latest deepseek models on Gaudi (#1491 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2025-02-05 08:30:04 +08:00
Ervin Castelino	27fdbcab58	[chore/chatqna] Missing protocol in curl command (#1447 ) This PR fixes the missing protocol for in the curl command mentioned in chatqna readme for tei-embedding-service.	2025-01-22 21:41:47 +08:00
Wang, Kai Lawrence	284db982be	[ROCm] Fix the hf-token setting for TGI and TEI in ChatQnA (#1432 ) This PR is to correct the env variable names in chatqna example on ROCm platform passing to the docker container of TGI and TEI. For tgi, either HF_TOKEN and HUGGING_FACE_HUB_TOKEN could be parsed in TGI while HF_API_TOKEN can be parsed in TEI. TGI: https://github.com/huggingface/text-generation-inference/blob/main/router/src/server.rs#L1700C1-L1702C15 TEI: https://github.com/huggingface/text-embeddings-inference/blob/main/router/src/main.rs#L112 Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>	2025-01-21 14:22:39 +08:00
Wang, Kai Lawrence	3d3ac59bfb	[ChatQnA] Update the default LLM to llama3-8B on cpu/gpu/hpu (#1430 ) Update the default LLM to llama3-8B on cpu/nvgpu/amdgpu/gaudi for docker-compose deployment to avoid the potential model serving issue or the missing chat-template issue using neural-chat-7b. Slow serving issue of neural-chat-7b on ICX: #1420 Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>	2025-01-20 22:47:56 +08:00
Liang Lv	0f7e5a37ac	Adapt code for dataprep microservice refactor (#1408 ) https://github.com/opea-project/GenAIComps/pull/1153 Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2025-01-20 20:37:03 +08:00
xiguiw	2d5898244c	Enchance health check in GenAIExample docker-compose (#1410 ) Fix service launch issue 1. Update Gaudi TGI image from 2.0.6 to 2.3.1 2. Change the hpu-gaudi TGI health check condition. Signed-off-by: Wang, Xigui <xigui.wang@intel.com>	2025-01-20 20:13:13 +08:00
Wang, Kai Lawrence	742cb6ddd3	[ChatQnA] Switch to vLLM as default llm backend on Xeon (#1403 ) Switching from TGI to vLLM as the default LLM serving backend on Xeon for the ChatQnA example to enhance the perf. https://github.com/opea-project/GenAIExamples/issues/1213 Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>	2025-01-17 20:48:19 +08:00
Wang, Kai Lawrence	00e9da9ced	[ChatQnA] Switch to vLLM as default llm backend on Gaudi (#1404 ) Switching from TGI to vLLM as the default LLM serving backend on Gaudi for the ChatQnA example to enhance the perf. https://github.com/opea-project/GenAIExamples/issues/1213 Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>	2025-01-17 20:46:38 +08:00
Letong Han	4cabd55778	Refactor Retrievers related Examples (#1387 ) Delete redundant retrievers docker image in docker_images_list.md. Refactor Retrievers related Examples READMEs. Change all of the comps/retrievers/xxx/xxx/Dockerfile path into comps/retrievers/src/Dockerfile. Fix the Examples CI issues of PR opea-project/GenAIComps#1138. Signed-off-by: letonghan <letong.han@intel.com>	2025-01-16 14:21:48 +08:00
xiguiw	698a06edbf	[DOC] Fix document issue (#1395 ) Signed-off-by: Wang, Xigui <xigui.wang@intel.com>	2025-01-16 11:30:07 +08:00
Liang Lv	3ca78867eb	Update example code for embedding dependency moving to 3rd_party (#1368 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2025-01-10 15:36:58 +08:00
Louie Tsai	81022355a7	Enable OpenTelemetry Tracing for ChatQnA TGI serving on Gaudi (#1316 ) Signed-off-by: Tsai, Louie <louie.tsai@intel.com>	2025-01-08 17:20:13 -08:00
Liang Lv	b3c405a5f6	Adapt example code for guardrails refactor (#1360 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com> Signed-off-by: chensuyue <suyue.chen@intel.com>	2025-01-08 14:35:23 +08:00
ZePan110	ed2b8ed983	Exclude dockerfile under tests and exclude check Dockerfile under tests. (#1354 ) Signed-off-by: ZePan110 <ze.pan@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2025-01-07 09:05:01 +08:00
ZePan110	aa5c91d7ee	Check duplicated dockerfile (#1289 ) Signed-off-by: ZePan110 <ze.pan@intel.com>	2025-01-06 17:30:12 +08:00
chen, suyue	5c7a5bd850	Update Code and README for GenAIComps Refactor (#1285 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com> Signed-off-by: chensuyue <suyue.chen@intel.com> Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> Signed-off-by: letonghan <letong.han@intel.com> Signed-off-by: ZePan110 <ze.pan@intel.com> Signed-off-by: WenjiaoYue <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>	2025-01-02 20:03:26 +08:00
Ying Hu	597f17b979	Update set_env.sh to fix LOGFLAG warning (#1319 )	2024-12-30 10:54:26 +08:00
Wang, Kai Lawrence	4c01e14642	[ChatQnA] Remove enforce-eager to enable HPU graphs for better vLLM perf (#1210 ) Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>	2024-12-10 13:19:15 +08:00
pallavijaini0525	3a371ac102	Updated the Pinecone readme to reflect the new structure (#1222 ) Signed-off-by: Pallavi Jaini <pallavi.jaini@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-12-05 10:04:09 +08:00
Sihan Chen	907b30b7fe	Refactor service names (#1199 )	2024-11-28 10:01:31 +08:00
Wang, Kai Lawrence	ac470421d0	Update the llm backend ports (#1172 ) Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>	2024-11-22 09:20:09 +08:00
ZePan110	8808b51e42	Rename image name XXX-hpu to XXX-gaudi (#1154 ) Signed-off-by: ZePan110 <ze.pan@intel.com>	2024-11-19 22:18:41 +08:00
Wang, Kai Lawrence	2587179224	Add instructions of modifying reranking docker image for NVGPU (#1133 ) Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-11-18 15:37:32 +08:00
Louie Tsai	152adf8012	maintain a version info for docker_compose yaml files among release (#1141 ) Signed-off-by: Tsai, Louie <louie.tsai@intel.com>	2024-11-17 22:39:41 -08:00
Artem Astafev	6d3a017609	Add compose example for ChatQnA AMD ROCm deployment (#1122 ) Signed-off-by: Artem Astafev <a.astafev@datamonsters.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-11-15 17:24:06 +08:00
Louie Tsai	00d9bb6128	Enable vLLM Profiling for ChatQnA on Gaudi (#1128 ) Signed-off-by: Tsai, Louie <louie.tsai@intel.com>	2024-11-14 15:46:33 -08:00
lvliang-intel	9ff7df9202	Use fixed version of TEI Gaudi for stability (#1101 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: Malini Bhandaru <malini.bhandaru@intel.com>	2024-11-13 10:45:50 -08:00
chen, suyue	393367e9f1	Fix left issue of tgi version update (#1121 ) Signed-off-by: chensuyue <suyue.chen@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-11-13 15:42:42 +08:00

1 2

95 Commits