GenAIExamples

Author	SHA1	Message	Date
Razvan Liviu Varzaru	ebb7c24ca8	Add ChatQnA docker-compose example on Intel Xeon using MariaDB Vector (#1916 ) Signed-off-by: Razvan-Liviu Varzaru <razvan@mariadb.org> Co-authored-by: Liang Lv <liang1.lv@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2025-05-08 21:08:15 -07:00
Liang Lv	1eb2e36a18	Refine ChatQnA READMEs (#1850 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2025-04-20 10:34:24 +08:00
Ying Hu	1b3f1f632a	Update README.md of ChatQnA for layout (#1842 ) Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-04-18 11:41:35 +08:00
sri-intel	90cfe89e21	new chatqna readme template (#1755 ) Signed-off-by: Srinarayan Srikanthan <srinarayan.srikanthan@intel.com>	2025-04-17 16:38:40 +08:00
XinyaoWa	6d24c1c77a	Merge FaqGen into ChatQnA (#1654 ) 1. Delete FaqGen 2. Refactor FaqGen into ChatQnA, serve as a LLM selection. 3. Combine all ChatQnA related Dockerfile into one Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>	2025-03-20 17:40:00 +08:00
Louie Tsai	671dff7f51	[ChatQnA] Enable Prometheus and Grafana with telemetry docker compose file. (#1623 ) Signed-off-by: Tsai, Louie <louie.tsai@intel.com>	2025-03-13 23:18:29 -07:00
Wang, Kai Lawrence	5362321d3a	Fix vllm model cache directory (#1642 ) Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>	2025-03-10 13:40:42 +08:00
Louie Tsai	970b869838	Add a new section to change LLM model such as deepseek based on validated model table in LLM microservice (#1501 ) Signed-off-by: Tsai, Louie <louie.tsai@intel.com> Co-authored-by: Wang, Kai Lawrence <109344418+wangkl2@users.noreply.github.com> Co-authored-by: xiguiw <111278656+xiguiw@users.noreply.github.com>	2025-02-12 09:34:56 +08:00
chen, suyue	81b02bb947	Revert "HUGGINGFACEHUB_API_TOKEN environment is change to HF_TOKEN (#… (#1521 ) Revert this PR since the test is not triggered properly due to the false merge of a WIP CI PR, `44a689b0bf`, which block the CI test. This change will be submitted in another PR.	2025-02-11 18:36:12 +08:00
Louie Tsai	ad5523bac7	Enable OpenTelemtry Tracing for ChatQnA on Xeon and Gaudi by docker compose merge feature (#1488 ) Signed-off-by: Louie, Tsai <louie.tsai@intel.com> Signed-off-by: Tsai, Louie <louie.tsai@intel.com>	2025-02-10 22:58:50 -08:00
xiguiw	45d5da2ddd	HUGGINGFACEHUB_API_TOKEN environment is change to HF_TOKEN (#1503 ) Signed-off-by: Wang, Xigui <xigui.wang@intel.com>	2025-02-09 20:33:06 +08:00
Ervin Castelino	27fdbcab58	[chore/chatqna] Missing protocol in curl command (#1447 ) This PR fixes the missing protocol for in the curl command mentioned in chatqna readme for tei-embedding-service.	2025-01-22 21:41:47 +08:00
Wang, Kai Lawrence	3d3ac59bfb	[ChatQnA] Update the default LLM to llama3-8B on cpu/gpu/hpu (#1430 ) Update the default LLM to llama3-8B on cpu/nvgpu/amdgpu/gaudi for docker-compose deployment to avoid the potential model serving issue or the missing chat-template issue using neural-chat-7b. Slow serving issue of neural-chat-7b on ICX: #1420 Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>	2025-01-20 22:47:56 +08:00
Liang Lv	0f7e5a37ac	Adapt code for dataprep microservice refactor (#1408 ) https://github.com/opea-project/GenAIComps/pull/1153 Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2025-01-20 20:37:03 +08:00
Wang, Kai Lawrence	742cb6ddd3	[ChatQnA] Switch to vLLM as default llm backend on Xeon (#1403 ) Switching from TGI to vLLM as the default LLM serving backend on Xeon for the ChatQnA example to enhance the perf. https://github.com/opea-project/GenAIExamples/issues/1213 Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>	2025-01-17 20:48:19 +08:00
Letong Han	4cabd55778	Refactor Retrievers related Examples (#1387 ) Delete redundant retrievers docker image in docker_images_list.md. Refactor Retrievers related Examples READMEs. Change all of the comps/retrievers/xxx/xxx/Dockerfile path into comps/retrievers/src/Dockerfile. Fix the Examples CI issues of PR opea-project/GenAIComps#1138. Signed-off-by: letonghan <letong.han@intel.com>	2025-01-16 14:21:48 +08:00
Liang Lv	3ca78867eb	Update example code for embedding dependency moving to 3rd_party (#1368 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2025-01-10 15:36:58 +08:00
chen, suyue	5c7a5bd850	Update Code and README for GenAIComps Refactor (#1285 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com> Signed-off-by: chensuyue <suyue.chen@intel.com> Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> Signed-off-by: letonghan <letong.han@intel.com> Signed-off-by: ZePan110 <ze.pan@intel.com> Signed-off-by: WenjiaoYue <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>	2025-01-02 20:03:26 +08:00
Sihan Chen	907b30b7fe	Refactor service names (#1199 )	2024-11-28 10:01:31 +08:00
chen, suyue	393367e9f1	Fix left issue of tgi version update (#1121 ) Signed-off-by: chensuyue <suyue.chen@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-11-13 15:42:42 +08:00
Louie Tsai	7adbba6add	Enable vLLM Profiling for ChatQnA (#1124 )	2024-11-13 11:26:31 +08:00
Letong Han	aa314f6757	[Readme] Update ChatQnA Readme for LLM Endpoint (#1086 ) Signed-off-by: letonghan <letong.han@intel.com>	2024-11-11 13:53:06 +08:00
Arthur Leung	6263b517b9	[Doc] Add steps to deploy opea services using minikube (#1058 ) Signed-off-by: Arthur Leung <arcyleung@gmail.com> Co-authored-by: Arthur Leung <arcyleung@gmail.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-11-07 13:57:34 +08:00
lvliang-intel	0306c620b5	Update TGI CPU image to latest official release 2.4.0 (#1035 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-11-04 11:28:43 +08:00
Louie Tsai	a10b4a1f1d	Address request from Issue#971 (#1018 )	2024-10-23 23:57:52 -07:00
lvliang-intel	0eedbbfce0	Update aipc ollama docker compose and readme (#984 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: chen, suyue <suyue.chen@intel.com>	2024-10-22 10:30:47 +08:00
lvliang-intel	9438d392b4	Update README for some minor issues (#1000 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2024-10-22 10:30:18 +08:00
lvliang-intel	619d941047	Set no wrapper ChatQnA as default (#891 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-10-11 13:30:45 +08:00
Steve Zhang	954a22051b	Make all xeon tgi image version consistent (#851 ) Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-09-24 11:19:37 +08:00
Letong Han	c35fe0b429	[Doc] Update ChatQnA README for Nginx Docker Image (#862 ) Signed-off-by: letonghan <letong.han@intel.com>	2024-09-23 12:25:30 +09:00
Letong Han	7eaab93d0b	[Doc] Refine ChatQnA README (#855 ) Signed-off-by: letonghan <letong.han@intel.com>	2024-09-20 11:20:20 +08:00
Neo Zhang Jianyu	bc817700b9	refactor the network port setting for AWS (#849 ) Co-authored-by: ZhangJianyu <zhang.jianyu@outlook.com>	2024-09-19 21:58:56 +08:00
lvliang-intel	bd811bd622	Add validate microservice details link (#852 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2024-09-19 21:54:32 +08:00
Letong Han	6c364487d3	[ChatQnA] Add Nginx in Docker Compose and README (#850 ) Signed-off-by: letonghan <letong.han@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-09-19 20:39:58 +08:00
XinyaoWa	2f03a3a894	Align parameters for "max_token, repetition_penalty,presence_penalty,frequency_penalty" (#726 ) Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-09-19 14:15:25 +08:00
kevinintel	3b70fb0d42	Refine the quick start of ChatQnA (#828 ) Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-09-18 22:23:22 +08:00
lvliang-intel	bceacdc804	Fix README issues (#817 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-09-18 09:50:17 +08:00
Ying Hu	87e51d5c36	Update README.md of pdf file (#804 ) Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-09-13 17:14:34 +08:00
Malini Bhandaru	558ea3bb7f	adopted tech writing style (#796 ) Signed-off-by: Malini Bhandaru <malini.bhandaru@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-09-13 09:42:59 +08:00
sri-intel	5ab27b63ff	Change megaservice path in line with new file structure (#792 ) Signed-off-by: srinarayan-srikanthan <srinarayan.srikanthan@intel.com>	2024-09-12 09:07:35 +08:00
XinyaoWa	d2bab99835	refine readme for reorg (#782 ) Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-09-11 14:57:29 +08:00
XinyaoWa	d73129cbf0	Refactor folder to support different vendors (#743 ) Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> Signed-off-by: chensuyue <suyue.chen@intel.com>	2024-09-10 23:27:19 +08:00

42 Commits