GenAIExamples

Author	SHA1	Message	Date
Louie Tsai	7adbba6add	Enable vLLM Profiling for ChatQnA (#1124 )	2024-11-13 11:26:31 +08:00
pallavijaini0525	0d52c2f003	Pinecone update to Readme and docker compose for ChatQnA (#540 ) Signed-off-by: pallavi jaini <pallavi.jaini@intel.com> Signed-off-by: AI Workloads <aigoldrush1@g2-r3-2.iind.intel.com> Signed-off-by: Pallavi Jaini <pallavi,jaini@intel.com> Signed-off-by: Pallavi Jaini <pallavi.jaini@intel.com> Signed-off-by: root <root@test-pjaini.535545281608.us-region-2.idcservice.net> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: AI Workloads <aigoldrush1@g2-r3-2.iind.intel.com> Co-authored-by: Pallavi Jaini <pallavi,jaini@intel.com> Co-authored-by: root <root@test-pjaini.535545281608.us-region-2.idcservice.net> Co-authored-by: chen, suyue <suyue.chen@intel.com>	2024-11-13 09:32:37 +08:00
lvliang-intel	1ff85f6a85	Upgrade TGI Gaudi version to v2.0.6 (#1088 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: chen, suyue <suyue.chen@intel.com>	2024-11-12 14:38:22 +08:00
bjzhjing	f7a7f8aa3f	Fix typo (#1117 ) Signed-off-by: Cathy Zhang <cathy.zhang@intel.com>	2024-11-12 09:54:05 +08:00
lvliang-intel	e3187be819	Update ChatQnA manifests using always pull image policy (#1100 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2024-11-11 14:37:14 +08:00
Sihan Chen	abd9d12937	Fix non stream case (#1115 ) Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-11-11 14:18:42 +08:00
bjzhjing	a7353bbaa4	Refine performance directory (#1017 ) Signed-off-by: Cathy Zhang <cathy.zhang@intel.com>	2024-11-11 13:58:46 +08:00
Letong Han	aa314f6757	[Readme] Update ChatQnA Readme for LLM Endpoint (#1086 ) Signed-off-by: letonghan <letong.han@intel.com>	2024-11-11 13:53:06 +08:00
Wang, Kai Lawrence	f7026773b8	[ChatQnA] Fix the no_proxy setting for gpu example (#1078 ) Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>	2024-11-08 22:27:51 +08:00
XinyaoWa	40386d9bd6	remove vllm-on-ray (#1084 ) Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>	2024-11-08 13:01:48 +08:00
lvliang-intel	4635a927fa	Make embedding run on CPU for aligning with Gaudi performance benchmark (#1057 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: chen, suyue <suyue.chen@intel.com>	2024-11-07 17:39:34 +08:00
XinyaoWa	e9b164505e	align vllm hpu version to latest vllm-fork (#1061 ) Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>	2024-11-07 14:08:56 +08:00
Arthur Leung	6263b517b9	[Doc] Add steps to deploy opea services using minikube (#1058 ) Signed-off-by: Arthur Leung <arcyleung@gmail.com> Co-authored-by: Arthur Leung <arcyleung@gmail.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-11-07 13:57:34 +08:00
Wang, Kai Lawrence	944ae47948	[ChatQnA] Fix the service connection issue on GPU and modify the emb backend (#1059 ) Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>	2024-11-06 10:22:21 +08:00
xiguiw	a0921f127f	[Doc] Fix broken build instruction (#1063 ) Signed-off-by: Wang, Xigui <xigui.wang@intel.com>	2024-11-05 13:35:12 +08:00
lvliang-intel	0306c620b5	Update TGI CPU image to latest official release 2.4.0 (#1035 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-11-04 11:28:43 +08:00
lkk	3372b9d480	update accuracy embedding endpoint for no wrapper (#1056 ) Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-11-04 09:18:49 +08:00
XinyaoWa	c65d7d40fb	fix vllm output in chatqna (#1038 ) Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>	2024-11-01 09:26:57 +08:00
chen, suyue	0f5a9c4a5e	Fix ChatQnA manifest test issue on Xeon (#1044 ) Signed-off-by: chensuyue <suyue.chen@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-10-31 14:23:17 +08:00
lvliang-intel	7197286a14	Fix ChatQnA manifest default port issue (#1033 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2024-10-30 11:52:04 +08:00
xiguiw	95b58b51fa	Fix AIPC docker container network issue (#1021 ) Signed-off-by: Wang, Xigui <xigui.wang@intel.com>	2024-10-25 10:46:57 +08:00
Louie Tsai	a10b4a1f1d	Address request from Issue#971 (#1018 )	2024-10-23 23:57:52 -07:00
RuijingGuo	def39cfcdc	setup ollama service in aipc docker compose (#1008 ) Signed-off-by: Guo Ruijing <ruijing.guo@intel.com>	2024-10-23 14:22:48 +08:00
lvliang-intel	0eedbbfce0	Update aipc ollama docker compose and readme (#984 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: chen, suyue <suyue.chen@intel.com>	2024-10-22 10:30:47 +08:00
lvliang-intel	9438d392b4	Update README for some minor issues (#1000 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2024-10-22 10:30:18 +08:00
lvliang-intel	3c164f3aa2	Make rerank run on gaudi for hpu docker compose (#980 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2024-10-18 21:49:36 +08:00
CharleneHu-42	7669c42085	Update ChatQnA README to add benchmark launcher (#958 ) Signed-off-by: CharleneHu-42 <yabai.hu@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Yi Yao <yi.a.yao@intel.com>	2024-10-18 13:33:20 +08:00
lvliang-intel	256b58c07e	Replace environment variables with service name for ChatQnA (#977 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2024-10-18 11:31:24 +08:00
ylg	37c74b232c	Update ChatQnA yaml and set retriever's TEI_EMBEDDING_ENDPOINT (#953 ) Signed-off-by: longguang.yue <bigclouds@163.com>	2024-10-17 16:58:47 +08:00
Sihan Chen	4a265abb73	Fix top_n rerank docs (#976 )	2024-10-17 15:49:16 +08:00
Sihan Chen	b0487fe92b	fix chatqna accuracy issue with incorrect penalty (#974 )	2024-10-17 15:48:44 +08:00
chen, suyue	eeced9b31c	Enhance CI/CD image build (#961 ) Signed-off-by: chensuyue <suyue.chen@intel.com>	2024-10-17 14:33:58 +08:00
WenjiaoYue	b377c2b8f8	Update manifest ui containerPort (#952 ) Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-10-17 09:42:55 +08:00
lvliang-intel	c930bea172	Add missing nginx microservice and fix frontend test (#951 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2024-10-16 13:29:31 +08:00
lvliang-intel	778afb50ac	Clean no wrapper image in performance benchmark manifests (#955 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2024-10-15 18:21:53 +08:00
lkk	088ab98f31	update examples accuracy (#941 ) Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-10-14 13:20:50 +08:00
xiguiw	b056ce6617	[Doc] Update ChatQnA AIPC README (#935 ) Signed-off-by: Wang, Xigui <xigui.wang@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-10-12 11:04:53 +08:00
xiguiw	773c32b38b	Fix AIPC retriever and UI error (#933 ) Signed-off-by: Wang, Xigui <xigui.wang@intel.com>	2024-10-11 13:35:27 +08:00
lvliang-intel	619d941047	Set no wrapper ChatQnA as default (#891 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-10-11 13:30:45 +08:00
Abolfazl Shahbazi	b71a12d424	Remove 'vim' from Dockerfiles (#924 ) Signed-off-by: Abolfazl Shahbazi <abolfazl.shahbazi@intel.com>	2024-10-10 18:24:31 -07:00
feng-intel	ae10712fe8	doc: Update ChatQnA/benchmark/performance doc (#930 )	2024-10-10 16:30:40 +08:00
pallavijaini0525	e2f9037344	Added the K8s yaml for vLLM support (#917 ) Signed-off-by: desaidhr <dhruv.desai@intel.com> Co-authored-by: desaidhr <dhruv.desai@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-10-10 11:08:07 +08:00
Zhenzhong1	d16c80e493	[ChatQnA] manage your own ChatQnA pipelines. (#878 ) Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-09-30 17:01:44 +09:00
sri-intel	2de1bfc5bb	Bug fix for issue #881 (#882 ) Signed-off-by: srinarayan-srikanthan <srinarayan.srikanthan@intel.com>	2024-09-27 13:06:02 +08:00
sri-intel	75df2c9979	docker install instruction for csp (#843 ) Signed-off-by: sri <srinarayan.srikanthan@intel.com> Signed-off-by: srinarayan-srikanthan <srinarayan.srikanthan@intel.com>	2024-09-27 13:00:10 +08:00
jotpalch	bd32b03e3c	Doc: Update folder path to correct location in "Deploy ChatQnA in Kubernetes" (#875 )	2024-09-26 14:38:22 +08:00
xiguiw	9d0b49c2d6	[doc] Update AIPC document (#874 ) Signed-off-by: Wang, Xigui <xigui.wang@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-09-26 14:28:16 +08:00
David Kinder	99c10933b4	doc: fix doc heading (#873 ) Signed-off-by: David B. Kinder <david.b.kinder@intel.com>	2024-09-26 12:33:57 +09:00
Zhenzhong1	c1038d2193	[ChatQnA] Deploy ChatQnA for benchmarking with different configurations. (#870 )	2024-09-25 16:47:44 +08:00
lvliang-intel	33b9d4e421	Remove redundant code and update tgi version (#871 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2024-09-25 15:33:33 +08:00

1 2 3 4 5 ...

332 Commits