GenAIExamples

Author	SHA1	Message	Date
Liang Lv	fb514bb8ba	Add chatqna wrapper for multiple model selection (#1144 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: Ying Hu <ying.hu@intel.com> Co-authored-by: chen, suyue <suyue.chen@intel.com>	2024-11-18 10:48:09 +08:00
lvliang-intel	9ff7df9202	Use fixed version of TEI Gaudi for stability (#1101 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: Malini Bhandaru <malini.bhandaru@intel.com>	2024-11-13 10:45:50 -08:00
lvliang-intel	1ff85f6a85	Upgrade TGI Gaudi version to v2.0.6 (#1088 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: chen, suyue <suyue.chen@intel.com>	2024-11-12 14:38:22 +08:00
lvliang-intel	e3187be819	Update ChatQnA manifests using always pull image policy (#1100 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2024-11-11 14:37:14 +08:00
Arthur Leung	6263b517b9	[Doc] Add steps to deploy opea services using minikube (#1058 ) Signed-off-by: Arthur Leung <arcyleung@gmail.com> Co-authored-by: Arthur Leung <arcyleung@gmail.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-11-07 13:57:34 +08:00
lvliang-intel	0306c620b5	Update TGI CPU image to latest official release 2.4.0 (#1035 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-11-04 11:28:43 +08:00
lvliang-intel	7197286a14	Fix ChatQnA manifest default port issue (#1033 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2024-10-30 11:52:04 +08:00
WenjiaoYue	b377c2b8f8	Update manifest ui containerPort (#952 ) Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-10-17 09:42:55 +08:00
lvliang-intel	619d941047	Set no wrapper ChatQnA as default (#891 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-10-11 13:30:45 +08:00
pallavijaini0525	e2f9037344	Added the K8s yaml for vLLM support (#917 ) Signed-off-by: desaidhr <dhruv.desai@intel.com> Co-authored-by: desaidhr <dhruv.desai@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-10-10 11:08:07 +08:00
jotpalch	bd32b03e3c	Doc: Update folder path to correct location in "Deploy ChatQnA in Kubernetes" (#875 )	2024-09-26 14:38:22 +08:00
lvliang-intel	33b9d4e421	Remove redundant code and update tgi version (#871 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2024-09-25 15:33:33 +08:00
David Kinder	3e796ba73d	doc: fix missing references to README.md (#860 ) Signed-off-by: David B. Kinder <david.b.kinder@intel.com>	2024-09-24 21:40:42 +08:00
Steve Zhang	954a22051b	Make all xeon tgi image version consistent (#851 ) Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-09-24 11:19:37 +08:00
lvliang-intel	3fb60608b3	Use official tei gaudi image and update tgi gaudi version (#810 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-09-23 17:52:56 +08:00
WenjiaoYue	05f9828e77	Add nginx and UI to the ChatQnA manifest (#848 ) Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-09-19 21:04:12 +08:00
lkk	ba17031198	add tgi bf16 setup on CPU k8s. (#795 ) Co-authored-by: root <root@idc708073.jf.intel.com> Co-authored-by: Eero Tamminen <eero.t.tamminen@intel.com>	2024-09-13 19:55:57 +08:00
XinyaoWa	d2bab99835	refine readme for reorg (#782 ) Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-09-11 14:57:29 +08:00
XinyaoWa	d73129cbf0	Refactor folder to support different vendors (#743 ) Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> Signed-off-by: chensuyue <suyue.chen@intel.com>	2024-09-10 23:27:19 +08:00
Lianhao Lu	ba94e0130d	Add ui/nginx support in K8S manifest for ChatQnA/CodeGen/CodeTrans/Docsum (#773 ) Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>	2024-09-10 16:30:14 +08:00
Lianhao Lu	0629696333	K8S manifest: Update ChatQnA/CodeGen/CodeTrans/DocSum - Update ChatQnA/CodeGen/CodeTrans/DocSum k8s manifest to avoid requiring creating directory for cache model. - Add chatqna-guardrails manifest files. - Fix bug #752 introduced by PR #669 Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>	2024-09-06 16:09:42 +08:00
David Kinder	67394b88fa	doc: fix headings and indenting (#748 ) * doc: fix headings and indenting * only one H1 header (for title) is allowed * fix indenting under ordered lists Signed-off-by: David B. Kinder <david.b.kinder@intel.com>	2024-09-06 12:59:33 +08:00
WenjiaoYue	758d236463	Add chatQnA UI manifest (#669 ) * Add chatQnA UI manifest Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com> * update port Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add nginx config Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * update code * update nginx config Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com> * update nginx config Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com> * update ui IP Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com> * update yaml Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com> * update api Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com> * update env config Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com> * update env Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * update specify node Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * update node-type Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com> * update yaml Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com> * update yaml Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com> * delete nodeSelector Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * update dataprep api Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com> * add node-type Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * delete specify nodeSelector Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com> * delete useless space Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com> --------- Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-09-05 22:44:10 +08:00
Steve Zhang	4133757642	Change docs of kubernetes for curl commands in README (#661 ) * change docs for curl commands in README. Signed-off-by: zhlsunshine <huailong.zhang@intel.com> * The Namespace 'CT' is invalid. Signed-off-by: zhlsunshine <huailong.zhang@intel.com>	2024-08-27 19:36:37 +08:00
Lianhao Lu	771975510a	chatqna k8s manifest: Fixed retriever-redis v0.9 image issue (#638 ) Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>	2024-08-21 22:24:29 +08:00
Sihan Chen	6674832162	fix tgi xeon tag (#641 )	2024-08-21 22:17:07 +08:00
Lianhao Lu	01c1b7504f	Update K8S manifest for ChatQnA/CodeGen/CodeTrans/DocSum - Sync with docker-compose changes since v0.8 release - Add K8S probes Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>	2024-08-20 10:45:15 +08:00
lvliang-intel	b2771ad3f2	Using TGI official release docker image for intel cpu (#581 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2024-08-18 17:17:44 +08:00
Steve Zhang	1c23d87aa2	Add dataprep microservice to chatQnA example and the e2e test (#589 ) Signed-off-by: zhlsunshine <huailong.zhang@intel.com>	2024-08-14 14:39:46 +08:00
chen, suyue	965c13c556	rename docker compose.yaml (#446 ) Signed-off-by: chensuyue <suyue.chen@intel.com>	2024-07-26 20:03:36 +08:00
David Kinder	8d0c8fb949	doc: fix missing title H1 heading (#458 ) Signed-off-by: David B. Kinder <david.b.kinder@intel.com> Co-authored-by: Haihao Shen <haihao.shen@intel.com>	2024-07-26 09:32:54 +08:00
lvliang-intel	f4b4ac0d3a	Update TEI version v1.5 for better performance (#447 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2024-07-25 08:54:34 +08:00
Lianhao Lu	665c46ffae	Update Kubernetes manifest files for deploying ChatQnA (#445 ) Update Kubernetes manifest files for deploying ChatQnA without GMC. Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>	2024-07-24 09:59:38 +08:00
Steve Zhang	290a74fae9	Update all examples yaml files of GMC in GenAIExample (#436 ) * Update all examples yaml files of GMC in GenAIExample. Signed-off-by: zhlsunshine <huailong.zhang@intel.com>	2024-07-23 16:40:51 +08:00
Ruoyu Ying	d9946180a2	doc: fix minor issue in GMC doc (#383 ) Signed-off-by: Ruoyu Ying <ruoyu.ying@intel.com>	2024-07-18 16:21:28 +08:00
Malini Bhandaru	c37d9c82b0	Updated READMEs for kubernetes example pipelines (#353 ) * Updated READMEs for kubernetes. Signed-off-by: mkbhanda <malini.bhandaru@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Kubernetes related Readme. Signed-off-by: mkbhanda <malini.bhandaru@intel.com> --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-07-10 09:03:08 +08:00
Steve Zhang	2e62ecc18a	add docsum example e2e test for GMC. (#347 ) * add docsum example e2e test for GMC. Signed-off-by: zhlsunshine <huailong.zhang@intel.com> Co-authored-by: Yingchun Guo <yingchun.guo@intel.com> * fix curl error for docsum. Signed-off-by: zhlsunshine <huailong.zhang@intel.com> Co-authored-by: Yingchun Guo <yingchun.guo@intel.com> * change the manifest e2e yaml. Signed-off-by: zhlsunshine <huailong.zhang@intel.com> Co-authored-by: Yingchun Guo <yingchun.guo@intel.com> * change the image format. Signed-off-by: zhlsunshine <huailong.zhang@intel.com> Co-authored-by: Yingchun Guo <yingchun.guo@intel.com> * fixing image mapping error. Signed-off-by: zhlsunshine <huailong.zhang@intel.com> Co-authored-by: Yingchun Guo <yingchun.guo@intel.com> * change the gmc e2e test. Signed-off-by: zhlsunshine <huailong.zhang@intel.com> Co-authored-by: Yingchun Guo <yingchun.guo@intel.com> * accelarate the e2e test. Signed-off-by: zhlsunshine <huailong.zhang@intel.com> Co-authored-by: Yingchun Guo <yingchun.guo@intel.com> * change the gmc e2e configuration. Signed-off-by: zhlsunshine <huailong.zhang@intel.com> Co-authored-by: Yingchun Guo <yingchun.guo@intel.com> * retrigger. Signed-off-by: zhlsunshine <huailong.zhang@intel.com> Co-authored-by: Yingchun Guo <yingchun.guo@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Co-authored-by: Yingchun Guo <yingchun.guo@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Malini Bhandaru <malini.bhandaru@intel.com>	2024-06-28 03:15:51 -07:00
Malini Bhandaru	7dd0506e08	chatqna kubernetes readme. (#335 ) * chatqna kubernetes readme. Signed-off-by: mkbhanda <malini.bhandaru@intel.com>	2024-06-28 17:29:59 +08:00
Steve Zhang	afcb3a3523	Add e2e test of chatqna for genai example (#334 ) * add e2e test of chatqna for genai example. Signed-off-by: zhlsunshine <huailong.zhang@intel.com> Co-authored-by: mkbhanda <malini.bhandaru@intel.com> Co-authored-by: daisy-ycguo <yingchun.guo@intel.com>	2024-06-27 19:05:59 +08:00
sri-intel	44c5cb71fa	Updated ReadMe for ChatQnA (#264 ) * Update gaudi README.md Modified path and added cd commands for copy paste instructions. * Update xeon README.md Added cd commands for reproducibility. * Update README.md	2024-06-07 15:58:41 +08:00
Ying Chun Guo	0c7f23cdc9	Remove hard coded port in ChatQnA to avoid conflict (#254 ) Signed-off-by: Yingchun Guo <yingchun.guo@intel.com>	2024-06-04 17:36:24 +08:00
chen, suyue	7eb402e95b	Revert hf_token setting (#226 ) Signed-off-by: chensuyue <suyue.chen@intel.com>	2024-05-30 23:12:03 +08:00
lvliang-intel	c54705e57e	Replace Reranking model with BGE base (#218 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2024-05-30 16:09:09 +08:00
lvliang-intel	9d3bc0e00c	Fix huggingface hub token environment variable (#214 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2024-05-30 16:04:59 +08:00
Tian, Feng	169fe96332	GenAIExample code structure reorg (#207 ) Signed-off-by: Tian, Feng <feng.tian@intel.com> Signed-off-by: chensuyue <suyue.chen@intel.com>	2024-05-30 00:13:49 +08:00
lvliang-intel	ee6debe54f	Remove model info in curl request (#209 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2024-05-29 19:12:27 +08:00
Ying Chun Guo	3255392dff	improve ChatQnA manifests (#213 ) Signed-off-by: Yingchun Guo <yingchun.guo@intel.com>	2024-05-29 18:53:21 +08:00
leslieluyu	f106dd9f03	manifests for deploy ChatQnA into Kubernetes(Gaudi&Xeon) (#191 ) * upload manifests for deploy ChatQnA on kubernetes Signed-off-by: leslieluyu <leslie.luyu@gmail.com> * add index for deploy into kubernetes Signed-off-by: leslieluyu <leslie.luyu@gmail.com> * modify pre-commit-config.yaml for charts Signed-off-by: leslieluyu <leslie.luyu@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: leslieluyu <leslie.luyu@gmail.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-05-29 11:01:55 +08:00
lvliang-intel	a6b3caf128	Refactor example code (#183 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com> Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com> Signed-off-by: chensuyue <suyue.chen@intel.com>	2024-05-24 13:32:14 +08:00

49 Commits