GenAIExamples

Author	SHA1	Message	Date
Liang Lv	13dd27e6d5	Update vLLM parameter max-seq-len-to-capture (#1809) Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2025-04-15 14:27:12 +08:00
dolpher	46ebb78aa3	Sync values yaml file for 1.3 release (#1748) Signed-off-by: Dolpher Du <dolpher.du@intel.com>	2025-04-08 22:39:40 +08:00
dolpher	ee0e5cc8d9	Sync value files from GenAIInfra (#1428) All gaudi values updated with extra flags. Added helm support for 2 new examples Text2Image and SearchQnA. Minor fix for llm-uservice. Signed-off-by: Dolpher Du <dolpher.du@intel.com>	2025-01-22 17:44:11 +08:00
dolpher	c795ef2203	Add helm deployment instructions for GenAIExamples (#1373) Add helm deployment instructions for ChatQnA, AgentQnA, AudioQnA, CodeTrans, DocSum, FaqGen and VisualQnA Signed-off-by: Dolpher Du <dolpher.du@intel.com>	2025-01-10 09:55:31 +08:00
ZePan110	aa5c91d7ee	Check duplicated dockerfile (#1289) Signed-off-by: ZePan110 <ze.pan@intel.com>	2025-01-06 17:30:12 +08:00
lvliang-intel	1ff85f6a85	Upgrade TGI Gaudi version to v2.0.6 (#1088) Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: chen, suyue <suyue.chen@intel.com>	2024-11-12 14:38:22 +08:00
lvliang-intel	0306c620b5	Update TGI CPU image to latest official release 2.4.0 (#1035) Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-11-04 11:28:43 +08:00
lvliang-intel	7197286a14	Fix ChatQnA manifest default port issue (#1033) Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2024-10-30 11:52:04 +08:00
XinyaoWa	a2afce1675	update codetrans default model (#1015) Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-10-28 09:11:54 +08:00
David Kinder	3e796ba73d	doc: fix missing references to README.md (#860) Signed-off-by: David B. Kinder <david.b.kinder@intel.com>	2024-09-24 21:40:42 +08:00
lvliang-intel	3fb60608b3	Use official tei gaudi image and update tgi gaudi version (#810) Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-09-23 17:52:56 +08:00
XinyaoWa	d2bab99835	refine readme for reorg (#782) Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-09-11 14:57:29 +08:00
XinyaoWa	d73129cbf0	Refactor folder to support different vendors (#743) Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> Signed-off-by: chensuyue <suyue.chen@intel.com>	2024-09-10 23:27:19 +08:00
Lianhao Lu	ba94e0130d	Add ui/nginx support in K8S manifest for ChatQnA/CodeGen/CodeTrans/Docsum (#773) Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>	2024-09-10 16:30:14 +08:00
Lianhao Lu	0629696333	K8S manifest: Update ChatQnA/CodeGen/CodeTrans/DocSum - Update ChatQnA/CodeGen/CodeTrans/DocSum k8s manifest to avoid requiring creating directory for cache model. - Add chatqna-guardrails manifest files. - Fix bug #752 introduced by PR #669 Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>	2024-09-06 16:09:42 +08:00
David Kinder	67394b88fa	doc: fix headings and indenting (#748) * doc: fix headings and indenting * only one H1 header (for title) is allowed * fix indenting under ordered lists Signed-off-by: David B. Kinder <david.b.kinder@intel.com>	2024-09-06 12:59:33 +08:00
Letong Han	2a2ff45e2b	Explain Default Model in ChatQnA and CodeTrans READMEs (#694) * explain default model in CodeTrans READMEs Signed-off-by: letonghan <letong.han@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * explain default model in ChatQnA READMEs Signed-off-by: letonghan <letong.han@intel.com> * add required models Signed-off-by: letonghan <letong.han@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: letonghan <letong.han@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-08-29 21:22:59 +08:00
Letong Han	6a679ba80f	Add Nginx - k8s manifest in CodeTrans (#610) Signed-off-by: letonghan <letong.han@intel.com>	2024-08-29 17:32:30 +08:00
Sihan Chen	6674832162	fix tgi xeon tag (#641)	2024-08-21 22:17:07 +08:00
Lianhao Lu	01c1b7504f	Update K8S manifest for ChatQnA/CodeGen/CodeTrans/DocSum - Sync with docker-compose changes since v0.8 release - Add K8S probes Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>	2024-08-20 10:45:15 +08:00
lvliang-intel	b2771ad3f2	Using TGI official release docker image for intel cpu (#581) Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2024-08-18 17:17:44 +08:00
David Kinder	8d0c8fb949	doc: fix missing title H1 heading (#458) Signed-off-by: David B. Kinder <david.b.kinder@intel.com> Co-authored-by: Haihao Shen <haihao.shen@intel.com>	2024-07-26 09:32:54 +08:00
Steve Zhang	290a74fae9	Update all examples yaml files of GMC in GenAIExample (#436) * Update all examples yaml files of GMC in GenAIExample. Signed-off-by: zhlsunshine <huailong.zhang@intel.com>	2024-07-23 16:40:51 +08:00
Lianhao Lu	c9548d7921	Add Kubernetes manifest files for deploying CodeTrans (#435) Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>	2024-07-23 13:29:59 +08:00
Malini Bhandaru	c37d9c82b0	Updated READMEs for kubernetes example pipelines (#353) * Updated READMEs for kubernetes. Signed-off-by: mkbhanda <malini.bhandaru@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Kubernetes related Readme. Signed-off-by: mkbhanda <malini.bhandaru@intel.com> --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-07-10 09:03:08 +08:00
Steve Zhang	295b81823c	Add codetrans example test for genaiexample (#339) * add codetrans example for genaiexample. Signed-off-by: zhlsunshine <huailong.zhang@intel.com>	2024-06-28 11:59:20 +08:00
Tian, Feng	169fe96332	GenAIExample code structure reorg (#207) Signed-off-by: Tian, Feng <feng.tian@intel.com> Signed-off-by: chensuyue <suyue.chen@intel.com>	2024-05-30 00:13:49 +08:00
lvliang-intel	a6b3caf128	Refactor example code (#183) Signed-off-by: lvliang-intel <liang1.lv@intel.com> Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com> Signed-off-by: chensuyue <suyue.chen@intel.com>	2024-05-24 13:32:14 +08:00

28 Commits