dolpher
ee0e5cc8d9
Sync value files from GenAIInfra (#1428)
All gaudi values updated with extra flags.
Added helm support for 2 new examples Text2Image and SearchQnA. Minor fix for llm-uservice.
Signed-off-by: Dolpher Du <dolpher.du@intel.com>
2025-01-22 17:44:11 +08:00
XinyaoWa
ff1310b11a
Refactor docsum (#1336)
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
2025-01-13 15:49:48 +08:00
dolpher
c795ef2203
Add helm deployment instructions for GenAIExamples (#1373)
Add helm deployment instructions for ChatQnA, AgentQnA, AudioQnA, CodeTrans, DocSum, FaqGen and VisualQnA
Signed-off-by: Dolpher Du <dolpher.du@intel.com>
2025-01-10 09:55:31 +08:00
Sihan Chen
a01729a5c2
Refactor DocSum example (#1286)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-12-26 14:45:17 +08:00
XinyaoWa
0cdeb946e4
DocSum Manifest support multimedia (#1158)
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-18 18:46:01 +08:00
lvliang-intel
1ff85f6a85
Upgrade TGI Gaudi version to v2.0.6 (#1088)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com>
2024-11-12 14:38:22 +08:00
lvliang-intel
0306c620b5
Update TGI CPU image to latest official release 2.4.0 (#1035)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-04 11:28:43 +08:00
lvliang-intel
7197286a14
Fix ChatQnA manifest default port issue (#1033)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-10-30 11:52:04 +08:00
David Kinder
3e796ba73d
doc: fix missing references to README.md (#860)
Signed-off-by: David B. Kinder <david.b.kinder@intel.com>
2024-09-24 21:40:42 +08:00
Steve Zhang
954a22051b
Make all xeon tgi image version consistent (#851)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-24 11:19:37 +08:00
lvliang-intel
3fb60608b3
Use official tei gaudi image and update tgi gaudi version (#810)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-23 17:52:56 +08:00
XinyaoWa
d2bab99835
refine readme for reorg (#782)
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-11 14:57:29 +08:00
XinyaoWa
d73129cbf0
Refactor folder to support different vendors (#743)
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Signed-off-by: chensuyue <suyue.chen@intel.com>
2024-09-10 23:27:19 +08:00
Lianhao Lu
ba94e0130d
Add ui/nginx support in K8S manifest for ChatQnA/CodeGen/CodeTrans/Docsum (#773)
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>
2024-09-10 16:30:14 +08:00
Lianhao Lu
0629696333
K8S manifest: Update ChatQnA/CodeGen/CodeTrans/DocSum
- Update ChatQnA/CodeGen/CodeTrans/DocSum k8s manifest
to avoid requiring creating directory for cache model.
- Add chatqna-guardrails manifest files.
- Fix bug #752 introduced by PR #669
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>
2024-09-06 16:09:42 +08:00
David Kinder
67394b88fa
doc: fix headings and indenting (#748)
* doc: fix headings and indenting
* only one H1 header (for title) is allowed
* fix indenting under ordered lists
Signed-off-by: David B. Kinder <david.b.kinder@intel.com>
2024-09-06 12:59:33 +08:00
XinyaoWa
d487093d10
Add default model in readme for FaqGen and DocSum (#693)
* update default model in readme for DocSum
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
2024-08-30 12:40:36 +08:00
Steve Zhang
4133757642
Change docs of kubernetes for curl commands in README (#661)
* change docs for curl commands in README.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
* The Namespace 'CT' is invalid.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
2024-08-27 19:36:37 +08:00
Sihan Chen
6674832162
fix tgi xeon tag (#641)
2024-08-21 22:17:07 +08:00
Lianhao Lu
01c1b7504f
Update K8S manifest for ChatQnA/CodeGen/CodeTrans/DocSum
- Sync with docker-compose changes since v0.8 release
- Add K8S probes
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>
2024-08-20 10:45:15 +08:00
lvliang-intel
b2771ad3f2
Using TGI official release docker image for intel cpu (#581)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-08-18 17:17:44 +08:00
David Kinder
8d0c8fb949
doc: fix missing title H1 heading (#458)
Signed-off-by: David B. Kinder <david.b.kinder@intel.com>
Co-authored-by: Haihao Shen <haihao.shen@intel.com>
2024-07-26 09:32:54 +08:00
Jaswanth Karani
edf0d14c95
added doc sum react-ui (#418)
Signed-off-by: Yeoh, Hoong Tee <hoong.tee.yeoh@intel.com>
Signed-off-by: jaswanth8888 <karani.jaswanth@gmail.com>
2024-07-25 12:12:36 +08:00
Steve Zhang
290a74fae9
Update all examples yaml files of GMC in GenAIExample (#436)
* Update all examples yaml files of GMC in GenAIExample.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
2024-07-23 16:40:51 +08:00
Lianhao Lu
83146320aa
Add Kubernetes manifest files for deploying DocSum (#434)
* Add Kubernetes manifest files for deploying DocSum
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>
2024-07-23 13:29:32 +08:00
Ruoyu Ying
d9946180a2
doc: fix minor issue in GMC doc (#383)
Signed-off-by: Ruoyu Ying <ruoyu.ying@intel.com>
2024-07-18 16:21:28 +08:00
Malini Bhandaru
c37d9c82b0
Updated READMEs for kubernetes example pipelines (#353)
* Updated READMEs for kubernetes.
Signed-off-by: mkbhanda <malini.bhandaru@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Kubernetes related Readme.
Signed-off-by: mkbhanda <malini.bhandaru@intel.com>
---------
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-07-10 09:03:08 +08:00
Steve Zhang
2e62ecc18a
add docsum example e2e test for GMC. (#347)
* add docsum example e2e test for GMC.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>
* fix curl error for docsum.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>
* change the manifest e2e yaml.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>
* change the image format.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>
* fixing image mapping error.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>
* change the gmc e2e test.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>
* accelarate the e2e test.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>
* change the gmc e2e configuration.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>
* retrigger.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
---------
Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Malini Bhandaru <malini.bhandaru@intel.com>
2024-06-28 03:15:51 -07:00
lvliang-intel
a6b3caf128
Refactor example code (#183)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
Signed-off-by: chensuyue <suyue.chen@intel.com>
2024-05-24 13:32:14 +08:00