36 Commits

Author SHA1 Message Date
Steve Zhang
954a22051b Make all xeon tgi image version consistent (#851)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-24 11:19:37 +08:00
lvliang-intel
3fb60608b3 Use official tei gaudi image and update tgi gaudi version (#810)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-23 17:52:56 +08:00
WenjiaoYue
05f9828e77 Add nginx and UI to the ChatQnA manifest (#848)
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-19 21:04:12 +08:00
lkk
ba17031198 add tgi bf16 setup on CPU k8s. (#795)
Co-authored-by: root <root@idc708073.jf.intel.com>
Co-authored-by: Eero Tamminen <eero.t.tamminen@intel.com>
2024-09-13 19:55:57 +08:00
XinyaoWa
d2bab99835 refine readme for reorg (#782)
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-11 14:57:29 +08:00
XinyaoWa
d73129cbf0 Refactor folder to support different vendors (#743)
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Signed-off-by: chensuyue <suyue.chen@intel.com>
2024-09-10 23:27:19 +08:00
Lianhao Lu
ba94e0130d Add ui/nginx support in K8S manifest for ChatQnA/CodeGen/CodeTrans/Docsum (#773)
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>
2024-09-10 16:30:14 +08:00
Lianhao Lu
0629696333 K8S manifest: Update ChatQnA/CodeGen/CodeTrans/DocSum
- Update ChatQnA/CodeGen/CodeTrans/DocSum k8s manifest
  to avoid requiring creating directory for cache model.

- Add chatqna-guardrails manifest files.

- Fix bug #752 introduced by PR #669

Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>
2024-09-06 16:09:42 +08:00
David Kinder
67394b88fa doc: fix headings and indenting (#748)
* doc: fix headings and indenting
* only one H1 header (for title) is allowed
* fix indenting under ordered lists

Signed-off-by: David B. Kinder <david.b.kinder@intel.com>
2024-09-06 12:59:33 +08:00
WenjiaoYue
758d236463 Add chatQnA UI manifest (#669)
* Add chatQnA UI manifest

Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>

* update port

Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add nginx config

Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update code

* update nginx config

Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>

* update nginx config

Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>

* update ui IP

Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>

* update yaml

Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>

* update api

Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>

* update env config

Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>

* update env

Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update specify node

Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update node-type

Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>

* update yaml

Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>

* update yaml

Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>

* delete nodeSelector

Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update dataprep api

Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>

* add node-type

Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* delete specify nodeSelector

Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>

* delete useless space

Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>

---------

Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-05 22:44:10 +08:00
Steve Zhang
4133757642 Change docs of kubernetes for curl commands in README (#661)
* change docs for curl commands in README.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>

* The Namespace 'CT' is invalid.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
2024-08-27 19:36:37 +08:00
Lianhao Lu
771975510a chatqna k8s manifest: Fixed retriever-redis v0.9 image issue (#638)
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>
2024-08-21 22:24:29 +08:00
Sihan Chen
6674832162 fix tgi xeon tag (#641)
2024-08-21 22:17:07 +08:00
Lianhao Lu
01c1b7504f Update K8S manifest for ChatQnA/CodeGen/CodeTrans/DocSum
- Sync with docker-compose changes since v0.8 release

- Add K8S probes

Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>
2024-08-20 10:45:15 +08:00
lvliang-intel
b2771ad3f2 Using TGI official release docker image for intel cpu (#581)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-08-18 17:17:44 +08:00
Steve Zhang
1c23d87aa2 Add dataprep microservice to chatQnA example and the e2e test (#589)
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
2024-08-14 14:39:46 +08:00
chen, suyue
965c13c556 rename docker compose.yaml (#446)
Signed-off-by: chensuyue <suyue.chen@intel.com>
2024-07-26 20:03:36 +08:00
David Kinder
8d0c8fb949 doc: fix missing title H1 heading (#458)
Signed-off-by: David B. Kinder <david.b.kinder@intel.com>
Co-authored-by: Haihao Shen <haihao.shen@intel.com>
2024-07-26 09:32:54 +08:00
lvliang-intel
f4b4ac0d3a Update TEI version v1.5 for better performance (#447)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-07-25 08:54:34 +08:00
Lianhao Lu
665c46ffae Update Kubernetes manifest files for deploying ChatQnA (#445)
Update Kubernetes manifest files for deploying ChatQnA without
GMC.

Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>
2024-07-24 09:59:38 +08:00
Steve Zhang
290a74fae9 Update all examples yaml files of GMC in GenAIExample (#436)
* Update all examples yaml files of GMC in GenAIExample.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
2024-07-23 16:40:51 +08:00
Ruoyu Ying
d9946180a2 doc: fix minor issue in GMC doc (#383)
Signed-off-by: Ruoyu Ying <ruoyu.ying@intel.com>
2024-07-18 16:21:28 +08:00
Malini Bhandaru
c37d9c82b0 Updated READMEs for kubernetes example pipelines (#353)
* Updated READMEs for kubernetes.
Signed-off-by: mkbhanda <malini.bhandaru@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Kubernetes related Readme.
Signed-off-by: mkbhanda <malini.bhandaru@intel.com>

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-07-10 09:03:08 +08:00
Steve Zhang
2e62ecc18a add docsum example e2e test for GMC. (#347)
* add docsum example e2e test for GMC.

Signed-off-by: zhlsunshine <huailong.zhang@intel.com>

Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>

* fix curl error for docsum.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>

Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>

* change the manifest e2e yaml.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>

Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>

* change the image format.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>

Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>

* fixing image mapping error.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>

Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>

* change the gmc e2e test.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>

Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>

* accelerate the e2e test.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>

Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>

* change the gmc e2e configuration.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>

Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>

* retrigger.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>

Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Malini Bhandaru <malini.bhandaru@intel.com>
2024-06-28 03:15:51 -07:00
Malini Bhandaru
7dd0506e08 chatqna kubernetes readme. (#335)
* chatqna kubernetes readme.
Signed-off-by: mkbhanda <malini.bhandaru@intel.com>
2024-06-28 17:29:59 +08:00
Steve Zhang
afcb3a3523 Add e2e test of chatqna for genai example (#334)
* add e2e test of chatqna for genai example.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Co-authored-by: mkbhanda <malini.bhandaru@intel.com>
Co-authored-by: daisy-ycguo <yingchun.guo@intel.com>
2024-06-27 19:05:59 +08:00
sri-intel
44c5cb71fa Updated ReadMe for ChatQnA (#264)
* Update gaudi README.md

Modified path and added cd commands for copy paste instructions.

* Update xeon README.md

Added cd commands for reproducibility.

* Update README.md
2024-06-07 15:58:41 +08:00
Ying Chun Guo
0c7f23cdc9 Remove hard coded port in ChatQnA to avoid conflict (#254)
Signed-off-by: Yingchun Guo <yingchun.guo@intel.com>
2024-06-04 17:36:24 +08:00
chen, suyue
7eb402e95b Revert hf_token setting (#226)
Signed-off-by: chensuyue <suyue.chen@intel.com>
2024-05-30 23:12:03 +08:00
lvliang-intel
c54705e57e Replace Reranking model with BGE base (#218)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-05-30 16:09:09 +08:00
lvliang-intel
9d3bc0e00c Fix huggingface hub token environment variable (#214)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-05-30 16:04:59 +08:00
Tian, Feng
169fe96332 GenAIExample code structure reorg (#207)
Signed-off-by: Tian, Feng <feng.tian@intel.com>
Signed-off-by: chensuyue <suyue.chen@intel.com>
2024-05-30 00:13:49 +08:00
lvliang-intel
ee6debe54f Remove model info in curl request (#209)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-05-29 19:12:27 +08:00
Ying Chun Guo
3255392dff improve ChatQnA manifests (#213)
Signed-off-by: Yingchun Guo <yingchun.guo@intel.com>
2024-05-29 18:53:21 +08:00
leslieluyu
f106dd9f03 manifests for deploy ChatQnA into Kubernetes(Gaudi&Xeon) (#191)
* upload manifests for deploy ChatQnA on kubernetes

Signed-off-by: leslieluyu <leslie.luyu@gmail.com>

* add index for deploy into kubernetes

Signed-off-by: leslieluyu <leslie.luyu@gmail.com>

* modify pre-commit-config.yaml for charts

Signed-off-by: leslieluyu <leslie.luyu@gmail.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: leslieluyu <leslie.luyu@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-05-29 11:01:55 +08:00
lvliang-intel
a6b3caf128 Refactor example code (#183)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
Signed-off-by: chensuyue <suyue.chen@intel.com>
2024-05-24 13:32:14 +08:00