Liang Lv
fb514bb8ba
Add chatqna wrapper for multiple model selection ( #1144 )
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: Ying Hu <ying.hu@intel.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com>
2024-11-18 10:48:09 +08:00
lvliang-intel
9ff7df9202
Use fixed version of TEI Gaudi for stability ( #1101 )
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: Malini Bhandaru <malini.bhandaru@intel.com>
2024-11-13 10:45:50 -08:00
lvliang-intel
1ff85f6a85
Upgrade TGI Gaudi version to v2.0.6 ( #1088 )
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com>
2024-11-12 14:38:22 +08:00
lvliang-intel
e3187be819
Update ChatQnA manifests using always pull image policy ( #1100 )
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-11-11 14:37:14 +08:00
Arthur Leung
6263b517b9
[Doc] Add steps to deploy opea services using minikube ( #1058 )
Signed-off-by: Arthur Leung <arcyleung@gmail.com>
Co-authored-by: Arthur Leung <arcyleung@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-07 13:57:34 +08:00
lvliang-intel
0306c620b5
Update TGI CPU image to latest official release 2.4.0 ( #1035 )
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-04 11:28:43 +08:00
lvliang-intel
7197286a14
Fix ChatQnA manifest default port issue ( #1033 )
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-10-30 11:52:04 +08:00
WenjiaoYue
b377c2b8f8
Update manifest ui containerPort ( #952 )
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-17 09:42:55 +08:00
lvliang-intel
619d941047
Set no wrapper ChatQnA as default ( #891 )
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-11 13:30:45 +08:00
pallavijaini0525
e2f9037344
Added the K8s yaml for vLLM support ( #917 )
Signed-off-by: desaidhr <dhruv.desai@intel.com>
Co-authored-by: desaidhr <dhruv.desai@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-10 11:08:07 +08:00
jotpalch
bd32b03e3c
Doc: Update folder path to correct location in "Deploy ChatQnA in Kubernetes" ( #875 )
2024-09-26 14:38:22 +08:00
lvliang-intel
33b9d4e421
Remove redundant code and update tgi version ( #871 )
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-09-25 15:33:33 +08:00
David Kinder
3e796ba73d
doc: fix missing references to README.md ( #860 )
Signed-off-by: David B. Kinder <david.b.kinder@intel.com>
2024-09-24 21:40:42 +08:00
Steve Zhang
954a22051b
Make all xeon tgi image version consistent ( #851 )
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-24 11:19:37 +08:00
lvliang-intel
3fb60608b3
Use official tei gaudi image and update tgi gaudi version ( #810 )
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-23 17:52:56 +08:00
WenjiaoYue
05f9828e77
Add nginx and UI to the ChatQnA manifest ( #848 )
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-19 21:04:12 +08:00
lkk
ba17031198
add tgi bf16 setup on CPU k8s. ( #795 )
Co-authored-by: root <root@idc708073.jf.intel.com>
Co-authored-by: Eero Tamminen <eero.t.tamminen@intel.com>
2024-09-13 19:55:57 +08:00
XinyaoWa
d2bab99835
refine readme for reorg ( #782 )
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-11 14:57:29 +08:00
XinyaoWa
d73129cbf0
Refactor folder to support different vendors ( #743 )
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Signed-off-by: chensuyue <suyue.chen@intel.com>
2024-09-10 23:27:19 +08:00
Lianhao Lu
ba94e0130d
Add ui/nginx support in K8S manifest for ChatQnA/CodeGen/CodeTrans/Docsum ( #773 )
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>
2024-09-10 16:30:14 +08:00
Lianhao Lu
0629696333
K8S manifest: Update ChatQnA/CodeGen/CodeTrans/DocSum
- Update ChatQnA/CodeGen/CodeTrans/DocSum k8s manifest to avoid requiring creation of a directory for the model cache.
- Add chatqna-guardrails manifest files.
- Fix bug #752 introduced by PR #669
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>
2024-09-06 16:09:42 +08:00
David Kinder
67394b88fa
doc: fix headings and indenting ( #748 )
* doc: fix headings and indenting
* only one H1 header (for title) is allowed
* fix indenting under ordered lists
Signed-off-by: David B. Kinder <david.b.kinder@intel.com>
2024-09-06 12:59:33 +08:00
WenjiaoYue
758d236463
Add chatQnA UI manifest ( #669 )
* Add chatQnA UI manifest
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
* update port
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add nginx config
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update code
* update nginx config
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
* update nginx config
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
* update ui IP
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
* update yaml
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
* update api
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
* update env config
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
* update env
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update specify node
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update node-type
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
* update yaml
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
* update yaml
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
* delete nodeSelector
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update dataprep api
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
* add node-type
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* delete specify nodeSelector
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
* delete useless space
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
---------
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-05 22:44:10 +08:00
Steve Zhang
4133757642
Change docs of kubernetes for curl commands in README ( #661 )
* change docs for curl commands in README.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
* The Namespace 'CT' is invalid.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
2024-08-27 19:36:37 +08:00
Lianhao Lu
771975510a
chatqna k8s manifest: Fixed retriever-redis v0.9 image issue ( #638 )
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>
2024-08-21 22:24:29 +08:00
Sihan Chen
6674832162
fix tgi xeon tag ( #641 )
2024-08-21 22:17:07 +08:00
Lianhao Lu
01c1b7504f
Update K8S manifest for ChatQnA/CodeGen/CodeTrans/DocSum
- Sync with docker-compose changes since v0.8 release
- Add K8S probes
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>
2024-08-20 10:45:15 +08:00
lvliang-intel
b2771ad3f2
Using TGI official release docker image for intel cpu ( #581 )
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-08-18 17:17:44 +08:00
Steve Zhang
1c23d87aa2
Add dataprep microservice to chatQnA example and the e2e test ( #589 )
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
2024-08-14 14:39:46 +08:00
chen, suyue
965c13c556
rename docker compose.yaml ( #446 )
Signed-off-by: chensuyue <suyue.chen@intel.com>
2024-07-26 20:03:36 +08:00
David Kinder
8d0c8fb949
doc: fix missing title H1 heading ( #458 )
Signed-off-by: David B. Kinder <david.b.kinder@intel.com>
Co-authored-by: Haihao Shen <haihao.shen@intel.com>
2024-07-26 09:32:54 +08:00
lvliang-intel
f4b4ac0d3a
Update TEI version v1.5 for better performance ( #447 )
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-07-25 08:54:34 +08:00
Lianhao Lu
665c46ffae
Update Kubernetes manifest files for deploying ChatQnA ( #445 )
Update Kubernetes manifest files for deploying ChatQnA without GMC.
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>
2024-07-24 09:59:38 +08:00
Steve Zhang
290a74fae9
Update all examples yaml files of GMC in GenAIExample ( #436 )
* Update all examples yaml files of GMC in GenAIExample.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
2024-07-23 16:40:51 +08:00
Ruoyu Ying
d9946180a2
doc: fix minor issue in GMC doc ( #383 )
Signed-off-by: Ruoyu Ying <ruoyu.ying@intel.com>
2024-07-18 16:21:28 +08:00
Malini Bhandaru
c37d9c82b0
Updated READMEs for kubernetes example pipelines ( #353 )
* Updated READMEs for kubernetes.
Signed-off-by: mkbhanda <malini.bhandaru@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Kubernetes related Readme.
Signed-off-by: mkbhanda <malini.bhandaru@intel.com>
---------
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-07-10 09:03:08 +08:00
Steve Zhang
2e62ecc18a
add docsum example e2e test for GMC. ( #347 )
* add docsum example e2e test for GMC.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>
* fix curl error for docsum.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>
* change the manifest e2e yaml.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>
* change the image format.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>
* fixing image mapping error.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>
* change the gmc e2e test.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>
* accelerate the e2e test.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>
* change the gmc e2e configuration.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>
* retrigger.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
---------
Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Malini Bhandaru <malini.bhandaru@intel.com>
2024-06-28 03:15:51 -07:00
Malini Bhandaru
7dd0506e08
chatqna kubernetes readme. ( #335 )
* chatqna kubernetes readme.
Signed-off-by: mkbhanda <malini.bhandaru@intel.com>
2024-06-28 17:29:59 +08:00
Steve Zhang
afcb3a3523
Add e2e test of chatqna for genai example ( #334 )
* add e2e test of chatqna for genai example.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Co-authored-by: mkbhanda <malini.bhandaru@intel.com>
Co-authored-by: daisy-ycguo <yingchun.guo@intel.com>
2024-06-27 19:05:59 +08:00
sri-intel
44c5cb71fa
Updated ReadMe for ChatQnA ( #264 )
* Update gaudi README.md
Modified path and added cd commands for copy paste instructions.
* Update xeon README.md
Added cd commands for reproducibility.
* Update README.md
2024-06-07 15:58:41 +08:00
Ying Chun Guo
0c7f23cdc9
Remove hard coded port in ChatQnA to avoid conflict ( #254 )
Signed-off-by: Yingchun Guo <yingchun.guo@intel.com>
2024-06-04 17:36:24 +08:00
chen, suyue
7eb402e95b
Revert hf_token setting ( #226 )
Signed-off-by: chensuyue <suyue.chen@intel.com>
2024-05-30 23:12:03 +08:00
lvliang-intel
c54705e57e
Replace Reranking model with BGE base ( #218 )
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-05-30 16:09:09 +08:00
lvliang-intel
9d3bc0e00c
Fix huggingface hub token environment variable ( #214 )
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-05-30 16:04:59 +08:00
Tian, Feng
169fe96332
GenAIExample code structure reorg ( #207 )
Signed-off-by: Tian, Feng <feng.tian@intel.com>
Signed-off-by: chensuyue <suyue.chen@intel.com>
2024-05-30 00:13:49 +08:00
lvliang-intel
ee6debe54f
Remove model info in curl request ( #209 )
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-05-29 19:12:27 +08:00
Ying Chun Guo
3255392dff
improve ChatQnA manifests ( #213 )
Signed-off-by: Yingchun Guo <yingchun.guo@intel.com>
2024-05-29 18:53:21 +08:00
leslieluyu
f106dd9f03
manifests for deploy ChatQnA into Kubernetes(Gaudi&Xeon) ( #191 )
* upload manifests for deploy ChatQnA on kubernetes
Signed-off-by: leslieluyu <leslie.luyu@gmail.com>
* add index for deploy into kubernetes
Signed-off-by: leslieluyu <leslie.luyu@gmail.com>
* modify pre-commit-config.yaml for charts
Signed-off-by: leslieluyu <leslie.luyu@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
---------
Signed-off-by: leslieluyu <leslie.luyu@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-05-29 11:01:55 +08:00
lvliang-intel
a6b3caf128
Refactor example code ( #183 )
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
Signed-off-by: chensuyue <suyue.chen@intel.com>
2024-05-24 13:32:14 +08:00