XinyaoWa
d2bab99835
refine readme for reorg ( #782 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-11 14:57:29 +08:00
XinyaoWa
d73129cbf0
Refactor folder to support different vendors ( #743 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
Signed-off-by: chensuyue <suyue.chen@intel.com >
2024-09-10 23:27:19 +08:00
Lianhao Lu
ba94e0130d
Add ui/nginx support in K8S manifest for ChatQnA/CodeGen/CodeTrans/Docsum ( #773 )
...
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com >
2024-09-10 16:30:14 +08:00
Lianhao Lu
0629696333
K8S manifest: Update ChatQnA/CodeGen/CodeTrans/DocSum
...
- Update ChatQnA/CodeGen/CodeTrans/DocSum k8s manifest
to avoid requiring creating directory for cache model.
- Add chatqna-guardrails manifest files.
- Fix bug #752 introduced by PR #669
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com >
2024-09-06 16:09:42 +08:00
David Kinder
67394b88fa
doc: fix headings and indenting ( #748 )
...
* doc: fix headings and indenting
* only one H1 header (for title) is allowed
* fix indenting under ordered lists
Signed-off-by: David B. Kinder <david.b.kinder@intel.com >
2024-09-06 12:59:33 +08:00
Steve Zhang
f5f1e323bb
Revert the LLM model for kubernetes GMS ( #675 )
...
* revert the LLM model to meta-llama/CodeLlama-7b-hf
Signed-off-by: zhlsunshine <huailong.zhang@intel.com >
2024-08-30 13:54:42 +08:00
Yao Qing
814164dc4f
[Codegen] Refine readme to prompt users on how to change the model. ( #695 )
...
* [Codegen] Refine readme to prompt users on how to change the model.
Signed-off-by: Yao, Qing <qing.yao@intel.com >
* [Codegen] Add section Required Model.
Signed-off-by: Yao, Qing <qing.yao@intel.com >
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
---------
Signed-off-by: Yao, Qing <qing.yao@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-08-29 22:17:03 +08:00
Steve Zhang
4133757642
Change docs of kubernetes for curl commands in README ( #661 )
...
* change docs for curl commands in README.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com >
* The Namespace 'CT' is invalid.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com >
2024-08-27 19:36:37 +08:00
Dina Suehiro Jones
c25063f4bb
Minor fixes for CodeGen Xeon and Gaudi Kubernetes codegen.yaml and doc updates ( #613 )
...
* Minor fixes for CodeGen Xeon and Gaudi Kubernetes codegen.yaml and doc updates
Signed-off-by: dmsuehir <dina.s.jones@intel.com >
2024-08-23 16:04:57 +08:00
Kefei Zhang
06cb308611
change codegen tgi model ( #646 )
...
* change codegen tgi model
Signed-off-by: KfreeZ <kefei.zhang@intel.com >
2024-08-22 11:42:57 +08:00
Sihan Chen
6674832162
fix tgi xeon tag ( #641 )
2024-08-21 22:17:07 +08:00
Lianhao Lu
01c1b7504f
Update K8S manifest for ChatQnA/CodeGen/CodeTrans/DocSum
...
- Sync with docker-compose changes since v0.8 release
- Add K8S probes
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com >
2024-08-20 10:45:15 +08:00
lvliang-intel
b2771ad3f2
Using TGI official release docker image for intel cpu ( #581 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2024-08-18 17:17:44 +08:00
David Kinder
8d0c8fb949
doc: fix missing title H1 heading ( #458 )
...
Signed-off-by: David B. Kinder <david.b.kinder@intel.com >
Co-authored-by: Haihao Shen <haihao.shen@intel.com >
2024-07-26 09:32:54 +08:00
Jaswanth Karani
1b48e54a3d
code-gen-react-ui ( #417 )
...
* CodeGen: Manifest to deploy CodeGen with ReactUI into Kubernetes
Include manifest to deploy CodeGen pipeline with ReactUI into
Kubernetes. Include as well README file for the steps.
Signed-off-by: Yeoh, Hoong Tee <hoong.tee.yeoh@intel.com >
* code-gen-react-ui
Signed-off-by: jaswanth8888 <karani.jaswanth@gmail.com >
* made changes as per PR suggestions
Signed-off-by: jaswanth8888 <karani.jaswanth@gmail.com >
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* updated readme
Signed-off-by: jaswanth8888 <karani.jaswanth@gmail.com >
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update base image
Signed-off-by: jaswanth8888 <karani.jaswanth@gmail.com >
* updated codegn dockerfile react
Signed-off-by: jaswanth8888 <karani.jaswanth@gmail.com >
* updated wait time
Signed-off-by: jaswanth8888 <karani.jaswanth@gmail.com >
* updated as per PR comments
Signed-off-by: jaswanth8888 <karani.jaswanth@gmail.com >
* aded react ui in gaudi test file
Signed-off-by: jaswanth8888 <karani.jaswanth@gmail.com >
* fixed PR comments
Signed-off-by: jaswanth8888 <karani.jaswanth@gmail.com >
* added docker pull for text-generation-model
Signed-off-by: jaswanth8888 <karani.jaswanth@gmail.com >
* removed unused files
Signed-off-by: jaswanth8888 <karani.jaswanth@gmail.com >
---------
Signed-off-by: Yeoh, Hoong Tee <hoong.tee.yeoh@intel.com >
Signed-off-by: jaswanth8888 <karani.jaswanth@gmail.com >
Co-authored-by: Yeoh, Hoong Tee <hoong.tee.yeoh@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com >
2024-07-24 09:01:16 -07:00
Steve Zhang
290a74fae9
Update all examples yaml files of GMC in GenAIExample ( #436 )
...
* Update all examples yaml files of GMC in GenAIExample.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com >
2024-07-23 16:40:51 +08:00
Lianhao Lu
2f9397e012
Update Kubernetes manifest files for CodeGen ( #441 )
...
Updated Kubernetes manifest files for CodeGen to reflect latest changes:
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com >
2024-07-23 14:07:42 +08:00
Malini Bhandaru
c37d9c82b0
Updated READMEs for kubernetes example pipelines ( #353 )
...
* Updated READMEs for kubernetes.
Signed-off-by: mkbhanda <malini.bhandaru@intel.com >
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Kubernetes related Readme.
Signed-off-by: mkbhanda <malini.bhandaru@intel.com >
---------
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-07-10 09:03:08 +08:00
Steve Zhang
960cf38d33
Add codegen e2e test of genaiexample ( #337 )
...
Signed-off-by: zhlsunshine <huailong.zhang@intel.com >
2024-06-28 10:42:00 +08:00
zehao-intel
b160010ad3
Modify UI Port for CodeGen to avoid unsafe_port issue ( #249 )
...
Signed-off-by: zehao-intel <zehao.huang@intel.com >
2024-05-31 20:55:28 +08:00
chen, suyue
7eb402e95b
Revert hf_token setting ( #226 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
2024-05-30 23:12:03 +08:00
lvliang-intel
9d3bc0e00c
Fix huggingface hub token environment variable ( #214 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2024-05-30 16:04:59 +08:00
Tian, Feng
169fe96332
GenAIExample code structure reorg ( #207 )
...
Signed-off-by: Tian, Feng <feng.tian@intel.com >
Signed-off-by: chensuyue <suyue.chen@intel.com >
2024-05-30 00:13:49 +08:00
dolpher
eadaacc41c
Manifests to deploy CodeGen into Kubernetes ( #204 )
...
Signed-off-by: Dolpher Du <Dolpher.Du@intel.com >
2024-05-29 14:06:29 +08:00
lvliang-intel
a6b3caf128
Refactor example code ( #183 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
Signed-off-by: chensuyue <suyue.chen@intel.com >
2024-05-24 13:32:14 +08:00