David Kinder
d97882ec8e
doc: fix incorrect path to png image files ( #783 )
...
Signed-off-by: David B. Kinder <david.b.kinder@intel.com >
2024-09-11 13:22:44 +08:00
feng-intel
63406dc050
Yaml: add comments to specify gaudi device ids. ( #753 )
...
Signed-off-by: fengding <feng1.ding@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-11 12:02:18 +08:00
Lianhao Lu
ff6f841ec0
README: fix broken links ( #781 )
...
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com >
2024-09-11 09:41:01 +08:00
XinyaoWa
d73129cbf0
Refactor folder to support different vendors ( #743 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
Signed-off-by: chensuyue <suyue.chen@intel.com >
2024-09-10 23:27:19 +08:00
Lianhao Lu
ba94e0130d
Add ui/nginx support in K8S manifest for ChatQnA/CodeGen/CodeTrans/Docsum ( #773 )
...
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com >
2024-09-10 16:30:14 +08:00
Letong Han
aebc23f5ae
[ChatQnA] Update README for ModelScope ( #770 )
...
Signed-off-by: letonghan <letong.han@intel.com >
2024-09-10 13:50:36 +08:00
Zhenzhong1
36fb9a987d
[ChatQnA] Update benchmarking manifests ( #766 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-10 11:07:48 +08:00
shaohef
a2745b22a7
Provide the method to get nke-10k-2023.pdf ( #769 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-09 14:52:38 +08:00
Sihan Chen
ebe6b473e9
Add megaservice definition without microservice wrappers ( #700 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-06 18:49:28 +08:00
Lianhao Lu
0629696333
K8S manifest: Update ChatQnA/CodeGen/CodeTrans/DocSum
...
- Update ChatQnA/CodeGen/CodeTrans/DocSum k8s manifest
to avoid requiring creating directory for cache model.
- Add chatqna-guardrails manifest files.
- Fix bug #752 introduced by PR #669
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com >
2024-09-06 16:09:42 +08:00
David Kinder
67394b88fa
doc: fix headings and indenting ( #748 )
...
* doc: fix headings and indenting
* only one H1 header (for title) is allowed
* fix indenting under ordered lists
Signed-off-by: David B. Kinder <david.b.kinder@intel.com >
2024-09-06 12:59:33 +08:00
chen, suyue
947936ed7b
Update v0.9 RAG release data ( #747 )
...
* run both xeon and gaudi when both hardware detect
Signed-off-by: chensuyue <suyue.chen@intel.com >
* add v0.9 RAG release data
Signed-off-by: chensuyue <suyue.chen@intel.com >
* update system summary
Signed-off-by: chensuyue <suyue.chen@intel.com >
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
---------
Signed-off-by: chensuyue <suyue.chen@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-06 09:55:55 +08:00
WenjiaoYue
758d236463
Add chatQnA UI manifest ( #669 )
...
* Add chatQnA UI manifest
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* update port
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add nginx config
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update code
* update nginx config
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* update nginx config
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* update ui IP
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* update yaml
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* update api
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* update env config
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* update env
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update specify node
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update node-type
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* update yaml
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* update yaml
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* delete nodeSelector
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update dataprep api
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* add node-type
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* delete specify nodeSelector
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* delete useless space
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
---------
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-05 22:44:10 +08:00
Zhenzhong1
ac3486038c
[ChatQnA] udate OOB & Tuned manifests ( #738 )
...
* update OOB manifests
* update tgi parameters
* update OOB manifests for w/o rerank
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update tgi parameters
* update tgi parameters for v0.9 w/o rerank
* update OOB manifests 2.0.4->2.0.1 for w/o rerank
* update tuned manifests
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update tuned manifests
---------
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-05 22:23:51 +08:00
chen, suyue
e0bc5f2a4d
update logs from standard cd perf workflow ( #733 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-05 14:58:24 +08:00
Letong Han
6b617d6743
[ChatQnA] Update README for without Rerank Pipeline ( #740 )
...
* update readme for chatqna w/o rerank
Signed-off-by: letonghan <letong.han@intel.com >
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
---------
Signed-off-by: letonghan <letong.han@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-05 14:49:12 +08:00
huiyan2021
43b2ae59a1
Fix readme for nv gpu ( #727 )
2024-09-05 08:33:07 +08:00
Zhenzhong1
6730b242cc
[ChatQnA] Update retrieval & dataprep manifests ( #717 )
...
* modify tgi hyperparameters
* upgrade tgi 2.0.1 to 2.0.4
* Update dataprep-microservice_run.yaml
* Update retrieval-microservice_run.yaml
* Update retrieval-microservice_run.yaml
* Update dataprep-microservice_run.yaml
* Update dataprep-microservice_run.yaml
* Update dataprep-microservice_run.yaml
* Update retrieval-microservice_run.yaml
* Update retrieval-microservice_run.yaml
2024-09-04 19:50:46 +08:00
Letong Han
4a51874e4d
update readme for w/o rerank ( #731 )
...
Signed-off-by: letonghan <letong.han@intel.com >
2024-09-04 15:01:30 +08:00
Letong Han
55d287dfcf
update readme to fix input length ( #720 )
...
Signed-off-by: letonghan <letong.han@intel.com >
2024-09-03 19:01:28 +08:00
Zhenzhong1
3563f5db6b
[ChatQnA]Update manifests ( #716 )
...
* update manifests for v0.9
2024-09-03 15:24:54 +08:00
bjzhjing
8c40204eda
react-ui: Add support to display Chinese ( #713 )
...
* react-ui: Add support to display Chinese
llm-tgi microservice from GenAIComps has encoded each text, so Chinese
response will be shown as hexadecimal in react UI. Add support to decode
and display the response in Chinese. Also return raw response if no
pattern found.
Signed-off-by: Cathy Zhang <cathy.zhang@intel.com >
Signed-off-by: Ruoyu Ying <ruoyu.ying@intel.com >
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
---------
Signed-off-by: Cathy Zhang <cathy.zhang@intel.com >
Signed-off-by: Ruoyu Ying <ruoyu.ying@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-03 15:07:14 +08:00
Letong Han
afc3341156
Refine ChatQnA README for TGI ( #715 )
...
* update chatqna readme for tgi
Signed-off-by: letonghan <letong.han@intel.com >
* update log block
Signed-off-by: letonghan <letong.han@intel.com >
---------
Signed-off-by: letonghan <letong.han@intel.com >
2024-09-03 15:06:50 +08:00
sri-intel
e5ec38c796
Update port in set_env.sh for TGI endpoint ( #649 )
2024-09-03 15:05:44 +08:00
David Kinder
c6d811ab11
doc: remove invalid code block language ( #705 )
...
Signed-off-by: David B. Kinder <david.b.kinder@intel.com >
2024-09-03 10:10:56 +08:00
Louie Tsai
2ef83fc67b
Update README.md and remove some open-source details ( #682 )
...
According to TCEs' feedback, don't need to have those open-source project details in the flowchart.
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
2024-08-31 22:44:39 +08:00
Ying Chun Guo
2a6af6491a
update mount path in xeon k8s ( #696 )
...
Signed-off-by: Yingchun Guo <yingchun.guo@intel.com >
2024-08-30 15:09:00 +08:00
Letong Han
2a2ff45e2b
Explain Default Model in ChatQnA and CodeTrans READMEs ( #694 )
...
* explain default model in CodeTrans READMEs
Signed-off-by: letonghan <letong.han@intel.com >
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* explain default model in ChatQnA READMEs
Signed-off-by: letonghan <letong.han@intel.com >
* add required models
Signed-off-by: letonghan <letong.han@intel.com >
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
---------
Signed-off-by: letonghan <letong.han@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-08-29 21:22:59 +08:00
WenjiaoYue
32afb6501c
update env ( #678 )
...
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
2024-08-29 10:29:35 +08:00
Abolfazl Shahbazi
1874dfd148
Remove 'vim' from all Dockerfiles ( #663 )
...
Signed-off-by: Abolfazl Shahbazi <abolfazl.shahbazi@intel.com >
Co-authored-by: lvliang-intel <liang1.lv@intel.com >
2024-08-28 08:30:49 -07:00
Steve Zhang
4133757642
Change docs of kubernetes for curl commands in README ( #661 )
...
* change docs for curl commands in README.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com >
* The Namespace 'CT' is invalid.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com >
2024-08-27 19:36:37 +08:00
lvliang-intel
10c81f1c57
Update ollama run command ( #668 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2024-08-27 14:54:53 +08:00
xiguiw
dad8eb4b82
[Doc] Update ChatQnA flow chart ( #542 )
...
* Update flow chart
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
* Updated Flowchart
Signed-off-by: srinarayan-srikanthan <srinarayan.srikanthan@intel.com >
---------
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
Signed-off-by: srinarayan-srikanthan <srinarayan.srikanthan@intel.com >
Co-authored-by: Louie Tsai <louie.tsai@intel.com >
2024-08-26 12:20:21 -07:00
lvliang-intel
af21e94a29
Add benchmark README for ChatQnA ( #662 )
...
* Add benchmark README for ChatQnA
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add benchmark.yaml
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update yaml path
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
* fix preci issue
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update title
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
---------
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-08-26 22:39:57 +08:00
chen, suyue
f78aa9ee2f
add env for chatqna vllm ( #655 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
2024-08-23 22:10:10 +08:00
Ying Hu
9657f7bc83
Update set_env.sh ( #644 )
2024-08-22 16:02:55 +08:00
chen, suyue
dfaf47978d
optimize CI log format ( #648 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
2024-08-22 15:08:59 +08:00
Zhenzhong1
e6b4fff05c
Update the number of microservice replicas for OPEA v0.9 ( #645 )
2024-08-22 09:48:47 +08:00
lvliang-intel
a54ffd2c1e
Support ChatQnA pipeline without rerank microservice ( #643 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2024-08-22 09:26:54 +08:00
chen, suyue
5d39506c5c
Add env params for chatqna xeon test ( #642 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
2024-08-21 22:53:32 +08:00
Lianhao Lu
771975510a
chatqna k8s manifest: Fixed retriever-redis v0.9 image issue ( #638 )
...
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com >
2024-08-21 22:24:29 +08:00
Sihan Chen
6674832162
fix tgi xeon tag ( #641 )
2024-08-21 22:17:07 +08:00
chen, suyue
db2d2bd1a1
fix chatqna guardrails ( #615 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
Signed-off-by: letonghan <letong.han@intel.com >
2024-08-20 22:15:23 +08:00
Zhenzhong1
ba78b4c994
update manifests for v0.9 ( #632 )
...
* update model HF TOKEN variables & reranking name for v0.9
2024-08-20 14:35:14 +08:00
Lianhao Lu
01c1b7504f
Update K8S manifest for ChatQnA/CodeGen/CodeTrans/DocSum
...
- Sync with docker-compose changes since v0.8 release
- Add K8S probes
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com >
2024-08-20 10:45:15 +08:00
Ying Chun Guo
4fd3517f23
update benchmark manifest to fix errors ( #626 )
...
Signed-off-by: Yingchun Guo <yingchun.guo@intel.com >
2024-08-19 21:59:26 +08:00
Zhenzhong1
08f57fa54a
update manifests for v0.9 ( #623 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-08-19 15:55:04 +08:00
lvliang-intel
b2771ad3f2
Using TGI official release docker image for intel cpu ( #581 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2024-08-18 17:17:44 +08:00
Ying Chun Guo
71363a6b9d
change microservice tags in CD workflow ( #612 )
...
Signed-off-by: Yingchun Guo <yingchun.guo@intel.com >
2024-08-16 21:57:28 +08:00
Letong Han
040d2b7fd9
update port for dataprep in set_env.sh ( #606 )
...
Signed-off-by: letonghan <letong.han@intel.com >
2024-08-16 18:15:33 +08:00