dolpher
87e3c0f59f
Update chatqna values file changes ( #1844 )
...
Signed-off-by: Dolpher Du <dolpher.du@intel.com >
2025-04-21 09:38:07 +08:00
Liang Lv
13dd27e6d5
Update vLLM parameter max-seq-len-to-capture ( #1809 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2025-04-15 14:27:12 +08:00
dolpher
46ebb78aa3
Sync values yaml file for 1.3 release ( #1748 )
...
Signed-off-by: Dolpher Du <dolpher.du@intel.com >
2025-04-08 22:39:40 +08:00
Xiaotian Chen
1bd56af994
Update TGI image versions ( #1625 )
...
Signed-off-by: xiaotia3 <xiaotian.chen@intel.com >
2025-04-01 11:27:51 +08:00
xiguiw
87baeb833d
Update TEI docker image to 1.6 ( #1650 )
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
2025-03-27 09:40:22 +08:00
dolpher
9b0f98be8b
Update ChatQnA helm chart README. ( #1459 )
...
Signed-off-by: Dolpher Du <dolpher.du@intel.com >
2025-01-23 10:54:39 +08:00
dolpher
ee0e5cc8d9
Sync value files from GenAIInfra ( #1428 )
...
All gaudi values updated with extra flags.
Added helm support for 2 new examples Text2Image and SearchQnA. Minor fix for llm-uservice.
Signed-off-by: Dolpher Du <dolpher.du@intel.com >
2025-01-22 17:44:11 +08:00
chen, suyue
7a54064d65
remove Dockerfile.wrapper ( #1429 )
...
Remove Dockerfile.wrapper, it's not used anymore and no test cover this Dockerfile. So remove this Dockerfile to avoid regression.
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-01-20 20:49:18 +08:00
Liang Lv
0f7e5a37ac
Adapt code for dataprep microservice refactor ( #1408 )
...
https://github.com/opea-project/GenAIComps/pull/1153
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2025-01-20 20:37:03 +08:00
Letong Han
4cabd55778
Refactor Retrievers related Examples ( #1387 )
...
Delete redundant retrievers docker image in docker_images_list.md.
Refactor Retrievers related Examples READMEs.
Change all of the comps/retrievers/xxx/xxx/Dockerfile path into comps/retrievers/src/Dockerfile.
Fix the Examples CI issues of PR opea-project/GenAIComps#1138 .
Signed-off-by: letonghan <letong.han@intel.com >
2025-01-16 14:21:48 +08:00
dolpher
c795ef2203
Add helm deployment instructions for GenAIExamples ( #1373 )
...
Add helm deployment instructions for ChatQnA, AgentQnA, AudioQnA, CodeTrans, DocSum, FaqGen and VisualQnA
Signed-off-by: Dolpher Du <dolpher.du@intel.com >
2025-01-10 09:55:31 +08:00
Liang Lv
b3c405a5f6
Adapt example code for guardrails refactor ( #1360 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-01-08 14:35:23 +08:00
WenjiaoYue
9970605460
Adapt refactor comps ( #1340 )
...
Signed-off-by: WenjiaoYue
2025-01-08 10:36:24 +08:00
Pranav Singh
d2b49bbc82
[ChatQNA] Fix K8s Deployment for CPU/HPU ( #1274 )
...
Signed-off-by: Pranav Singh <pranav.singh@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-01-07 13:45:09 +08:00
ZePan110
aa5c91d7ee
Check duplicated dockerfile ( #1289 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-01-06 17:30:12 +08:00
Wang, Kai Lawrence
4c01e14642
[ChatQnA] Remove enforce-eager to enable HPU graphs for better vLLM perf ( #1210 )
...
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com >
2024-12-10 13:19:15 +08:00
sgurunat
031cf6e1ff
ChatQnA: Update kubernetes xeon chatqna remote inference and svelte UI ( #1215 )
...
Signed-off-by: sgurunat <gurunath.s@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-12-04 22:40:03 +08:00
sgurunat
3299e5c9f5
ChatQnA: Update chatqna-vllm-remote-inference ( #1224 )
...
Signed-off-by: sgurunat <gurunath.s@intel.com >
2024-12-04 22:33:27 +08:00
ZePan110
8808b51e42
Rename image name XXX-hpu to XXX-gaudi ( #1154 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2024-11-19 22:18:41 +08:00
jotpalch
c3e6f43ece
Fix command in README for deploying ChatQnA application ( #1156 )
2024-11-18 22:59:22 +08:00
sgurunat
56f770cb28
ChatQnA with Remote Inference Endpoints (Kubernetes) ( #1149 )
...
Signed-off-by: sgurunat <gurunath.s@intel.com >
Co-authored-by: chen, suyue <suyue.chen@intel.com >
2024-11-18 20:06:17 +08:00
Liang Lv
fb514bb8ba
Add chatqna wrapper for multiple model selection ( #1144 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: Ying Hu <ying.hu@intel.com >
Co-authored-by: chen, suyue <suyue.chen@intel.com >
2024-11-18 10:48:09 +08:00
lvliang-intel
9ff7df9202
Use fixed version of TEI Gaudi for stability ( #1101 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: Malini Bhandaru <malini.bhandaru@intel.com >
2024-11-13 10:45:50 -08:00
lvliang-intel
1ff85f6a85
Upgrade TGI Gaudi version to v2.0.6 ( #1088 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: chen, suyue <suyue.chen@intel.com >
2024-11-12 14:38:22 +08:00
lvliang-intel
e3187be819
Update ChatQnA manifests using always pull image policy ( #1100 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2024-11-11 14:37:14 +08:00
Arthur Leung
6263b517b9
[Doc] Add steps to deploy opea services using minikube ( #1058 )
...
Signed-off-by: Arthur Leung <arcyleung@gmail.com >
Co-authored-by: Arthur Leung <arcyleung@gmail.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-07 13:57:34 +08:00
lvliang-intel
0306c620b5
Update TGI CPU image to latest official release 2.4.0 ( #1035 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-04 11:28:43 +08:00
lvliang-intel
7197286a14
Fix ChatQnA manifest default port issue ( #1033 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2024-10-30 11:52:04 +08:00
WenjiaoYue
b377c2b8f8
Update manifest ui containerPort ( #952 )
...
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-17 09:42:55 +08:00
lvliang-intel
619d941047
Set no wrapper ChatQnA as default ( #891 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-11 13:30:45 +08:00
pallavijaini0525
e2f9037344
Added the K8s yaml for vLLM support ( #917 )
...
Signed-off-by: desaidhr <dhruv.desai@intel.com >
Co-authored-by: desaidhr <dhruv.desai@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-10 11:08:07 +08:00
jotpalch
bd32b03e3c
Doc: Update folder path to correct location in "Deploy ChatQnA in Kubernetes" ( #875 )
2024-09-26 14:38:22 +08:00
lvliang-intel
33b9d4e421
Remove redundant code and update tgi version ( #871 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2024-09-25 15:33:33 +08:00
David Kinder
3e796ba73d
doc: fix missing references to README.md ( #860 )
...
Signed-off-by: David B. Kinder <david.b.kinder@intel.com >
2024-09-24 21:40:42 +08:00
Steve Zhang
954a22051b
Make all xeon tgi image version consistent ( #851 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-24 11:19:37 +08:00
lvliang-intel
3fb60608b3
Use official tei gaudi image and update tgi gaudi version ( #810 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-23 17:52:56 +08:00
WenjiaoYue
05f9828e77
Add nginx and UI to the ChatQnA manifest ( #848 )
...
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-19 21:04:12 +08:00
lkk
ba17031198
add tgi bf16 setup on CPU k8s. ( #795 )
...
Co-authored-by: root <root@idc708073.jf.intel.com >
Co-authored-by: Eero Tamminen <eero.t.tamminen@intel.com >
2024-09-13 19:55:57 +08:00
XinyaoWa
d2bab99835
refine readme for reorg ( #782 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-11 14:57:29 +08:00
XinyaoWa
d73129cbf0
Refactor folder to support different vendors ( #743 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
Signed-off-by: chensuyue <suyue.chen@intel.com >
2024-09-10 23:27:19 +08:00
Lianhao Lu
ba94e0130d
Add ui/nginx support in K8S manifest for ChatQnA/CodeGen/CodeTrans/Docsum ( #773 )
...
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com >
2024-09-10 16:30:14 +08:00
Lianhao Lu
0629696333
K8S manifest: Update ChatQnA/CodeGen/CodeTrans/DocSum
...
- Update ChatQnA/CodeGen/CodeTrans/DocSum k8s manifest
to avoid requiring creating directory for cache model.
- Add chatqna-guardrails manifest files.
- Fix bug #752 introduced by PR #669
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com >
2024-09-06 16:09:42 +08:00
David Kinder
67394b88fa
doc: fix headings and indenting ( #748 )
...
* doc: fix headings and indenting
* only one H1 header (for title) is allowed
* fix indenting under ordered lists
Signed-off-by: David B. Kinder <david.b.kinder@intel.com >
2024-09-06 12:59:33 +08:00
WenjiaoYue
758d236463
Add chatQnA UI manifest ( #669 )
...
* Add chatQnA UI manifest
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* update port
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add nginx config
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update code
* update nginx config
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* update nginx config
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* update ui IP
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* update yaml
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* update api
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* update env config
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* update env
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update specify node
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update node-type
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* update yaml
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* update yaml
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* delete nodeSelector
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update dataprep api
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* add node-type
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* delete specify nodeSelector
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
* delete useless space
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
---------
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-05 22:44:10 +08:00
Steve Zhang
4133757642
Change docs of kubernetes for curl commands in README ( #661 )
...
* change docs for curl commands in README.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com >
* The Namespace 'CT' is invalid.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com >
2024-08-27 19:36:37 +08:00
Lianhao Lu
771975510a
chatqna k8s manifest: Fixed retriever-redis v0.9 image issue ( #638 )
...
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com >
2024-08-21 22:24:29 +08:00
Sihan Chen
6674832162
fix tgi xeon tag ( #641 )
2024-08-21 22:17:07 +08:00
Lianhao Lu
01c1b7504f
Update K8S manifest for ChatQnA/CodeGen/CodeTrans/DocSum
...
- Sync with docker-compose changes since v0.8 release
- Add K8S probes
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com >
2024-08-20 10:45:15 +08:00
lvliang-intel
b2771ad3f2
Using TGI official release docker image for intel cpu ( #581 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2024-08-18 17:17:44 +08:00
Steve Zhang
1c23d87aa2
Add dataprep microservice to chatQnA example and the e2e test ( #589 )
...
Signed-off-by: zhlsunshine <huailong.zhang@intel.com >
2024-08-14 14:39:46 +08:00