Commit Graph

314 Commits

Author SHA1 Message Date
chen, suyue
a2437e83e7 update compose.yaml fix the test issue (#467)
Signed-off-by: chensuyue <suyue.chen@intel.com>
v0.8
2024-07-28 23:36:39 +08:00
lvliang-intel
1b3398902b Add Grafana dashboard support (#466)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-07-26 20:43:58 +08:00
David Kinder
07921a98d0 doc: fix headings and broken image reference (#464)
* First (and only) H1 heading should be the document title
* image reference was broken

Signed-off-by: David B. Kinder <david.b.kinder@intel.com>
2024-07-26 20:27:00 +08:00
David Kinder
42104fe117 doc: fix broken image references (#463)
Signed-off-by: David B. Kinder <david.b.kinder@intel.com>
2024-07-26 20:26:45 +08:00
David Kinder
1ac696bfa9 doc: fix broken images (#461)
path to images is a bit off

Signed-off-by: David B. Kinder <david.b.kinder@intel.com>
2024-07-26 20:26:29 +08:00
Sihan Chen
8032ce60f6 add guardrails after LLM in ChatQnA (#456)
* add guardrails out

* change name

* add env
2024-07-26 20:13:46 +08:00
Zhenzhong1
b61c2abd61 upload manifests (#454)
* uploaded manifests

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-07-26 20:12:54 +08:00
chen, suyue
965c13c556 rename docker compose.yaml (#446)
Signed-off-by: chensuyue <suyue.chen@intel.com>
2024-07-26 20:03:36 +08:00
huiyan2021
8f9bcd4b3c Update image reference url in readme (#465)
2024-07-26 17:34:48 +08:00
ZePan110
6b5489a36e Add released docker_images_list.md (#432)
Signed-off-by: zepan <ze.pan@intel.com>
2024-07-26 16:42:39 +08:00
David Kinder
8d0c8fb949 doc: fix missing title H1 heading (#458)
Signed-off-by: David B. Kinder <david.b.kinder@intel.com>
Co-authored-by: Haihao Shen <haihao.shen@intel.com>
2024-07-26 09:32:54 +08:00
dependabot[bot]
535dcc36e5 Bump torch from 2.1.0 to 2.2.0 in /AudioQnA/deprecated/docker/asr (#459)
Signed-off-by: dependabot[bot] <support@github.com>
2024-07-25 22:28:54 +08:00
Jaswanth Karani
edf0d14c95 added doc sum react-ui (#418)
Signed-off-by: Yeoh, Hoong Tee <hoong.tee.yeoh@intel.com>
Signed-off-by: jaswanth8888 <karani.jaswanth@gmail.com>
2024-07-25 12:12:36 +08:00
Letong Han
43bd8f2e4b Refine Example READMEs (#457)
* refine retrieval part in readmes

Signed-off-by: letonghan <letong.han@intel.com>

* unify command

Signed-off-by: letonghan <letong.han@intel.com>

---------

Signed-off-by: letonghan <letong.han@intel.com>
2024-07-25 11:36:47 +08:00
Steve Zhang
d0b028d199 Add new example of Translation for GenAIExample (#455)
* Add Translation GMC pipeline
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
2024-07-25 09:26:41 +08:00
lvliang-intel
8b60948c7b Add auto truncate for embedding and rerank (#449)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-07-25 09:21:06 +08:00
WenjiaoYue
268d58d4a9 SearchQnA UT (#421)
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
2024-07-25 09:18:37 +08:00
WenjiaoYue
5984848bb0 update ChatQnA upload feature (#419)
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
2024-07-25 09:16:40 +08:00
lvliang-intel
f4b4ac0d3a Update TEI version v1.5 for better performance (#447)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-07-25 08:54:34 +08:00
gadmarkovits
c745641ba1 Added ChatQnA example using Qdrant retriever (#384)
* Added ChatQnA example using Qdrant retriever

Signed-off-by: gadmarkovits <gad.markovits@intel.com>

* Updated dockerfile path

Signed-off-by: gadmarkovits <gad.markovits@intel.com>

---------

Signed-off-by: gadmarkovits <gad.markovits@intel.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com>
2024-07-25 08:18:06 +08:00
Jaswanth Karani
1b48e54a3d code-gen-react-ui (#417)
* CodeGen: Manifest to deploy CodeGen with ReactUI into Kubernetes

Include manifest to deploy CodeGen pipeline with ReactUI into
Kubernetes. Include as well README file for the steps.

Signed-off-by: Yeoh, Hoong Tee <hoong.tee.yeoh@intel.com>

* code-gen-react-ui

Signed-off-by: jaswanth8888 <karani.jaswanth@gmail.com>

* made changes as per PR suggestions

Signed-off-by: jaswanth8888 <karani.jaswanth@gmail.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* updated readme

Signed-off-by: jaswanth8888 <karani.jaswanth@gmail.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update base image

Signed-off-by: jaswanth8888 <karani.jaswanth@gmail.com>

* updated codegen dockerfile react

Signed-off-by: jaswanth8888 <karani.jaswanth@gmail.com>

* updated wait time

Signed-off-by: jaswanth8888 <karani.jaswanth@gmail.com>

* updated as per PR comments

Signed-off-by: jaswanth8888 <karani.jaswanth@gmail.com>

* added react ui in gaudi test file

Signed-off-by: jaswanth8888 <karani.jaswanth@gmail.com>

* fixed PR comments

Signed-off-by: jaswanth8888 <karani.jaswanth@gmail.com>

* added docker pull for text-generation-model

Signed-off-by: jaswanth8888 <karani.jaswanth@gmail.com>

* removed unused files

Signed-off-by: jaswanth8888 <karani.jaswanth@gmail.com>

---------

Signed-off-by: Yeoh, Hoong Tee <hoong.tee.yeoh@intel.com>
Signed-off-by: jaswanth8888 <karani.jaswanth@gmail.com>
Co-authored-by: Yeoh, Hoong Tee <hoong.tee.yeoh@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com>
2024-07-24 09:01:16 -07:00
Yogesh Pandey
8c4a2534c1 FAQGen Megaservice (#425)
* Added FAQGEN v1

Signed-off-by: Yogesh Pandey <yogesh.pandey@intel.com>

---------

Signed-off-by: Yogesh Pandey <yogesh.pandey@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-07-24 23:37:20 +08:00
Steve Zhang
21b7d11098 Add new example of SearchQnA for GenAIExample (#448)
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
2024-07-24 15:59:52 +08:00
WenjiaoYue
e371b1e9d4 Update README to be consistent with .env variables (#452)
* Update README to be consistent with .env variables

Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-07-24 15:34:11 +08:00
zehao-intel
95c13d9558 Fix Typo for Translation Example (#451)
Signed-off-by: zehao-intel <zehao.huang@intel.com>
2024-07-24 13:31:12 +08:00
Ying Chun Guo
62ae64f13c add checkout ref in gmc e2e workflow (#450)
Signed-off-by: Yingchun Guo <yingchun.guo@intel.com>
2024-07-24 13:30:07 +08:00
XinyaoWa
631d841119 Support vLLM/vLLM-on-Ray/Ray Serve for ChatQnA (#428)
* support vllm for chatqna

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* add vllm-on-ray into ChatQnA

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* support ray serve in ChatQnA

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* fix conflict

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* refine readme

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* add UT for chatqna vllm

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* add UT for ChatQnA Ray Serve

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* add UT for chatqna vllm ray

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add vllm for chatqna on xeon

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* fix bug for vllm chatqna cpu

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add ut for chatqna vllm

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

---------

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-07-24 10:44:07 +08:00
Lianhao Lu
665c46ffae Update Kubernetes manifest files for deploying ChatQnA (#445)
Update Kubernetes manifest files for deploying ChatQnA without
GMC.

Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>
2024-07-24 09:59:38 +08:00
Lianhao Lu
6e797fae89 README for installing 4 workloads using helm chart (#431)
Add README to install the following workloads using helm chart:

- ChatQnA
- CodeGen
- CodeTrans
- DocSum

Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>
2024-07-24 09:46:03 +08:00
yang yuhan
b46ae8bdcc Fix a minor bug for chatqna in docker-compose (#442) (#443)
Embedding and reranking services failed to run on GPU H100.
Change the image tag and use CPU for these services. This PR will
fix #442

Signed-off-by: PeterYang12 <yuhan.yang@intel.com>
2024-07-24 08:41:47 +08:00
Ying Chun Guo
f45f508847 allow gmc e2e workflow to get secrets (#444)
Signed-off-by: Yingchun Guo <yingchun.guo@intel.com>
2024-07-23 21:34:36 +08:00
WenjiaoYue
284d855bf4 Translate UT for UI (#423)
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
2024-07-23 17:14:33 +08:00
Steve Zhang
290a74fae9 Update all examples yaml files of GMC in GenAIExample (#436)
* Update all examples yaml files of GMC in GenAIExample.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
2024-07-23 16:40:51 +08:00
Letong Han
8ad7f36fe2 Update READMEs (#430)
* update readme gaudi part & add tei-gaudi params

Signed-off-by: letonghan <letong.han@intel.com>

* modify supported habana driver version

Signed-off-by: letonghan <letong.han@intel.com>

* update env set part

Signed-off-by: letonghan <letong.han@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add example for no_proxy

Signed-off-by: letonghan <letong.han@intel.com>

* add an example of public ip

Signed-off-by: letonghan <letong.han@intel.com>

---------

Signed-off-by: letonghan <letong.han@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-07-23 15:25:02 +08:00
Lianhao Lu
2f9397e012 Update Kubernetes manifest files for CodeGen (#441)
Updated Kubernetes manifest files for CodeGen to reflect latest changes:
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>
2024-07-23 14:07:42 +08:00
Lianhao Lu
c9548d7921 Add Kubernetes manifest files for deploying CodeTrans (#435)
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>
2024-07-23 13:29:59 +08:00
Lianhao Lu
83146320aa Add Kubernetes manifest files for deploying DocSum (#434)
* Add Kubernetes manifest files for deploying DocSum

Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>
2024-07-23 13:29:32 +08:00
chen, suyue
ecf3338835 refactor searchqna ci test (#437)
Signed-off-by: chensuyue <suyue.chen@intel.com>
2024-07-23 10:17:41 +08:00
chen, suyue
b7975e79d8 refactor translation ci test (#438)
Signed-off-by: chensuyue <suyue.chen@intel.com>
2024-07-23 10:17:24 +08:00
chen, suyue
bb42307af9 add docker compose yaml print for test (#433)
Signed-off-by: chensuyue <suyue.chen@intel.com>
2024-07-22 15:36:03 +08:00
chen, suyue
2e312f44ed update pr template (#422)
Signed-off-by: chensuyue <suyue.chen@intel.com>
2024-07-18 21:34:03 +08:00
chen, suyue
ee0dcb3d37 expand tgi connect timeout (#424)
Signed-off-by: chensuyue <suyue.chen@intel.com>
2024-07-18 21:28:48 +08:00
Letong Han
f732674b1e Refine README of Examples (#420)
* update chatqna readme and set env script

Signed-off-by: letonghan <letong.han@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update for comments

Signed-off-by: letonghan <letong.han@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add consume

Signed-off-by: letonghan <letong.han@intel.com>

* modify details

Signed-off-by: letonghan <letong.han@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update codegen readme

Signed-off-by: letonghan <letong.han@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add patch modifications

Signed-off-by: letonghan <letong.han@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update codegen readme

Signed-off-by: letonghan <letong.han@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update ui options

Signed-off-by: letonghan <letong.han@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update codetrans readme

Signed-off-by: letonghan <letong.han@intel.com>

* update docsum & searchqna readme

Signed-off-by: letonghan <letong.han@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: letonghan <letong.han@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-07-18 21:17:09 +08:00
Ruoyu Ying
d9946180a2 doc: fix minor issue in GMC doc (#383)
Signed-off-by: Ruoyu Ying <ruoyu.ying@intel.com>
2024-07-18 16:21:28 +08:00
ZePan110
11a56e09ef Adding Chatqna Benchmark Test (#404)
Signed-off-by: zepan <ze.pan@intel.com>
2024-07-18 16:03:29 +08:00
chen, suyue
615f0d2547 update audioqna image name (#408)
Signed-off-by: chensuyue <suyue.chen@intel.com>
2024-07-17 23:07:08 +08:00
lvliang-intel
9551594164 Add guardrails in the ChatQnA pipeline (#407)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-07-17 21:09:54 +08:00
xiguiw
97da49f61e [Doc]Add ChatQnA Flow Chart (#398)
Signed-off-by: Wang, Xigui <xigui.wang@intel.com>
2024-07-16 21:05:56 +08:00
Sihan Chen
8d4209a015 add auto-truncate to gaudi tei (#388)
* fix sqna gaudi

* rootcause

* rootcause

---------

Co-authored-by: chen, suyue <suyue.chen@intel.com>
2024-07-15 17:13:59 +08:00
Sihan Chen
ba65415b78 Fix win PC issues (#399)
* change to LF

* add readme for windows pc

* add OLLAMA_MODEL param

* readme

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Update README.md

* Update docker_compose.yaml

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-07-15 17:11:49 +08:00