CICD-at-OPEA
274af9eabc
Update vLLM version to v0.9.0
...
Signed-off-by: CICD-at-OPEA <CICD@opea.dev >
2025-05-15 22:41:49 +00:00
CICD-at-OPEA
238fb52a92
Update vLLM version to v0.8.5
...
Signed-off-by: CICD-at-OPEA <CICD@opea.dev >
2025-05-13 22:42:16 +00:00
Ying Hu
4a17638b5c
Merge branch 'main' into update_vLLM
2025-05-13 16:00:56 +08:00
Ying Hu
2596671d3f
Update README.md for remove the docker installer ( #1927 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
2025-05-12 11:40:33 +08:00
Razvan Liviu Varzaru
ebb7c24ca8
Add ChatQnA docker-compose example on Intel Xeon using MariaDB Vector ( #1916 )
...
Signed-off-by: Razvan-Liviu Varzaru <razvan@mariadb.org >
Co-authored-by: Liang Lv <liang1.lv@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-05-08 21:08:15 -07:00
CICD-at-OPEA
2160d43a32
Update vLLM version to v0.8.5
...
Signed-off-by: CICD-at-OPEA <CICD@opea.dev >
2025-05-08 08:37:52 +00:00
Sun, Xuehao
b467a13ec3
daily update vLLM&vLLM-fork version ( #1914 )
...
Signed-off-by: Sun, Xuehao <xuehao.sun@intel.com >
2025-05-08 10:34:36 +08:00
Ying Hu
40e44dfcd6
Update README.md of ChatQnA for broken URL ( #1907 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Neo Zhang Jianyu <jianyu.zhang@intel.com >
2025-05-06 13:21:31 +08:00
chen, suyue
c546d96e98
downgrade tei version from 1.6 to 1.5, fix the chatqna perf regression ( #1886 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-04-25 23:00:36 +08:00
chen, suyue
be5933ad85
Update benchmark scripts ( #1883 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-04-25 17:05:48 +08:00
chyundunovDatamonsters
bb7a675665
ChatQnA - refactoring README.md for deploy application on ROCm ( #1857 )
...
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com >
Signed-off-by: Chingis Yundunov <c.yundunov@datamonsters.com >
Co-authored-by: Chingis Yundunov <YundunovCN@sibedge.com >
Co-authored-by: Artem Astafev <a.astafev@datamonsters.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-04-25 08:52:24 +08:00
chen, suyue
13ea13862a
Remove proxy in CodeTrans test ( #1874 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-04-24 13:47:56 +08:00
dolpher
87e3c0f59f
Update chatqna values file changes ( #1844 )
...
Signed-off-by: Dolpher Du <dolpher.du@intel.com >
2025-04-21 09:38:07 +08:00
Liang Lv
1eb2e36a18
Refine ChatQnA READMEs ( #1850 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2025-04-20 10:34:24 +08:00
sri-intel
c63e2cd067
Remote inference support for examples in Productivity suite ( #1818 )
...
Signed-off-by: Srinarayan Srikanthan <srinarayan.srikanthan@intel.com >
2025-04-18 14:36:57 +08:00
Louie Tsai
c793dd0b51
Redirect Users to github.io for ChatQnA telemetry materials ( #1845 )
...
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
2025-04-17 23:35:30 -07:00
Ying Hu
1b3f1f632a
Update README.md of ChatQnA for layout ( #1842 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
2025-04-18 11:41:35 +08:00
sri-intel
90cfe89e21
new chatqna readme template ( #1755 )
...
Signed-off-by: Srinarayan Srikanthan <srinarayan.srikanthan@intel.com >
2025-04-17 16:38:40 +08:00
Letong Han
ae31e4fb75
Enable health check for dataprep in ChatQnA ( #1799 )
...
Signed-off-by: letonghan <letong.han@intel.com >
2025-04-17 15:01:57 +08:00
xiguiw
4fc19c7d73
Update TEI docker images to CPU-1.6 ( #1791 )
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
2025-04-17 15:00:06 +08:00
Liang Lv
71fe886ce9
Replaced TGI with vLLM for guardrail serving ( #1815 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2025-04-16 17:06:11 +08:00
chen, suyue
1095d88c5f
Group log lines in GHA outputs for better readable logs. ( #1821 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-04-16 13:17:53 +08:00
Liang Lv
13dd27e6d5
Update vLLM parameter max-seq-len-to-capture ( #1809 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2025-04-15 14:27:12 +08:00
pre-commit-ci[bot]
094ca7aefe
[pre-commit.ci] pre-commit autoupdate ( #1771 )
...
Signed-off-by: Sun, Xuehao <xuehao.sun@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Sun, Xuehao <xuehao.sun@intel.com >
Co-authored-by: Abolfazl Shahbazi <12436063+ashahba@users.noreply.github.com >
2025-04-09 11:51:57 -07:00
ZePan110
5f4b3a6d12
Adaptation to vllm v0.8.3 build paths ( #1761 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-04-09 13:20:02 +08:00
Lucas Melo
2d8a7e25f6
Update ChatQna & CodeGen README.md with new Automated Terraform Deployment Options ( #1731 )
...
Signed-off-by: lucasmelogithub <lucas.melo@intel.com >
2025-04-09 10:54:01 +08:00
Liang Lv
7b7728c6c3
Fix vLLM CPU initialize engine issue for DeepSeek models ( #1762 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2025-04-09 09:47:08 +08:00
XinyaoWa
6917d5bdb1
Fix ChatQnA port to internal vllm port ( #1763 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
2025-04-09 09:37:11 +08:00
dolpher
46ebb78aa3
Sync values yaml file for 1.3 release ( #1748 )
...
Signed-off-by: Dolpher Du <dolpher.du@intel.com >
2025-04-08 22:39:40 +08:00
ZePan110
42735d0d7d
Fix vllm and vllm-fork tags ( #1766 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-04-07 22:58:50 +08:00
Louie Tsai
e8cdf7d668
[ChatQnA] update to the latest Grafana Dashboard ( #1728 )
...
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
2025-04-03 12:14:55 -07:00
chen, suyue
c48cd651e4
[CICD enhance] ChatQnA run CI with latest base image, group logs in GHA outputs. ( #1736 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-04-03 22:03:20 +08:00
chyundunovDatamonsters
c50dfb2510
Adding files to deploy ChatQnA application on ROCm vLLM ( #1560 )
...
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com >
2025-04-03 17:19:26 +08:00
Louie Tsai
8fe2d5d0be
Update README.md to have Table for contents ( #1721 )
...
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
2025-04-01 10:31:05 -07:00
Xiaotian Chen
1bd56af994
Update TGI image versions ( #1625 )
...
Signed-off-by: xiaotia3 <xiaotian.chen@intel.com >
2025-04-01 11:27:51 +08:00
xiguiw
87baeb833d
Update TEI docker image to 1.6 ( #1650 )
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
2025-03-27 09:40:22 +08:00
Louie Tsai
0736912c69
change gaudi node exporter from default one to 41612 ( #1702 )
...
Signed-off-by: Louie Tsai <louie.tsai@intel.com >
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
2025-03-20 21:38:24 -07:00
XinyaoWa
6d24c1c77a
Merge FaqGen into ChatQnA ( #1654 )
...
1. Delete FaqGen
2. Refactor FaqGen into ChatQnA, serve as a LLM selection.
3. Combine all ChatQnA related Dockerfile into one
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
2025-03-20 17:40:00 +08:00
James Edwards
527b146a80
Add final README.md and set_env.sh script for quickstart review. Previous pull request was 1595. ( #1662 )
...
Signed-off-by: Edwards, James A <jaedwards@habana.ai >
Co-authored-by: Edwards, James A <jaedwards@habana.ai >
2025-03-14 16:05:01 -07:00
Louie Tsai
671dff7f51
[ChatQnA] Enable Prometheus and Grafana with telemetry docker compose file. ( #1623 )
...
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
2025-03-13 23:18:29 -07:00
Li Gang
0701b8cfff
[ChatQnA][docker]Check healthy of redis to avoid dataprep failure ( #1591 )
...
Signed-off-by: Li Gang <gang.g.li@intel.com >
2025-03-13 10:52:33 +08:00
Eero Tamminen
4269669f73
Use GenAIComp base image to simplify Dockerfiles & reduce image sizes - part 2 ( #1638 )
...
Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com >
2025-03-13 08:23:07 +08:00
chen, suyue
43d0a18270
Enhance ChatQnA test scripts ( #1643 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-03-10 17:36:26 +08:00
Wang, Kai Lawrence
5362321d3a
Fix vllm model cache directory ( #1642 )
...
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com >
2025-03-10 13:40:42 +08:00
chen, suyue
4cab86260f
Use the latest HabanaAI/vllm-fork release tag to build vllm-gaudi image ( #1635 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
Co-authored-by: Liang Lv <liang1.lv@intel.com >
2025-03-07 20:40:32 +08:00
wangleflex
694207f76b
[ChatQnA] Show spinner after query to improve user experience ( #1003 ) ( #1628 )
...
Signed-off-by: Wang,Le3 <le3.wang@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-03-07 17:08:53 +08:00
ZePan110
785ffb9a1e
Update compose.yaml for ChatQnA ( #1621 )
...
Update compose.yaml for ChatQnA
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-03-07 09:19:39 +08:00
ZePan110
6ead1b12db
Enable ChatQnA model cache for docker compose test. ( #1605 )
...
Enable ChatQnA model cache for docker compose test.
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-03-05 11:30:04 +08:00
chen, suyue
8f8d3af7c3
open chatqna frontend test ( #1594 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-03-04 10:41:22 +08:00
Spycsh
ce38a84372
Revert chatqna async and enhance tests ( #1598 )
...
align with opea-project/GenAIComps#1354
2025-03-03 23:03:44 +08:00