xiguiw
94222d5783
Merge branch 'main' into update_vLLM
2025-05-16 09:04:30 +08:00
CICD-at-OPEA
274af9eabc
Update vLLM version to v0.9.0
...
Signed-off-by: CICD-at-OPEA <CICD@opea.dev >
2025-05-15 22:41:49 +00:00
chen, suyue
26d07019d0
[CICD enhance] CodeTrans run CI with latest base image, group logs in GHA outputs. ( #1929 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-05-14 11:11:54 +08:00
CICD-at-OPEA
2160d43a32
Update vLLM version to v0.8.5
...
Signed-off-by: CICD-at-OPEA <CICD@opea.dev >
2025-05-08 08:37:52 +00:00
Sun, Xuehao
b467a13ec3
daily update vLLM&vLLM-fork version ( #1914 )
...
Signed-off-by: Sun, Xuehao <xuehao.sun@intel.com >
2025-05-08 10:34:36 +08:00
ZePan110
04d527d3b0
Integrate set_env to ut scripts for CodeTrans. ( #1868 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-04-28 13:53:50 +08:00
chyundunovDatamonsters
1fdab591d9
CodeTrans - refactoring README.md for deploy application on ROCm with Docker Compose ( #1875 )
...
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com >
2025-04-24 15:28:57 +08:00
chen, suyue
13ea13862a
Remove proxy in CodeTrans test ( #1874 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-04-24 13:47:56 +08:00
Letong Han
697f78ea71
Refine the READMEs of CodeTrans ( #1796 )
...
Signed-off-by: letonghan <letong.han@intel.com >
2025-04-21 17:14:46 +08:00
Liang Lv
13dd27e6d5
Update vLLM parameter max-seq-len-to-capture ( #1809 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2025-04-15 14:27:12 +08:00
ZePan110
00d7a65dd8
Enable model cache for Rocm docker compose test. ( #1614 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-04-10 09:40:37 +08:00
ZePan110
5f4b3a6d12
Adaptation to vllm v0.8.3 build paths ( #1761 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-04-09 13:20:02 +08:00
dolpher
46ebb78aa3
Sync values yaml file for 1.3 release ( #1748 )
...
Signed-off-by: Dolpher Du <dolpher.du@intel.com >
2025-04-08 22:39:40 +08:00
ZePan110
42735d0d7d
Fix vllm and vllm-fork tags ( #1766 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-04-07 22:58:50 +08:00
Xiaotian Chen
1bd56af994
Update TGI image versions ( #1625 )
...
Signed-off-by: xiaotia3 <xiaotian.chen@intel.com >
2025-04-01 11:27:51 +08:00
chyundunovDatamonsters
a04463d5e3
Adding files to deploy CodeTrans application on ROCm vLLM ( #1545 )
...
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com >
2025-03-24 15:33:35 +08:00
Eero Tamminen
d397e3f631
Use GenAIComp base image to simplify Dockerfiles - part 3/4 ( #1671 )
...
Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com >
2025-03-24 09:17:12 +08:00
Wang, Kai Lawrence
5362321d3a
Fix vllm model cache directory ( #1642 )
...
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com >
2025-03-10 13:40:42 +08:00
chen, suyue
4cab86260f
Use the latest HabanaAI/vllm-fork release tag to build vllm-gaudi image ( #1635 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
Co-authored-by: Liang Lv <liang1.lv@intel.com >
2025-03-07 20:40:32 +08:00
Letong Han
9180f1066d
Enable vllm for CodeTrans ( #1626 )
...
Set vllm as default llm serving, and add related docker compose files, readmes, and test scripts.
Issue: https://github.com/opea-project/GenAIExamples/issues/1436
Signed-off-by: letonghan <letong.han@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-03-07 10:56:21 +08:00
ZePan110
5aecea8e47
Update compose.yaml ( #1619 )
...
Update compose.yaml for CodeGen, CodeTrans and DocSum
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-03-07 09:20:28 +08:00
ZePan110
c1b5ba281f
Enable CodeGen,CodeTrans and DocSum model cache for docker compose test. ( #1599 )
...
1.Add cache path check
2.Enable CodeGen,CodeTrans and DocSum model cache for docker compose test.
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-03-04 16:10:20 +08:00
chen, suyue
81b02bb947
Revert "HUGGINGFACEHUB_API_TOKEN environment is change to HF_TOKEN (#… ( #1521 )
...
Revert this PR since the test is not triggered properly due to the false merge of a WIP CI PR, 44a689b0bf , which block the CI test.
This change will be submitted in another PR.
2025-02-11 18:36:12 +08:00
xiguiw
45d5da2ddd
HUGGINGFACEHUB_API_TOKEN environment is change to HF_TOKEN ( #1503 )
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
2025-02-09 20:33:06 +08:00
dolpher
ee0e5cc8d9
Sync value files from GenAIInfra ( #1428 )
...
All gaudi values updated with extra flags.
Added helm support for 2 new examples Text2Image and SearchQnA. Minor fix for llm-uservice.
Signed-off-by: Dolpher Du <dolpher.du@intel.com >
2025-01-22 17:44:11 +08:00
WenjiaoYue
b721c256f9
Fix Domain Access Issue in Latest Vite Version ( #1444 )
...
Fix the restriction on using domain names when users are using the latest version of Vite
When users use the new version of Vite, the UI cannot be accessed via domain names due to Vite's new rules. This fix adds the corresponding parameters according to Vite's new rules, ensuring that users can access the frontend via domain names when building the UI.
Fixes #1441
Co-authored-by: WenjiaoYue <wenjiao.yue@intel.com >
2025-01-21 23:28:37 +08:00
chen, suyue
927698e23e
Simplify git clone code in CI test ( #1434 )
...
1. Simplify git clone code in CI test.
2. Replace git clone branch in Dockerfile.
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-01-21 23:00:08 +08:00
Eero Tamminen
0eae391fda
Use staged builds to minimize final image sizes ( #1031 )
...
Staged image builds so that final images do not have redundant things like:
- Git tool and its deps
- Git repo history
- Test directories
Fixes : #225
Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com >
2025-01-16 11:14:47 +08:00
Liang Lv
3ca78867eb
Update example code for embedding dependency moving to 3rd_party ( #1368 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2025-01-10 15:36:58 +08:00
dolpher
c795ef2203
Add helm deployment instructions for GenAIExamples ( #1373 )
...
Add helm deployment instructions for ChatQnA, AgentQnA, AudioQnA, CodeTrans, DocSum, FaqGen and VisualQnA
Signed-off-by: Dolpher Du <dolpher.du@intel.com >
2025-01-10 09:55:31 +08:00
ZePan110
aa5c91d7ee
Check duplicated dockerfile ( #1289 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-01-06 17:30:12 +08:00
XinyaoWa
464e2d3125
Rename streaming to stream to align with OpenAI API ( #1332 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
2025-01-06 13:25:55 +08:00
chen, suyue
5c7a5bd850
Update Code and README for GenAIComps Refactor ( #1285 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Signed-off-by: chensuyue <suyue.chen@intel.com >
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
Signed-off-by: letonghan <letong.han@intel.com >
Signed-off-by: ZePan110 <ze.pan@intel.com >
Signed-off-by: WenjiaoYue <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>
2025-01-02 20:03:26 +08:00
lkk
e18369ba0d
remove examples gateway. ( #1250 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-12-14 13:19:51 +08:00
Lianhao Lu
6f9f6f0bad
Remove deprecated docker compose files ( #1238 )
...
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com >
2024-12-10 09:43:19 +08:00
lkk
bde285dfce
move examples gateway ( #992 )
...
Co-authored-by: root <root@idc708073.jf.intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Sihan Chen <39623753+Spycsh@users.noreply.github.com >
2024-12-06 14:40:25 +08:00
chen, suyue
cc108b5a18
Fix DBQnA image build ( #1165 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
2024-11-20 10:56:49 +08:00
chyundunovDatamonsters
7e62175c2e
Adding files to deploy CodeTrans application on AMD GPU ( #1138 )
...
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com >
2024-11-18 14:58:38 +08:00
Louie Tsai
152adf8012
maintain a version info for docker_compose yaml files among release ( #1141 )
...
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
2024-11-17 22:39:41 -08:00
Lianhao Lu
cbe952ec5e
Fail CI manifest test if response content is not expected ( #1145 )
...
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com >
Co-authored-by: Abolfazl Shahbazi <12436063+ashahba@users.noreply.github.com >
2024-11-17 12:46:31 +08:00
chen, suyue
2b2c7ee2f5
upgrade setuptools version to fix CVE-2024-6345 ( #999 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
2024-11-14 14:57:16 +08:00
Abolfazl Shahbazi
b5f95f735e
Fix missing end of file chars ( #1106 )
...
Signed-off-by: Abolfazl Shahbazi <12436063+ashahba@users.noreply.github.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-13 09:40:53 -08:00
lvliang-intel
1ff85f6a85
Upgrade TGI Gaudi version to v2.0.6 ( #1088 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: chen, suyue <suyue.chen@intel.com >
2024-11-12 14:38:22 +08:00
Neo Zhang Jianyu
2d9aeb3715
fix wrong format which break online doc build ( #1073 )
...
Co-authored-by: ZhangJianyu <zhang.jianyu@outlook.com >
2024-11-05 17:01:40 +08:00
lvliang-intel
0306c620b5
Update TGI CPU image to latest official release 2.4.0 ( #1035 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-04 11:28:43 +08:00
Yi Yao
ced68e1834
Add performance benchmark scripts for 4 use cases. ( #1052 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-03 12:41:02 +08:00
lvliang-intel
7197286a14
Fix ChatQnA manifest default port issue ( #1033 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2024-10-30 11:52:04 +08:00
XinyaoWa
a2afce1675
update codetrans default model ( #1015 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-28 09:11:54 +08:00
Louie Tsai
90c2d49050
Update CodeTrans README.md for workflow ( #908 )
...
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
2024-10-25 12:39:18 +08:00
lvliang-intel
9438d392b4
Update README for some minor issues ( #1000 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2024-10-22 10:30:18 +08:00