chyundunovDatamonsters
31b1d69e40
Adding files to deploy CodeGen application on ROCm vLLM ( #1544 )
...
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com >
2025-03-24 14:45:17 +08:00
Eero Tamminen
d397e3f631
Use GenAIComp base image to simplify Dockerfiles - part 3/4 ( #1671 )
...
Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com >
2025-03-24 09:17:12 +08:00
xiguiw
effa2a28cf
Enable CodeGen vLLM ( #1636 )
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-03-13 10:38:47 +08:00
ZePan110
5aecea8e47
Update compose.yaml ( #1619 )
...
Update compose.yaml for CodeGen, CodeTrans and DocSum
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-03-07 09:20:28 +08:00
ZePan110
c1b5ba281f
Enable CodeGen,CodeTrans and DocSum model cache for docker compose test. ( #1599 )
...
1.Add cache path check
2.Enable CodeGen,CodeTrans and DocSum model cache for docker compose test.
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-03-04 16:10:20 +08:00
chen, suyue
81b02bb947
Revert "HUGGINGFACEHUB_API_TOKEN environment is change to HF_TOKEN (#… ( #1521 )
...
Revert this PR since the test is not triggered properly due to the false merge of a WIP CI PR, 44a689b0bf , which block the CI test.
This change will be submitted in another PR.
2025-02-11 18:36:12 +08:00
xiguiw
45d5da2ddd
HUGGINGFACEHUB_API_TOKEN environment is change to HF_TOKEN ( #1503 )
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
2025-02-09 20:33:06 +08:00
dolpher
ee0e5cc8d9
Sync value files from GenAIInfra ( #1428 )
...
All gaudi values updated with extra flags.
Added helm support for 2 new examples Text2Image and SearchQnA. Minor fix for llm-uservice.
Signed-off-by: Dolpher Du <dolpher.du@intel.com >
2025-01-22 17:44:11 +08:00
WenjiaoYue
b721c256f9
Fix Domain Access Issue in Latest Vite Version ( #1444 )
...
Fix the restriction on using domain names when users are using the latest version of Vite
When users use the new version of Vite, the UI cannot be accessed via domain names due to Vite's new rules. This fix adds the corresponding parameters according to Vite's new rules, ensuring that users can access the frontend via domain names when building the UI.
Fixes #1441
Co-authored-by: WenjiaoYue <wenjiao.yue@intel.com >
2025-01-21 23:28:37 +08:00
chen, suyue
927698e23e
Simplify git clone code in CI test ( #1434 )
...
1. Simplify git clone code in CI test.
2. Replace git clone branch in Dockerfile.
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-01-21 23:00:08 +08:00
xiguiw
2d5898244c
Enchance health check in GenAIExample docker-compose ( #1410 )
...
Fix service launch issue
1. Update Gaudi TGI image from 2.0.6 to 2.3.1
2. Change the hpu-gaudi TGI health check condition.
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
2025-01-20 20:13:13 +08:00
xiguiw
698a06edbf
[DOC] Fix document issue ( #1395 )
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
2025-01-16 11:30:07 +08:00
Eero Tamminen
0eae391fda
Use staged builds to minimize final image sizes ( #1031 )
...
Staged image builds so that final images do not have redundant things like:
- Git tool and its deps
- Git repo history
- Test directories
Fixes : #225
Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com >
2025-01-16 11:14:47 +08:00
dolpher
c795ef2203
Add helm deployment instructions for GenAIExamples ( #1373 )
...
Add helm deployment instructions for ChatQnA, AgentQnA, AudioQnA, CodeTrans, DocSum, FaqGen and VisualQnA
Signed-off-by: Dolpher Du <dolpher.du@intel.com >
2025-01-10 09:55:31 +08:00
Jaswanth Karani
ddacb7e86d
fixed build issue ( #1367 )
2025-01-08 22:19:23 +08:00
dolpher
5638075d65
Add helm deployment instructions for codegen ( #1351 )
...
Signed-off-by: Dolpher Du <dolpher.du@intel.com >
2025-01-08 13:20:32 +08:00
ZePan110
aa5c91d7ee
Check duplicated dockerfile ( #1289 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-01-06 17:30:12 +08:00
XinyaoWa
464e2d3125
Rename streaming to stream to align with OpenAI API ( #1332 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
2025-01-06 13:25:55 +08:00
chen, suyue
5c7a5bd850
Update Code and README for GenAIComps Refactor ( #1285 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Signed-off-by: chensuyue <suyue.chen@intel.com >
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
Signed-off-by: letonghan <letong.han@intel.com >
Signed-off-by: ZePan110 <ze.pan@intel.com >
Signed-off-by: WenjiaoYue <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>
2025-01-02 20:03:26 +08:00
chen, suyue
6b6a08df78
Add minimal containers and ports clean up before test ( #1291 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
2024-12-26 10:59:26 +08:00
lkk
e18369ba0d
remove examples gateway. ( #1250 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-12-14 13:19:51 +08:00
Lianhao Lu
6f9f6f0bad
Remove deprecated docker compose files ( #1238 )
...
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com >
2024-12-10 09:43:19 +08:00
lkk
bde285dfce
move examples gateway ( #992 )
...
Co-authored-by: root <root@idc708073.jf.intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Sihan Chen <39623753+Spycsh@users.noreply.github.com >
2024-12-06 14:40:25 +08:00
WenjiaoYue
8192c3166f
Update OPEA example package.json version ( #1211 )
...
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-12-02 21:33:30 +08:00
chen, suyue
cc108b5a18
Fix DBQnA image build ( #1165 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
2024-11-20 10:56:49 +08:00
Louie Tsai
152adf8012
maintain a version info for docker_compose yaml files among release ( #1141 )
...
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
2024-11-17 22:39:41 -08:00
chyundunovDatamonsters
83172e9a99
Adding files to deploy CodeGen application on AMD GPU ( #1130 )
...
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-18 14:36:23 +08:00
Lianhao Lu
cbe952ec5e
Fail CI manifest test if response content is not expected ( #1145 )
...
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com >
Co-authored-by: Abolfazl Shahbazi <12436063+ashahba@users.noreply.github.com >
2024-11-17 12:46:31 +08:00
chen, suyue
2b2c7ee2f5
upgrade setuptools version to fix CVE-2024-6345 ( #999 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
2024-11-14 14:57:16 +08:00
Abolfazl Shahbazi
b5f95f735e
Fix missing end of file chars ( #1106 )
...
Signed-off-by: Abolfazl Shahbazi <12436063+ashahba@users.noreply.github.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-13 09:40:53 -08:00
lvliang-intel
1ff85f6a85
Upgrade TGI Gaudi version to v2.0.6 ( #1088 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: chen, suyue <suyue.chen@intel.com >
2024-11-12 14:38:22 +08:00
Neo Zhang Jianyu
2d9aeb3715
fix wrong format which break online doc build ( #1073 )
...
Co-authored-by: ZhangJianyu <zhang.jianyu@outlook.com >
2024-11-05 17:01:40 +08:00
lvliang-intel
0306c620b5
Update TGI CPU image to latest official release 2.4.0 ( #1035 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-04 11:28:43 +08:00
Yi Yao
ced68e1834
Add performance benchmark scripts for 4 use cases. ( #1052 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-03 12:41:02 +08:00
lvliang-intel
7197286a14
Fix ChatQnA manifest default port issue ( #1033 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2024-10-30 11:52:04 +08:00
Yao Qing
2332d22950
[Codegen] Replace codegen default Model to Qwen/Qwen2.5-Coder-7B-Instruct. ( #1013 )
...
Signed-off-by: Yao, Qing <qing.yao@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-28 09:18:01 +08:00
Chun Tao
41955f65ad
Add a sample UI image for CodeGen's TGI monitoring ( #1009 )
...
Signed-off-by: Chun Tao <chun.tao@intel.com >
2024-10-23 14:38:12 +08:00
lvliang-intel
9438d392b4
Update README for some minor issues ( #1000 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2024-10-22 10:30:18 +08:00
chen, suyue
eeced9b31c
Enhance CI/CD image build ( #961 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
2024-10-17 14:33:58 +08:00
lkk
088ab98f31
update examples accuracy ( #941 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-14 13:20:50 +08:00
Louie Tsai
12469c92d8
Update CodeGen README for its workflow ( #911 )
...
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
2024-10-10 08:47:56 -07:00
David Kinder
3e796ba73d
doc: fix missing references to README.md ( #860 )
...
Signed-off-by: David B. Kinder <david.b.kinder@intel.com >
2024-09-24 21:40:42 +08:00
lvliang-intel
3fb60608b3
Use official tei gaudi image and update tgi gaudi version ( #810 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-23 17:52:56 +08:00
Letong Han
a09395e4a4
[Doc] Update CodeGen and Translation READMEs ( #847 )
...
Signed-off-by: letonghan <letong.han@intel.com >
2024-09-19 16:01:35 +08:00
lkk
f04f061f8c
move evaluation scripts ( #842 )
...
Co-authored-by: root <root@idc708073.jf.intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-19 15:59:13 +08:00
XinyaoWa
2f03a3a894
Align parameters for "max_token, repetition_penalty,presence_penalty,frequency_penalty" ( #726 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-19 14:15:25 +08:00
lvliang-intel
bceacdc804
Fix README issues ( #817 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-18 09:50:17 +08:00
Jaswanth Karani
b84c98983d
Made cogen react ui to use runtime environment variables ( #807 )
...
Signed-off-by: jaswanth8888 <karani.jaswanth@gmail.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-14 09:27:14 +08:00
Malini Bhandaru
558ea3bb7f
adopted tech writing style ( #796 )
...
Signed-off-by: Malini Bhandaru <malini.bhandaru@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-13 09:42:59 +08:00
XinyaoWa
264759d85a
fix path bug for reorg ( #801 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
2024-09-12 17:52:06 +08:00