Xiaotian Chen
1bd56af994
Update TGI image versions ( #1625 )
...
Signed-off-by: xiaotia3 <xiaotian.chen@intel.com >
2025-04-01 11:27:51 +08:00
chyundunovDatamonsters
a04463d5e3
Adding files to deploy CodeTrans application on ROCm vLLM ( #1545 )
...
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com >
2025-03-24 15:33:35 +08:00
Eero Tamminen
d397e3f631
Use GenAIComp base image to simplify Dockerfiles - part 3/4 ( #1671 )
...
Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com >
2025-03-24 09:17:12 +08:00
Wang, Kai Lawrence
5362321d3a
Fix vllm model cache directory ( #1642 )
...
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com >
2025-03-10 13:40:42 +08:00
chen, suyue
4cab86260f
Use the latest HabanaAI/vllm-fork release tag to build vllm-gaudi image ( #1635 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
Co-authored-by: Liang Lv <liang1.lv@intel.com >
2025-03-07 20:40:32 +08:00
Letong Han
9180f1066d
Enable vllm for CodeTrans ( #1626 )
...
Set vllm as default llm serving, and add related docker compose files, readmes, and test scripts.
Issue: https://github.com/opea-project/GenAIExamples/issues/1436
Signed-off-by: letonghan <letong.han@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-03-07 10:56:21 +08:00
ZePan110
5aecea8e47
Update compose.yaml ( #1619 )
...
Update compose.yaml for CodeGen, CodeTrans and DocSum
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-03-07 09:20:28 +08:00
ZePan110
c1b5ba281f
Enable CodeGen,CodeTrans and DocSum model cache for docker compose test. ( #1599 )
...
1.Add cache path check
2.Enable CodeGen,CodeTrans and DocSum model cache for docker compose test.
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-03-04 16:10:20 +08:00
chen, suyue
81b02bb947
Revert "HUGGINGFACEHUB_API_TOKEN environment is change to HF_TOKEN (#… ( #1521 )
...
Revert this PR since the test is not triggered properly due to the false merge of a WIP CI PR, 44a689b0bf , which block the CI test.
This change will be submitted in another PR.
2025-02-11 18:36:12 +08:00
xiguiw
45d5da2ddd
HUGGINGFACEHUB_API_TOKEN environment is change to HF_TOKEN ( #1503 )
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
2025-02-09 20:33:06 +08:00
dolpher
ee0e5cc8d9
Sync value files from GenAIInfra ( #1428 )
...
All gaudi values updated with extra flags.
Added helm support for 2 new examples Text2Image and SearchQnA. Minor fix for llm-uservice.
Signed-off-by: Dolpher Du <dolpher.du@intel.com >
2025-01-22 17:44:11 +08:00
WenjiaoYue
b721c256f9
Fix Domain Access Issue in Latest Vite Version ( #1444 )
...
Fix the restriction on using domain names when users are using the latest version of Vite
When users use the new version of Vite, the UI cannot be accessed via domain names due to Vite's new rules. This fix adds the corresponding parameters according to Vite's new rules, ensuring that users can access the frontend via domain names when building the UI.
Fixes #1441
Co-authored-by: WenjiaoYue <wenjiao.yue@intel.com >
2025-01-21 23:28:37 +08:00
chen, suyue
927698e23e
Simplify git clone code in CI test ( #1434 )
...
1. Simplify git clone code in CI test.
2. Replace git clone branch in Dockerfile.
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-01-21 23:00:08 +08:00
Eero Tamminen
0eae391fda
Use staged builds to minimize final image sizes ( #1031 )
...
Staged image builds so that final images do not have redundant things like:
- Git tool and its deps
- Git repo history
- Test directories
Fixes : #225
Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com >
2025-01-16 11:14:47 +08:00
Liang Lv
3ca78867eb
Update example code for embedding dependency moving to 3rd_party ( #1368 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2025-01-10 15:36:58 +08:00
dolpher
c795ef2203
Add helm deployment instructions for GenAIExamples ( #1373 )
...
Add helm deployment instructions for ChatQnA, AgentQnA, AudioQnA, CodeTrans, DocSum, FaqGen and VisualQnA
Signed-off-by: Dolpher Du <dolpher.du@intel.com >
2025-01-10 09:55:31 +08:00
ZePan110
aa5c91d7ee
Check duplicated dockerfile ( #1289 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-01-06 17:30:12 +08:00
XinyaoWa
464e2d3125
Rename streaming to stream to align with OpenAI API ( #1332 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
2025-01-06 13:25:55 +08:00
chen, suyue
5c7a5bd850
Update Code and README for GenAIComps Refactor ( #1285 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Signed-off-by: chensuyue <suyue.chen@intel.com >
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
Signed-off-by: letonghan <letong.han@intel.com >
Signed-off-by: ZePan110 <ze.pan@intel.com >
Signed-off-by: WenjiaoYue <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>
2025-01-02 20:03:26 +08:00
lkk
e18369ba0d
remove examples gateway. ( #1250 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-12-14 13:19:51 +08:00
Lianhao Lu
6f9f6f0bad
Remove deprecated docker compose files ( #1238 )
...
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com >
2024-12-10 09:43:19 +08:00
lkk
bde285dfce
move examples gateway ( #992 )
...
Co-authored-by: root <root@idc708073.jf.intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Sihan Chen <39623753+Spycsh@users.noreply.github.com >
2024-12-06 14:40:25 +08:00
chen, suyue
cc108b5a18
Fix DBQnA image build ( #1165 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
2024-11-20 10:56:49 +08:00
chyundunovDatamonsters
7e62175c2e
Adding files to deploy CodeTrans application on AMD GPU ( #1138 )
...
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com >
2024-11-18 14:58:38 +08:00
Louie Tsai
152adf8012
maintain a version info for docker_compose yaml files among release ( #1141 )
...
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
2024-11-17 22:39:41 -08:00
Lianhao Lu
cbe952ec5e
Fail CI manifest test if response content is not expected ( #1145 )
...
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com >
Co-authored-by: Abolfazl Shahbazi <12436063+ashahba@users.noreply.github.com >
2024-11-17 12:46:31 +08:00
chen, suyue
2b2c7ee2f5
upgrade setuptools version to fix CVE-2024-6345 ( #999 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
2024-11-14 14:57:16 +08:00
Abolfazl Shahbazi
b5f95f735e
Fix missing end of file chars ( #1106 )
...
Signed-off-by: Abolfazl Shahbazi <12436063+ashahba@users.noreply.github.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-13 09:40:53 -08:00
lvliang-intel
1ff85f6a85
Upgrade TGI Gaudi version to v2.0.6 ( #1088 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: chen, suyue <suyue.chen@intel.com >
2024-11-12 14:38:22 +08:00
Neo Zhang Jianyu
2d9aeb3715
fix wrong format which break online doc build ( #1073 )
...
Co-authored-by: ZhangJianyu <zhang.jianyu@outlook.com >
2024-11-05 17:01:40 +08:00
lvliang-intel
0306c620b5
Update TGI CPU image to latest official release 2.4.0 ( #1035 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-04 11:28:43 +08:00
Yi Yao
ced68e1834
Add performance benchmark scripts for 4 use cases. ( #1052 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-03 12:41:02 +08:00
lvliang-intel
7197286a14
Fix ChatQnA manifest default port issue ( #1033 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2024-10-30 11:52:04 +08:00
XinyaoWa
a2afce1675
update codetrans default model ( #1015 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-28 09:11:54 +08:00
Louie Tsai
90c2d49050
Update CodeTrans README.md for workflow ( #908 )
...
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
2024-10-25 12:39:18 +08:00
lvliang-intel
9438d392b4
Update README for some minor issues ( #1000 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2024-10-22 10:30:18 +08:00
chen, suyue
eeced9b31c
Enhance CI/CD image build ( #961 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
2024-10-17 14:33:58 +08:00
David Kinder
3e796ba73d
doc: fix missing references to README.md ( #860 )
...
Signed-off-by: David B. Kinder <david.b.kinder@intel.com >
2024-09-24 21:40:42 +08:00
lvliang-intel
3fb60608b3
Use official tei gaudi image and update tgi gaudi version ( #810 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-23 17:52:56 +08:00
XinyaoWa
2f03a3a894
Align parameters for "max_token, repetition_penalty,presence_penalty,frequency_penalty" ( #726 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-19 14:15:25 +08:00
lvliang-intel
bceacdc804
Fix README issues ( #817 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-18 09:50:17 +08:00
ZhaoqiongZ
f990f7966e
update doc according to comments ( #805 )
...
Signed-off-by: Zheng, Zhaoqiong <zhaoqiong.zheng@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-13 19:55:33 +08:00
Malini Bhandaru
558ea3bb7f
adopted tech writing style ( #796 )
...
Signed-off-by: Malini Bhandaru <malini.bhandaru@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-13 09:42:59 +08:00
XinyaoWa
264759d85a
fix path bug for reorg ( #801 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
2024-09-12 17:52:06 +08:00
XinyaoWa
d2bab99835
refine readme for reorg ( #782 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-11 14:57:29 +08:00
Lianhao Lu
ff6f841ec0
README: fix broken links ( #781 )
...
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com >
2024-09-11 09:41:01 +08:00
XinyaoWa
d73129cbf0
Refactor folder to support different vendors ( #743 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
Signed-off-by: chensuyue <suyue.chen@intel.com >
2024-09-10 23:27:19 +08:00
Lianhao Lu
ba94e0130d
Add ui/nginx support in K8S manifest for ChatQnA/CodeGen/CodeTrans/Docsum ( #773 )
...
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com >
2024-09-10 16:30:14 +08:00
Lianhao Lu
0629696333
K8S manifest: Update ChatQnA/CodeGen/CodeTrans/DocSum
...
- Update ChatQnA/CodeGen/CodeTrans/DocSum k8s manifest
to avoid requiring creating directory for cache model.
- Add chatqna-guardrails manifest files.
- Fix bug #752 introduced by PR #669
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com >
2024-09-06 16:09:42 +08:00
David Kinder
67394b88fa
doc: fix headings and indenting ( #748 )
...
* doc: fix headings and indenting
* only one H1 header (for title) is allowed
* fix indenting under ordered lists
Signed-off-by: David B. Kinder <david.b.kinder@intel.com >
2024-09-06 12:59:33 +08:00