Liang Lv
13dd27e6d5
Update vLLM parameter max-seq-len-to-capture (#1809)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2025-04-15 14:27:12 +08:00
dolpher
46ebb78aa3
Sync values yaml file for 1.3 release (#1748)
Signed-off-by: Dolpher Du <dolpher.du@intel.com>
2025-04-08 22:39:40 +08:00
Xiaotian Chen
1bd56af994
Update TGI image versions (#1625)
Signed-off-by: xiaotia3 <xiaotian.chen@intel.com>
2025-04-01 11:27:51 +08:00
dolpher
ee0e5cc8d9
Sync value files from GenAIInfra (#1428)
All gaudi values updated with extra flags.
Added helm support for 2 new examples Text2Image and SearchQnA. Minor fix for llm-uservice.
Signed-off-by: Dolpher Du <dolpher.du@intel.com>
2025-01-22 17:44:11 +08:00
dolpher
c795ef2203
Add helm deployment instructions for GenAIExamples (#1373)
Add helm deployment instructions for ChatQnA, AgentQnA, AudioQnA, CodeTrans, DocSum, FaqGen and VisualQnA
Signed-off-by: Dolpher Du <dolpher.du@intel.com>
2025-01-10 09:55:31 +08:00
ZePan110
aa5c91d7ee
Check duplicated dockerfile (#1289)
Signed-off-by: ZePan110 <ze.pan@intel.com>
2025-01-06 17:30:12 +08:00
XinyaoWa
464e2d3125
Rename streaming to stream to align with OpenAI API (#1332)
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
2025-01-06 13:25:55 +08:00
Sihan Chen
cc1d97f816
Refactor AudioQnA/MultiModalQnA/AvatarChatbot (#1310)
Signed-off-by: chensuyue <suyue.chen@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: chensuyue <suyue.chen@intel.com>
2024-12-31 12:47:30 +08:00
lvliang-intel
1ff85f6a85
Upgrade TGI Gaudi version to v2.0.6 (#1088)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com>
2024-11-12 14:38:22 +08:00
lvliang-intel
0306c620b5
Update TGI CPU image to latest official release 2.4.0 (#1035)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-04 11:28:43 +08:00
lvliang-intel
7197286a14
Fix ChatQnA manifest default port issue (#1033)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-10-30 11:52:04 +08:00
David Kinder
3e796ba73d
doc: fix missing references to README.md (#860)
Signed-off-by: David B. Kinder <david.b.kinder@intel.com>
2024-09-24 21:40:42 +08:00
Steve Zhang
954a22051b
Make all xeon tgi image version consistent (#851)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-24 11:19:37 +08:00
lvliang-intel
3fb60608b3
Use official tei gaudi image and update tgi gaudi version (#810)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-23 17:52:56 +08:00
XinyaoWa
d2bab99835
refine readme for reorg (#782)
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-11 14:57:29 +08:00
XinyaoWa
d73129cbf0
Refactor folder to support different vendors (#743)
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Signed-off-by: chensuyue <suyue.chen@intel.com>
2024-09-10 23:27:19 +08:00
lvliang-intel
adb157f2e7
Update readme for manifests of some examples (#708)
* Update readme for manifests of some examples
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-09-03 15:00:41 +08:00
Steve Zhang
4133757642
Change docs of kubernetes for curl commands in README (#661)
* change docs for curl commands in README.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
* The Namespace 'CT' is invalid.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
2024-08-27 19:36:37 +08:00
Sihan Chen
ac324a9ec2
minor fix mismatched hf token (#651)
2024-08-22 15:11:31 +08:00
Steve Zhang
c86cf8536d
Add AudioQnA example via GMC (#597)
* add AudioQnA example via GMC.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
* add more information for e2e test scripts.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
* fix bug in e2e test scripts.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
2024-08-16 14:25:50 +08:00
Sihan Chen
0a6bad0ab9
add k8s support for audioqna (#583)
* add k8s support for audioqna
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
---------
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-08-13 17:38:18 +08:00