lvliang-intel
0306c620b5
Update TGI CPU image to latest official release 2.4.0 (#1035)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-04 11:28:43 +08:00
lvliang-intel
7197286a14
Fix ChatQnA manifest default port issue (#1033)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-10-30 11:52:04 +08:00
lvliang-intel
3fb60608b3
Use official tei gaudi image and update tgi gaudi version (#810)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-23 17:52:56 +08:00
XinyaoWa
d2bab99835
refine readme for reorg (#782)
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-11 14:57:29 +08:00
XinyaoWa
d73129cbf0
Refactor folder to support different vendors (#743)
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Signed-off-by: chensuyue <suyue.chen@intel.com>
2024-09-10 23:27:19 +08:00
David Kinder
67394b88fa
doc: fix headings and indenting (#748)
...
* doc: fix headings and indenting
* only one H1 header (for title) is allowed
* fix indenting under ordered lists
Signed-off-by: David B. Kinder <david.b.kinder@intel.com>
2024-09-06 12:59:33 +08:00
XinyaoWa
d487093d10
Add default model in readme for FaqGen and DocSum (#693)
...
* update default model in readme for DocSum
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
2024-08-30 12:40:36 +08:00
Sihan Chen
6674832162
fix tgi xeon tag (#641)
2024-08-21 22:17:07 +08:00
lvliang-intel
b2771ad3f2
Using TGI official release docker image for intel cpu (#581)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-08-18 17:17:44 +08:00
XinyaoWa
80e3e2a2d3
Update manifest for FaqGen (#582)
...
* update tgi version
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
* add k8s for faq
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
* add benchmark for faq
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
* refine k8s for faq
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
* add tuning for faq
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
* add prompts with different length for faq
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
* add tgi docker for llama3.1
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
* remove useless code
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
* remove nodeselector
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
* remove hg token
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
* refine code structure
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* fix readme
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
---------
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-08-13 16:29:15 +08:00
David Kinder
8d0c8fb949
doc: fix missing title H1 heading (#458)
...
Signed-off-by: David B. Kinder <david.b.kinder@intel.com>
Co-authored-by: Haihao Shen <haihao.shen@intel.com>
2024-07-26 09:32:54 +08:00
Yogesh Pandey
8c4a2534c1
FAQGen Megaservice (#425)
...
* Added FAQGEN v1
Signed-off-by: Yogesh Pandey <yogesh.pandey@intel.com>
---------
Signed-off-by: Yogesh Pandey <yogesh.pandey@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-07-24 23:37:20 +08:00