dolpher
|
5638075d65
|
Add helm deployment instructions for codegen (#1351)
Signed-off-by: Dolpher Du <dolpher.du@intel.com>
|
2025-01-08 13:20:32 +08:00 |
|
XinyaoWa
|
2f03a3a894
|
Align parameters for "max_token, repetition_penalty, presence_penalty, frequency_penalty" (#726)
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
|
2024-09-19 14:15:25 +08:00 |
|
XinyaoWa
|
d73129cbf0
|
Refactor folder to support different vendors (#743)
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Signed-off-by: chensuyue <suyue.chen@intel.com>
|
2024-09-10 23:27:19 +08:00 |
|
Steve Zhang
|
f5f1e323bb
|
Revert the LLM model for Kubernetes GMS (#675)
* revert the LLM model to meta-llama/CodeLlama-7b-hf
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
|
2024-08-30 13:54:42 +08:00 |
|
Steve Zhang
|
290a74fae9
|
Update all examples yaml files of GMC in GenAIExample (#436)
* Update all examples yaml files of GMC in GenAIExample.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
|
2024-07-23 16:40:51 +08:00 |
|
Ying Chun Guo
|
15fc6f9711
|
Optimize gmc manifest e2e tests (#382)
Signed-off-by: Yingchun Guo <yingchun.guo@intel.com>
|
2024-07-09 14:46:58 +08:00 |
|
Steve Zhang
|
2e62ecc18a
|
add docsum example e2e test for GMC. (#347)
* add docsum example e2e test for GMC.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>
* fix curl error for docsum.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>
* change the manifest e2e yaml.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>
* change the image format.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>
* fixing image mapping error.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>
* change the gmc e2e test.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>
* accelerate the e2e test.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>
* change the gmc e2e configuration.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>
* retrigger.
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
---------
Co-authored-by: Yingchun Guo <yingchun.guo@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Malini Bhandaru <malini.bhandaru@intel.com>
|
2024-06-28 03:15:51 -07:00 |
|
Steve Zhang
|
960cf38d33
|
Add codegen e2e test of genaiexample (#337)
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
|
2024-06-28 10:42:00 +08:00 |
|