GenAIExamples/comps/llms/text-generation/vllm
Yao Qing 2159f9ad00 Fix vllm microservice performance issue. (#731)
* Fix vllm microservice performance issue.

Signed-off-by: Yao, Qing <qing.yao@intel.com>

* Refine llm generate parameters

Signed-off-by: Yao, Qing <qing.yao@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: Yao, Qing <qing.yao@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-25 21:37:38 +08:00