* support vllm for chatqna
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
* add vllm-on-ray into ChatQnA
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
* support ray serve in ChatQnA
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
* fix conflice
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
* refine readme
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
* add UT for chatqna vllm
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
* add UT for ChatQnA Ray Serve
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
* add UT for chatqna vllm ray
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add vllm for chatqna on xeon
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
* fix bug for vllm chatqna cpu
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add ut for chatqna vllm
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
---------
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Add README to install the following workloads using helm chart:
- ChatQnA
- CodeGen
- CodeTrans
- DocSum
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>
Embedding and reranking services failed to run on GPU H100.
Change the image tag and use CPU for these services. This PR will
fix#442
Signed-off-by: PeterYang12 <yuhan.yang@intel.com>
* update readme gaudi part & add tei-gaudi params
Signed-off-by: letonghan <letong.han@intel.com>
* modify supported habana driver version
Signed-off-by: letonghan <letong.han@intel.com>
* update env set part
Signed-off-by: letonghan <letong.han@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add example for no_proxy
Signed-off-by: letonghan <letong.han@intel.com>
* add an example of public ip
Signed-off-by: letonghan <letong.han@intel.com>
---------
Signed-off-by: letonghan <letong.han@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* update chatqna readme and set env script
Signed-off-by: letonghan <letong.han@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update for comments
Signed-off-by: letonghan <letong.han@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add consume
Signed-off-by: letonghan <letong.han@intel.com>
* modify details
Signed-off-by: letonghan <letong.han@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update codegen readme
Signed-off-by: letonghan <letong.han@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add patch modifications
Signed-off-by: letonghan <letong.han@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update codegen readme
Signed-off-by: letonghan <letong.han@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update ui options
Signed-off-by: letonghan <letong.han@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* udpate codetrans readme
Signed-off-by: letonghan <letong.han@intel.com>
* update docsum & searchqna readme
Signed-off-by: letonghan <letong.han@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
---------
Signed-off-by: letonghan <letong.han@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* change to LF
* add readme for windows pc
* add OLLAMA_MODEL param
* readme
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Update README.md
* Update docker_compose.yaml
---------
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* Updated READMEs for kubernetes.
Signed-off-by: mkbhanda <malini.bhandaru@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Kubernetes related Readme.
Signed-off-by: mkbhanda <malini.bhandaru@intel.com>
---------
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Added README and docker-compose updates for running chat Conversation on Gaudi
Signed-off-by: Yogesh Pandey <yogesh.pandey@intel.com>
* Updated tests
Signed-off-by: Yogesh Pandey <yogesh.pandey@intel.com>
* updates README and compose file as per review comments
Signed-off-by: Yogesh Pandey <yogesh.pandey@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
---------
Signed-off-by: Yogesh Pandey <yogesh.pandey@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com>
* [Doc] Add valid micro-service details
Signed-off-by: Wang, Xigui <xigui.wang@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
---------
Signed-off-by: Wang, Xigui <xigui.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* Add codegen flowchart
Signed-off-by: Chun Tao <chun.tao@intel.com>
* update flowchart to markdown format
Signed-off-by: Chun Tao <chun.tao@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update markdown diagram
Signed-off-by: Chun Tao <chun.tao@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update
Signed-off-by: Chun Tao <chun.tao@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* delete last line
Signed-off-by: Chun Tao <chun.tao@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Add flowchart for CodeGen, update readme
Signed-off-by: Chun Tao <chun.tao@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* udpates
Signed-off-by: Chun Tao <chun.tao@intel.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
---------
Signed-off-by: Chun Tao <chun.tao@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>