Letong Han 3913c7bb36 Refine docker_compose for dataprep param settings (#486)
* add TEI_ENDPOINT for dataprep

Signed-off-by: letonghan <letong.han@intel.com>

* add TEI_ENDPOINT for dataprep

Signed-off-by: letonghan <letong.han@intel.com>

* update ui test file

Signed-off-by: letonghan <letong.han@intel.com>

* add dataprep test into script

Signed-off-by: letonghan <letong.han@intel.com>

* refine e2e test of chatqna

Signed-off-by: letonghan <letong.han@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix missing dockerfile path

Signed-off-by: letonghan <letong.han@intel.com>

* check test issue

Signed-off-by: letonghan <letong.han@intel.com>

* check dataprep log

Signed-off-by: letonghan <letong.han@intel.com>

* reverse codetrans

Signed-off-by: letonghan <letong.han@intel.com>

* cd log path

Signed-off-by: letonghan <letong.han@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add hf token in compose yaml

Signed-off-by: letonghan <letong.han@intel.com>

* add redis_host

Signed-off-by: letonghan <letong.han@intel.com>

* add upload link test

Signed-off-by: letonghan <letong.han@intel.com>

* re-format validate dataprep

Signed-off-by: letonghan <letong.han@intel.com>

* fix typo

Signed-off-by: letonghan <letong.han@intel.com>

* fix frontend env name

Signed-off-by: letonghan <letong.han@intel.com>

* fix ci issue

Signed-off-by: letonghan <letong.han@intel.com>

---------

Signed-off-by: letonghan <letong.han@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-08-06 11:57:01 +08:00
2024-08-02 09:31:30 +08:00
2024-08-02 09:31:30 +08:00
2024-08-02 09:31:30 +08:00
2024-08-02 09:31:30 +08:00
2024-08-02 09:31:30 +08:00
2024-08-02 09:31:30 +08:00
2024-08-02 09:31:30 +08:00
2024-07-15 17:11:49 +08:00
2024-05-16 10:47:43 +08:00
2024-03-22 06:56:14 +08:00
2024-03-27 19:14:19 +08:00
2024-08-05 15:32:26 +08:00
2024-08-05 15:32:26 +08:00

Generative AI Examples

version license


Introduction

GenAIComps-based Generative AI examples offer streamlined deployment, testing, and scalability. All examples are fully compatible with Docker and Kubernetes, supporting a wide range of hardware platforms such as Gaudi, Xeon, and other hardwares.

Architecture

GenAIComps is a service-based tool that includes microservice components such as llm, embedding, reranking, and so on. Using these components, various examples in GenAIExample can be constructed, including ChatQnA, DocSum, etc.

GenAIInfra, part of the OPEA containerization and cloud-native suite, enables quick and efficient deployment of GenAIExamples in the cloud.

GenAIEval measures service performance metrics such as throughput, latency, and accuracy for GenAIExamples. This feature helps users compare performance across various hardware configurations easily.

Getting Started

GenAIExamples offers flexible deployment options that cater to different user needs, enabling efficient use and deployment in various environments. Heres a brief overview of the three primary methods: Python startup, Docker Compose, and Kubernetes.

  1. Docker Compose: Check the released docker images in docker image list for detailed information.
  2. Kubernetes: Follow the steps at K8s Install and GMC Install to setup k8s and GenAI environment .

Users can choose the most suitable approach based on ease of setup, scalability needs, and the environment in which they are operating.

Deployment

Use Cases Deployment
Docker Compose Kubernetes
Xeon Gaudi
ChatQnA Xeon Link Gaudi Link K8s Link
CodeGen Xeon Link Gaudi Link K8s Link
CodeTrans Xeon Link Gaudi Link K8s Link
DocSum Xeon Link Gaudi Link K8s Link
SearchQnA Xeon Link Gaudi Link K8s Link
FaqGen Xeon Link Gaudi Link K8s Link
Translation Xeon Link Gaudi Link K8s Link
AudioQnA Xeon Link Gaudi Link Not supported yet
VisualQnA Xeon Link Gaudi Link Not supported yet

Support Examples

Check here for detailed information of supported examples, models, hardwares, etc.

Additional Content

Description
Generative AI Examples is a collection of GenAI examples such as ChatQnA, Copilot, which illustrate the pipeline capabilities of the Open Platform for Enterprise AI (OPEA) project.
Readme 379 MiB
Languages
Shell 34%
Python 24.1%
TypeScript 16.1%
Svelte 14.2%
Vue 4.7%
Other 6.9%