# Generative AI Examples

## Introduction
GenAIComps-based Generative AI examples offer streamlined deployment, testing, and scalability. All examples are fully compatible with Docker and Kubernetes and support a wide range of hardware platforms, including Gaudi and Xeon.
## Architecture
GenAIComps is a service-based tool that includes microservice components such as LLM, embedding, reranking, and so on. Using these components, various examples in GenAIExamples can be constructed, including ChatQnA, DocSum, etc.
GenAIInfra, part of the OPEA containerization and cloud-native suite, enables quick and efficient deployment of GenAIExamples in the cloud.
GenAIEval measures service performance metrics such as throughput, latency, and accuracy for GenAIExamples. This feature helps users compare performance across various hardware configurations easily.
## Getting Started
GenAIExamples offers flexible deployment options that cater to different user needs, enabling efficient use and deployment in various environments. Here’s a brief overview of the three primary methods: Python startup, Docker Compose, and Kubernetes.
- Docker Compose: Check the released Docker images in the docker image list for detailed information. A minimal startup sketch is shown after this list.
- Kubernetes: Follow the steps at K8s Install and GMC Install to set up the Kubernetes and GenAI environment.
Users can choose the most suitable approach based on ease of setup, scalability needs, and the environment in which they are operating.
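As a quick orientation, the following is a minimal Docker Compose sketch using ChatQnA on a Xeon host. The repository path, compose directory, and environment variables shown here are illustrative assumptions; the exact layout and required settings differ per example and release, so always follow the linked example README.

```bash
# Minimal sketch, assuming a Xeon host with Docker and Docker Compose installed.
# The directory path and environment variables below are illustrative; check the
# example's own README for the exact locations and required settings.
git clone https://github.com/opea-project/GenAIExamples.git
cd GenAIExamples/ChatQnA/docker_compose/intel/cpu/xeon   # path is illustrative

# Most examples expect a Hugging Face token and the host IP to be exported first.
export HUGGINGFACEHUB_API_TOKEN="<your-hf-token>"
export host_ip=$(hostname -I | awk '{print $1}')

docker compose up -d   # start all microservices for the example
docker compose ps      # verify the containers are running
```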
## Deployment
| Use Cases   | Docker Compose (Xeon) | Docker Compose (Gaudi) | Kubernetes        |
| ----------- | --------------------- | ---------------------- | ----------------- |
| ChatQnA     | Xeon Link             | Gaudi Link             | K8s Link          |
| CodeGen     | Xeon Link             | Gaudi Link             | K8s Link          |
| CodeTrans   | Xeon Link             | Gaudi Link             | K8s Link          |
| DocSum      | Xeon Link             | Gaudi Link             | K8s Link          |
| SearchQnA   | Xeon Link             | Gaudi Link             | K8s Link          |
| FaqGen      | Xeon Link             | Gaudi Link             | K8s Link          |
| Translation | Xeon Link             | Gaudi Link             | K8s Link          |
| AudioQnA    | Xeon Link             | Gaudi Link             | Not supported yet |
| VisualQnA   | Xeon Link             | Gaudi Link             | Not supported yet |
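For the Kubernetes entries in the table above, the typical flow is to apply the example's manifest into a namespace and wait for the pods to become ready. The namespace and manifest name below are hypothetical placeholders; the actual manifests and install steps depend on the GenAIInfra / GMC release, so follow the linked K8s Install and GMC Install guides.

```bash
# Illustrative flow only: manifest names and namespaces are placeholders; the real
# files come from the GenAIInfra / GMC install guides linked in Getting Started.
kubectl create namespace chatqa
kubectl apply -f chatQnA_xeon.yaml -n chatqa   # hypothetical manifest name
kubectl get pods -n chatqa                     # wait until all pods are Running
```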
## Supported Examples
Check here for detailed information on supported examples, models, hardware, and more.