114 lines
6.8 KiB
Markdown
114 lines
6.8 KiB
Markdown
<div align="center">
|
||
|
||
# Generative AI Examples
|
||
|
||
[](https://github.com/opea-project/GenAIExamples/releases)
|
||
[](https://github.com/intel/neural-compressor/blob/master/LICENSE)
|
||
|
||
---
|
||
|
||
<div align="left">
|
||
|
||
## Introduction
|
||
|
||
GenAIComps-based Generative AI examples offer streamlined deployment, testing, and scalability. All examples are fully compatible with Docker and Kubernetes, supporting a wide range of hardware platforms such as Gaudi, Xeon, and other hardwares.
|
||
|
||
## Architecture
|
||
|
||
[GenAIComps](https://github.com/opea-project/GenAIComps) is a service-based tool that includes microservice components such as llm, embedding, reranking, and so on. Using these components, various examples in GenAIExample can be constructed, including ChatQnA, DocSum, etc.
|
||
|
||
[GenAIInfra](https://github.com/opea-project/GenAIInfra), part of the OPEA containerization and cloud-native suite, enables quick and efficient deployment of GenAIExamples in the cloud.
|
||
|
||
[GenAIEval](https://github.com/opea-project/GenAIEval) measures service performance metrics such as throughput, latency, and accuracy for GenAIExamples. This feature helps users compare performance across various hardware configurations easily.
|
||
|
||
## Getting Started
|
||
|
||
GenAIExamples offers flexible deployment options that cater to different user needs, enabling efficient use and deployment in various environments. Here’s a brief overview of the three primary methods: Python startup, Docker Compose, and Kubernetes.
|
||
|
||
1. <b>Docker Compose</b>: Check the released docker images in [docker image list](./docker_images_list.md) for detailed information.
|
||
2. <b>Kubernetes</b>: Follow the steps at [K8s Install](https://github.com/opea-project/docs/tree/main/guide/installation/k8s_install) and [GMC Install](https://github.com/opea-project/docs/blob/main/guide/installation/gmc_install/gmc_install.md) to setup k8s and GenAI environment .
|
||
|
||
Users can choose the most suitable approach based on ease of setup, scalability needs, and the environment in which they are operating.
|
||
|
||
### Deployment
|
||
|
||
<table>
|
||
<tr>
|
||
<th rowspan="3" style="text-align:center;">Use Cases</th>
|
||
<th colspan="4" style="text-align:center;">Deployment</th>
|
||
</tr>
|
||
<tr>
|
||
<td colspan="2" style="text-align:center;">Docker Compose</td>
|
||
<td rowspan="2" style="text-align:center;">Kubernetes</td>
|
||
</tr>
|
||
<tr>
|
||
<td style="text-align:center;">Xeon</td>
|
||
<td style="text-align:center;">Gaudi</td>
|
||
</tr>
|
||
<tr>
|
||
<td style="text-align:center;">ChatQnA</td>
|
||
<td><a href="https://github.com/opea-project/GenAIExamples/blob/main/ChatQnA/docker/xeon/README.md">Xeon Link</a></td>
|
||
<td><a href="https://github.com/opea-project/GenAIExamples/blob/main/ChatQnA/docker/gaudi/README.md">Gaudi Link</a></td>
|
||
<td><a href="https://github.com/opea-project/GenAIExamples/blob/main/ChatQnA/kubernetes/README.md">K8s Link</a></td>
|
||
</tr>
|
||
<tr>
|
||
<td style="text-align:center;">CodeGen</td>
|
||
<td><a href="https://github.com/opea-project/GenAIExamples/blob/main/CodeGen/docker/xeon/README.md">Xeon Link</a></td>
|
||
<td><a href="https://github.com/opea-project/GenAIExamples/blob/main/CodeGen/docker/gaudi/README.md">Gaudi Link</a></td>
|
||
<td><a href="https://github.com/opea-project/GenAIExamples/blob/main/CodeGen/kubernetes/README.md">K8s Link</a></td>
|
||
</tr>
|
||
<tr>
|
||
<td style="text-align:center;">CodeTrans</td>
|
||
<td><a href="https://github.com/opea-project/GenAIExamples/blob/main/CodeTrans/docker/xeon/README.md">Xeon Link</a></td>
|
||
<td><a href="https://github.com/opea-project/GenAIExamples/blob/main/CodeTrans/docker/gaudi/README.md">Gaudi Link</a></td>
|
||
<td><a href="https://github.com/opea-project/GenAIExamples/blob/main/CodeTrans/kubernetes/README.md">K8s Link</a></td>
|
||
</tr>
|
||
<tr>
|
||
<td style="text-align:center;">DocSum</td>
|
||
<td><a href="https://github.com/opea-project/GenAIExamples/blob/main/DocSum/docker/xeon/README.md">Xeon Link</a></td>
|
||
<td><a href="https://github.com/opea-project/GenAIExamples/blob/main/DocSum/docker/gaudi/README.md">Gaudi Link</a></td>
|
||
<td><a href="https://github.com/opea-project/GenAIExamples/blob/main/DocSum/kubernetes/README.md">K8s Link</a></td>
|
||
</tr>
|
||
<tr>
|
||
<td style="text-align:center;">SearchQnA</td>
|
||
<td><a href="https://github.com/opea-project/GenAIExamples/blob/main/SearchQnA/docker/xeon/README.md">Xeon Link</a></td>
|
||
<td><a href="https://github.com/opea-project/GenAIExamples/blob/main/SearchQnA/docker/gaudi/README.md">Gaudi Link</a></td>
|
||
<td><a href="https://github.com/opea-project/GenAIExamples/blob/main/SearchQnA/kubernetes/README.md">K8s Link</a></td>
|
||
</tr>
|
||
<tr>
|
||
<td style="text-align:center;">FaqGen</td>
|
||
<td><a href="https://github.com/opea-project/GenAIExamples/blob/main/FaqGen/docker/xeon/README.md">Xeon Link</a></td>
|
||
<td><a href="https://github.com/opea-project/GenAIExamples/blob/main/FaqGen/docker/gaudi/README.md">Gaudi Link</a></td>
|
||
<td><a href="https://github.com/opea-project/GenAIExamples/blob/main/FaqGen/kubernetes/manifests/README.md">K8s Link</a></td>
|
||
</tr>
|
||
<tr>
|
||
<td style="text-align:center;">Translation</td>
|
||
<td><a href="https://github.com/opea-project/GenAIExamples/blob/main/Translation/docker/xeon/README.md">Xeon Link</a></td>
|
||
<td><a href="https://github.com/opea-project/GenAIExamples/blob/main/Translation/docker/gaudi/README.md">Gaudi Link</a></td>
|
||
<td><a href="https://github.com/opea-project/GenAIExamples/tree/main/Translation/kubernetes">K8s Link</a></td>
|
||
</tr>
|
||
<tr>
|
||
<td style="text-align:center;">AudioQnA</td>
|
||
<td><a href="https://github.com/opea-project/GenAIExamples/blob/main/AudioQnA/docker/xeon/README.md">Xeon Link</a></td>
|
||
<td><a href="https://github.com/opea-project/GenAIExamples/blob/main/AudioQnA/docker/gaudi/README.md">Gaudi Link</a></td>
|
||
<td>Not supported yet</td>
|
||
</tr>
|
||
<tr>
|
||
<td style="text-align:center;">VisualQnA</td>
|
||
<td><a href="https://github.com/opea-project/GenAIExamples/tree/main/VisualQnA">Xeon Link</a></td>
|
||
<td><a href="https://github.com/opea-project/GenAIExamples/tree/main/VisualQnA">Gaudi Link</a></td>
|
||
<td>Not supported yet</td>
|
||
</tr>
|
||
</table>
|
||
|
||
## Support Examples
|
||
|
||
Check [here](./supported_examples.md) for detailed information of supported examples, models, hardwares, etc.
|
||
|
||
## Additional Content
|
||
|
||
- [Code of Conduct](https://github.com/opea-project/docs/tree/main/community/CODE_OF_CONDUCT.md)
|
||
- [Contribution](https://github.com/opea-project/docs/tree/main/community/CONTRIBUTING.md)
|
||
- [Security Policy](https://github.com/opea-project/docs/tree/main/community/SECURITY.md)
|
||
- [Legal Information](/LEGAL_INFORMATION.md)
|