Commit Graph

698 Commits

Author SHA1 Message Date
lkk
a0c80810eb update redis persistent. 2025-01-17 14:05:07 +00:00
lkk
05bd2512d6 Merge branch 'main' into enable_persistent_db 2025-01-17 19:01:50 +08:00
XinyuYe-Intel
28f9c31129 textgen ollama code refactor. (#1158)
Remove Ollama folder since default openai API is able to consume Ollama service, modified Ollama readme and added UT.
#998
Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>
2025-01-17 16:49:31 +08:00
Sihan Chen
ff094b555c fix wrong link in asr doc (#1163)
fix wrong link in asr doc

Signed-off-by: 39623753+Spycsh@users.noreply.github.com
2025-01-17 16:13:09 +08:00
lkk
cd747fc753 Merge branch 'main' into enable_persistent_db 2025-01-17 14:09:10 +08:00
Louie Tsai
ca89cf4c9f add opentelemetry tracing into OPEA DAG and couple microservices code path related to ChatQnA (#1122)
* add opentelemetry tracing into ChatQnA workflow

Signed-off-by: Louie, Tsai <louie.tsai@intel.com>
Signed-off-by: louie-tsai <louie.tsai@intel.com>

* handle stream/non-stream tracing

Signed-off-by: louie-tsai <louie.tsai@intel.com>

* pre-commit fix

Signed-off-by: louie-tsai <louie.tsai@intel.com>

* add a tag for async llm microservice execute

Signed-off-by: louie-tsai <louie.tsai@intel.com>

* add oltp tracing for retriever

Signed-off-by: louie-tsai <louie.tsai@intel.com>

* fix CI issue

Signed-off-by: louie-tsai <louie.tsai@intel.com>

---------

Signed-off-by: Louie, Tsai <louie.tsai@intel.com>
Signed-off-by: louie-tsai <louie.tsai@intel.com>
Co-authored-by: Spycsh <sihan.chen@intel.com>
2025-01-16 21:42:40 -08:00
XinyaoWa
31b9265846 Align OpenAI API for FaqGen, DocSum, TextGen-native (#1161)
* Align OpenAI API for FaqGen, DocSum, TextGen-native

Algin all the inputs to OpenAI API format for FaqGen, DocSum, TextGen-native, now all the services in llm comps should be OpenAI API compatiable

Related to issue https://github.com/opea-project/GenAIComps/issues/998

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

---------

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-01-17 10:51:54 +08:00
XinyaoWa
a65e559388 Standardize name for LLM comps (#1162)
Update all the names for classes and files in llm comps to follow the standard format

Related to issue https://github.com/opea-project/GenAIComps/issues/998

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
2025-01-16 23:06:47 +08:00
lkk
3ce77ad896 Refactor agent create assistant api (#1142)
- instantiate agent when request http://${ip_address}:9091/v1/assistants
- allow user to config agent through api call

#982
2025-01-16 22:51:37 +08:00
Letong Han
8e48849c49 Refactor Vectorstores to E-RAG Style (#1159)
Move DBs with yaml/Dockerfile into third_parties folder and delete vectorstores folder.
Modify related components for change of vectordb folder path.
Remained vectorestores:
- thirdparties
    - elasticsearch
    - milvus
    - opensearch
    - pathway
    - pgvector
    - redis
    - vdms

Fix issue https://github.com/opea-project/GenAIComps/issues/1008.

Signed-off-by: letonghan <letong.han@intel.com>
2025-01-16 19:46:11 +08:00
XinyaoWa
d6312a73b9 Fix vllm hpu to a stable release (#1156)
There exist risks with vllm-fork main branch, change to latest stable release v0.6.4.post2+Gaudi-1.19.0

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
2025-01-16 17:48:06 +08:00
Yao Qing
2c42941a31 Set the wav2lip's numpy version to 1.23.5 to ensure that there are no version mismatch errors in the Gaudi image. (#1160)
In animation/wav2lip component on Gaudi, a module that was compiled using NumPy 1.x cannot be run in NumPy 2.0.2 as it may crash. So we set numpy's version to 1.23.5.

Signed-off-by: Yao, Qing <qing.yao@intel.com>
2025-01-16 16:01:58 +08:00
Yao Qing
3802cd84a7 Refactor chathistory based on E-RAG code structure. (#1154)
Previously, the code for implementing microservice and functionalities was placed in one folder. Now the microservice and functionalities code have been separated.

Fix [Issue 989](https://github.com/opea-project/GenAIComps/issues/989)

Signed-off-by: Yao, Qing <qing.yao@intel.com>
2025-01-16 14:17:16 +08:00
Letong Han
c49db2b037 Refactor Retrievers (#1138)
Refactor retrievers into E-RAG style.
The folder structure after refactored is as below:

- retrievers
    - deployment
        - docker_compose
        - kubernetes
    - src
        - integrations
            - redis.py
            - milvus.py
            - ...
        - Dockerfile
        - opea_retrievers_microservice.py
        - README.md
        - requirements.txt
        - set_env.sh

#1004

Signed-off-by: letonghan <letong.han@intel.com>
2025-01-16 14:12:30 +08:00
XinyaoWa
6d07a064f8 Refactor LLM text generation native comps (#1151)
Part work of code refactor to combine different text generation backends, remove duplcated native langchain and llama_index folder, consice the optimum habana implementation as a native integration OPEATextGen_Native.

Add feature for issue https://github.com/opea-project/GenAIComps/issues/998

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
2025-01-16 13:21:46 +08:00
XinyaoWa
5098e82040 Refactor vllm openvino to third parties (#1141)
vllm-openvino is a dependency for text generation comps, move it to third-parties folder, add UT for both cpu and gpu.

Related to feature issue#998

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
2025-01-16 09:38:18 +08:00
qgao007
4db13298cd Build guardrail "Hallucination Detection" microservice. (#1108)
* Build guardrail "Hallucination Detection" microservice.

Signed-off-by: Qun Gao <qun.gao@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Update comps/guardrails/hallucination_detection/README.md

Co-authored-by: Daniel De León <111013930+daniel-de-leon-user293@users.noreply.github.com>

* - register Dockerfile

- rename file for hpu

- update endpoints to be consistent

Signed-off-by: Qun Gao <qun.gao@intel.com>

* Update repo structure

Signed-off-by: Qun Gao <qun.gao@intel.com>

* refactor

Signed-off-by: Qun Gao <qun.gao@intel.com>

* Refactored Hallucination Guardrail to wrap code under new OpeaComponent Class and leaverage OpeaComponentLoader Class for serving inference.

Signed-off-by: Qun Gao <qun.gao@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Updated Copyright year to reflect right date

Signed-off-by: Qun Gao <qun.gao@intel.com>

---------

Signed-off-by: Qun Gao <qun.gao@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Abolfazl Shahbazi <12436063+ashahba@users.noreply.github.com>
Co-authored-by: Daniel De León <111013930+daniel-de-leon-user293@users.noreply.github.com>
Co-authored-by: ZePan110 <ze.pan@intel.com>
2025-01-15 10:31:08 -08:00
Zhu Yongbo
9939061e38 Fix vllm openvino Dockerfile.intel_gpu build issue (#1150)
Build Dockerfile.intel_gpu will fail because there are dependency conflict when vllm==v0.6.3.post1. Upgrading vllm to v0.6.6.post1 could solve the issue

Fixes #1141
Signed-off-by: Zhu, Yongbo <yongbo.zhu@intel.com>
2025-01-15 15:32:35 +08:00
lkk
3e6c5cd8bd remove ragas. (#1149)
* remove ragas.

* remove ci docker compose.
2025-01-15 13:28:45 +08:00
XinyaoWa
4c21738ad5 Refactor llm predictionguard (#1143)
* refactor llm predictionguard

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* refine predictionguard ut

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* remove duplicated dockerfile path

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* fix bug

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix bug

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

---------

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: ZePan110 <ze.pan@intel.com>
2025-01-15 10:13:42 +08:00
Neo Zhang Jianyu
89dd6285d8 Update check-online-doc-build.yml (#1145) 2025-01-15 09:07:14 +08:00
pre-commit-ci[bot]
bfee1b14d7 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-01-14 11:27:01 +00:00
lkk
1652f00234 add initial redis persistence. 2025-01-14 11:27:27 +00:00
XinyaoWa
afb8a28b2f remove vllm hpu commit id limit (#1139)
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
2025-01-14 11:05:05 +08:00
dolpher
1cc4d2119d Add kubernetes deployment for GenAIComps (#1104)
* Add kubernetes deployment for GenAIComps

---------

Signed-off-by: Dolpher Du <dolpher.du@intel.com>
2025-01-13 15:42:33 +08:00
XinyaoWa
88f93733b0 Refactor llm Docsum (#1101)
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
2025-01-13 15:24:43 +08:00
lkk
3a7ccb0a75 add tool choices for agent. (#1126) 2025-01-13 14:42:31 +08:00
Liang Lv
fe24decd72 Fix docker compose health check issue (#1133)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: Abolfazl Shahbazi <12436063+ashahba@users.noreply.github.com>
2025-01-13 13:54:30 +08:00
Sihan Chen
feef30b0ea Refactor lvms (#1096)
Co-authored-by: ZePan110 <ze.pan@intel.com>
2025-01-13 13:06:59 +08:00
XinyaoWa
ea72c943bd Refactor FaqGen (#1093)
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-01-13 11:30:59 +08:00
Yao Qing
3f23bf582a Remove version restrictions in animations (#1132)
Signed-off-by: Yao, Qing <qing.yao@intel.com>
2025-01-10 11:00:15 -08:00
XinyuYe-Intel
9349478601 Make naming compatible to the defined style (#1129)
* make naming style compatible to the defined style.

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-01-10 15:31:58 +08:00
Liang Lv
b91911a543 Refine embedding naming and move dependency to 3rd_party (#1125)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2025-01-10 14:44:25 +08:00
Yao Qing
4f9f95574b Rename folder name integration to integrations in image2image and animation (#1130)
Signed-off-by: Yao, Qing <qing.yao@intel.com>
Co-authored-by: ZePan110 <ze.pan@intel.com>
2025-01-10 10:35:21 +08:00
chen, suyue
e7b8d25bfc Update action token (#1131)
Signed-off-by: chensuyue <suyue.chen@intel.com>
2025-01-09 17:19:04 +08:00
XinyuYe-Intel
efd95780fd Finetuning code refactor (#1081)
Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>
2025-01-09 11:50:15 +08:00
XinyuYe-Intel
2587a2978a Text2image code refactor (#1054)
Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>
2025-01-09 11:45:15 +08:00
Liang Lv
179b5da06b Refactor prompt registry microservice (#1124)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2025-01-09 11:26:03 +08:00
Liang Lv
ec66b91c51 Feedback management microservice refactor (#1057)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2025-01-09 11:18:15 +08:00
Sihan Chen
962e097893 Refactor web retriever (#1102) 2025-01-08 15:24:08 +08:00
Lianhao Lu
4480d807a3 CI: Automatically create issue in CIInfra (#1123)
Automatically create issue in CIInfra to track the changes of docker compose
files for correspdonding helm charts.

Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>
2025-01-08 15:09:27 +08:00
Liang Lv
631b570481 Refactor guardrails microservice (#1116)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2025-01-08 13:29:23 +08:00
lkk
650be0d660 fix stream issue. (#1120) 2025-01-08 10:40:27 +08:00
WenjiaoYue
267cad1f44 Refactor reranking (#1113)
Signed-off-by: WenjiaoYue <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>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: ZePan110 <ze.pan@intel.com>
2025-01-08 10:19:04 +08:00
ZePan110
ca21633a73 Change license template 2024 to 2025 (#1119)
Signed-off-by: ZePan110 <ze.pan@intel.com>
2025-01-07 19:29:45 +08:00
pre-commit-ci[bot]
76f43c9368 [pre-commit.ci] pre-commit autoupdate (#1114)
Signed-off-by: Sun, Xuehao <xuehao.sun@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-01-07 10:49:40 +08:00
Liang Lv
bf09739585 Refine Component Interface (#1106)
* Refine component interface

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* update env

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* add health check

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* update mulimodal embedding

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* update import

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* refine other components

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix dataprepissue

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* fix tts issue

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* fix ci issues

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix tts response issue

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix comments

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com>
2025-01-07 09:24:47 +08:00
lkk
cf90932fef refine agent directories. (#1109) 2025-01-06 17:35:39 +08:00
ZePan110
b933b66f15 Check duplicated dockerfile (#1073)
Signed-off-by: ZePan110 <ze.pan@intel.com>
2025-01-06 17:27:59 +08:00
chen, suyue
e96b5d91a9 Update code owner list (#1110)
Signed-off-by: chensuyue <suyue.chen@intel.com>
2025-01-06 14:53:58 +08:00