Commit Graph

535 Commits

Author SHA1 Message Date
Eero Tamminen
a6998a1dbd Add E2E Promeheus metrics to applications (#845)
* Fix typos in BaseStatistics method names

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>

* Add HttpService "inprogress" (pending) request count metrics

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>

* Add E2E Prometheus metrics to ServiceOrchestrator

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>

* Fix: support metrics with multiple ServiceOrchestrator instances

Unlike apps, CI tests create multiple of them.

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>

* Fix: require named MicroService -> HTTPService instances

Creating multiple MicroService()s creates multiple HTTPService()s
which creates multiple Prometheus fastapi instrumentor instances.

While latter handled that fine for ChatQnA and normal HTTP metrics,
that was not the case for its "inprogress" metrics in CI.

Therefore MicroService constructor name argument is now mandatory, so
that it can be used to make "inprogress" metrics for HTTPService
instances unique.

PS. instrumentor requires HTTPService instance specific Starlette
instance, so it cannot be made singleton.

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>

* Fix: update test_token_generator()

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-04 09:58:23 +08:00
minmin-intel
c8e363901a Update RAGAgentLlama and ReActLlama (#843)
* use ChatOpenAI to interface with TGI-gaudi

Signed-off-by: minmin-intel <minmin.hou@intel.com>

* update tools for unit test

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update readme and test

Signed-off-by: minmin-intel <minmin.hou@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix test script

Signed-off-by: minmin-intel <minmin.hou@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* consolidate chat model setup

Signed-off-by: minmin-intel <minmin.hou@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update tests and readme

Signed-off-by: minmin-intel <minmin.hou@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* revert react llama output parser

---------

Signed-off-by: minmin-intel <minmin.hou@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-01 15:40:14 -07:00
Letong Han
9f68bd394b enable parameter k to get web resources (#844)
Signed-off-by: letonghan <letong.han@intel.com>
2024-11-01 15:58:36 +08:00
Lianhao Lu
6dbb0a7fd7 gpt-sovits: Run as normal user (#839)
Make gpt-sovits to run as normal user as all other opea container
images.

Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>
2024-11-01 10:03:41 +08:00
sgurunat
74df6bb728 Remote TGI/TGI services with OAuth Client Credentials authentication (#836)
* Add get_access_token method in utils.py to get the token for OAuth protected tgi and tei remote endpoints

* Update embedding_tei.py to support authentication for tei endpoints. Uses get_access_token from utils

* Update llm.py under llms faq-generation to support authentication for tgi endpoints. Uses get_access_token from utils

* Update llm.py under llms summarization to support authentication for tgi endpoints. Uses get_access_token from utils

* Update llm.py under llms text-generation to support authentication for tgi endpoints. Uses get_access_token from utils

* Update reranking_tei to support authentication for tgi endpoints. Uses get_access_token from utils

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-31 20:38:48 +08:00
Dan
d2e9c0a9dd Pinecone retriever index fix (#816)
* Update huggingface endpoint api

Replace deprecated HuggingFace Embeddings class.

Signed-off-by: Dan Middleton <dan.middleton@intel.com>

* Change timeout strategy launching retriever

Use wait-for-it script rather than an arbitrarily determined sleep.
This should be more resilient to timing variances both saving time
vs unnecessarily long sleeps and avoiding spurious failures if
CI is slow causing a container to take longer than normal to start.

Signed-off-by: Dan Middleton <dan.middleton@intel.com>

* Supress irrelevant warning during docker build

This path warning during docker build is not relevant.

Signed-off-by: Dan Middleton <dan.middleton@intel.com>

* Fix Pinecone service using ServerlessSpec index

Pinecone requires a hosted index.
That index can be a ServerlessSpec instance at a CSP or a more expensive
PodSpec based instance also hosted at a CSP.

Signed-off-by: Dan Middleton <dan.middleton@intel.com>

* Recreate index each run

For CI we need the index to be removed so the tests can be rerun
from a clean state.

Signed-off-by: Dan Middleton <dan.middleton@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: Dan Middleton <dan.middleton@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-30 21:35:16 +08:00
Letong Han
9f692c4215 Fix web_retriever batch size issue (#834)
* fix add doc batch size issue

Signed-off-by: letonghan <letong.han@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* simplify code

Signed-off-by: letonghan <letong.han@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: letonghan <letong.han@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-30 17:11:19 +08:00
rbrugaro
19330ea23f GraphRAG with llama-index (#793)
* graphRAG dataprep llama-index validated w openai endpoints

Signed-off-by: rbrugaro <rita.brugarolas.brufau@intel.com>

* llama-index graphRAG retrieval validated with openai models

Signed-off-by: rbrugaro <rita.brugarolas.brufau@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* validated code usng TGI/TEI instead of openai

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* compose.yaml for dataprep validated with neo4j, TGI/TEI, openai

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* graphRAG retriever validated and full compose.yaml

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* minor fix

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add graphRAGGateway WIP

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* graphragGateway working in E2E Example

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* fix schedule in orchestrator to support ChatCompletionRequest input

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* change default to TGI instead of openAI and add test code for neo4jretriever

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* test code for dataprep-neo4j microservice

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* improved READMES

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update docker build path for tests

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* minor fix

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* fix typo in container name

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* resolve image name conflict for hub publishing

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* add tgi validation to miicroservice tests

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* rename test script to trigger cicd correctly w hpu

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* rename test containers for cicd

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* update HF_TOKEN in TGI/TEI test scripts

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* swaped larger model so the graph isnt empty in ci test

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* set 4 hpu for 70B model in ci test

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* add extra time for large model loading cicd

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* fix tgi gaudi shard args

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* switch to chat cause chat template is needed

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* enable logs in test

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* use locally downloaded model in CI machine

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* use local model path and reduce wait time

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* clear ports before ci run

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* fix cache model access

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* fix cache model access

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* incrased wait time for tgi shards ready

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* wait until tgi connected

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* switch back to small model for testing

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* minor readability fixes

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* README fixes

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

---------

Signed-off-by: rbrugaro <rita.brugarolas.brufau@intel.com>
Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-29 22:45:44 -07:00
Chun Tao
00abba253a Enable bf16 quantization of models, and fix resetting args.audio in the microservice between runs (#832)
* Update audioqna gateway to print text, in gateway.py

Signed-off-by: Chun Tao <chun.tao@intel.com>

* updates needed for demo

Signed-off-by: Chun Tao <chun.tao@intel.com>

* original pr content

Signed-off-by: Chun Tao <chun.tao@intel.com>

* Revert "updates needed for demo"

This reverts commit f0c7a026305ace410610c9dba771699e13dde8ea.

Signed-off-by: Chun Tao <chun.tao@intel.com>

* remove improper images

Signed-off-by: Chun Tao <chun.tao@intel.com>

* Addressed some comments on previous pr

Signed-off-by: Chun Tao <chun.tao@intel.com>

* Add Dockerfile for cpu support

Signed-off-by: Chun Tao <chun.tao@intel.com>

* CODEOWNER: Update comp CODEOWNER (#757)

Signed-off-by: Yeoh, Hoong Tee <hoong.tee.yeoh@intel.com>
Signed-off-by: Chun Tao <chun.tao@intel.com>

* Add stable diffusion microservice (#729)

* add stable diffusion microservice.

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>

* added test.

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>

* changed output to images bytes data

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* unified inference and wrapper into one microservice.

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix test.

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>

---------

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Chun Tao <chun.tao@intel.com>

* Compatible with different platforms. (#766)

* Compatible with different platforms.

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Fix issue.

Signed-off-by: ZePan110 <ze.pan@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Fix issue

Signed-off-by: ZePan110 <ze.pan@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: ZePan110 <ze.pan@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Chun Tao <chun.tao@intel.com>

* Optimize path and link validity check. (#745)

Signed-off-by: ZePan110 <ze.pan@intel.com>
Signed-off-by: Chun Tao <chun.tao@intel.com>

* Add timeout for ut test (#773)

Signed-off-by: chensuyue <suyue.chen@intel.com>
Signed-off-by: Chun Tao <chun.tao@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

Signed-off-by: Chun Tao <chun.tao@intel.com>

* test hyperlink

Signed-off-by: Chun Tao <chun.tao@intel.com>

* test hyperlink

Signed-off-by: Chun Tao <chun.tao@intel.com>

* test hyperlink issue

Signed-off-by: Chun Tao <chun.tao@intel.com>

* test hyperlink issue

Signed-off-by: Chun Tao <chun.tao@intel.com>

* put back hyperlinks in readme

Signed-off-by: Chun Tao <chun.tao@intel.com>

* remove possible error hyperlink

Signed-off-by: Chun Tao <chun.tao@intel.com>

* put hyperlink back

Signed-off-by: Chun Tao <chun.tao@intel.com>

* major update to use FastAPI for wav2lip, and structure component format

Signed-off-by: Chun Tao <chun.tao@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Add dockerfiles in animation-compose-cd.yaml

Signed-off-by: Chun Tao <chun.tao@intel.com>

* Fix end of file issue in animation-compose-cd.yaml

Signed-off-by: Chun Tao <chun.tao@intel.com>

* Fix Docker deployment on Xeon

Signed-off-by: Chun Tao <chun.tao@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add versioning for all pip packages

Signed-off-by: Chun Tao <chun.tao@intel.com>

* e2e test script for animation

Signed-off-by: Chun Tao <chun.tao@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update e2e test script

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update e2e test script

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update readme

Signed-off-by: Chun Tao <chun.tao@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update

Signed-off-by: Chun Tao <chun.tao@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update gateway

Signed-off-by: Chun Tao <chun.tao@intel.com>

* udpate gateway

Signed-off-by: Chun Tao <chun.tao@intel.com>

* Fix AVATAR_CHATBOT

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update gateway

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update

Signed-off-by: Chun Tao <chun.tao@intel.com>

* test

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update gateway

Signed-off-by: Chun Tao <chun.tao@intel.com>

* fix max_tokens in AvatarChatbot gateway

Signed-off-by: Chun Tao <chun.tao@intel.com>

* test

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update

Signed-off-by: Chun Tao <chun.tao@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Model download script moved from Dockerfiles to Docker entrypoint

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update paths

Signed-off-by: Chun Tao <chun.tao@intel.com>

* Correct paths in readme

Signed-off-by: Chun Tao <chun.tao@intel.com>

* revert changes to audioqna gateway

Signed-off-by: Chun Tao <chun.tao@intel.com>

* longer wait time after docker run

Signed-off-by: Chun Tao <chun.tao@intel.com>

* add mount volume in test scripts

Signed-off-by: Chun Tao <chun.tao@intel.com>

* add volume mount in test scripts

Signed-off-by: Chun Tao <chun.tao@intel.com>

* udpate test script

Signed-off-by: Chun Tao <chun.tao@intel.com>

* udpate optimizations

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update

Signed-off-by: Chun Tao <chun.tao@intel.com>

* need outputs folder

Signed-off-by: Chun Tao <chun.tao@intel.com>

* test

Signed-off-by: Chun Tao <chun.tao@intel.com>

---------

Signed-off-by: Chun Tao <chun.tao@intel.com>
Signed-off-by: Yeoh, Hoong Tee <hoong.tee.yeoh@intel.com>
Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>
Signed-off-by: ZePan110 <ze.pan@intel.com>
Signed-off-by: chensuyue <suyue.chen@intel.com>
Co-authored-by: Hoong Tee, Yeoh <hoong.tee.yeoh@intel.com>
Co-authored-by: XinyuYe-Intel <xinyu.ye@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: ZePan110 <ze.pan@intel.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com>
2024-10-30 10:04:56 +08:00
XinyaoWa
9fec226261 Add huggingface token for native llm (#827)
* add huggingface token for native llm

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix bug

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* fix bug

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

---------

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-29 16:44:52 +08:00
minmin-intel
6c670c92eb fix pydantic version (#826)
Signed-off-by: minmin-intel <minmin.hou@intel.com>
2024-10-27 17:12:00 +08:00
lvliang-intel
49a8e35e2d Fix docsum crash issue for text summary case (#825)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-10-25 12:01:15 +08:00
lvliang-intel
fa2ea642ae Support file upload summary for DocSum microservice (#823)
* Support file upload summary for DocSum microservice

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add aiofiles

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* move file handlding in gateway

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* rollback docsum code

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* only import iin docsum

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* clean dependency

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* update code

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-24 14:14:48 +08:00
Yao Qing
5ed428f413 Set a higher default value(1.2) about repetition_penalty for codegen example to reduce repetition (#820)
Signed-off-by: Yao, Qing <qing.yao@intel.com>
2024-10-24 10:01:44 +08:00
XinyaoWa
0bb019f80f refine codetrans prompt, support parameter input (#822)
* refine codetrans prompt and parameter

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* support parameters input for codetrans

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-24 10:01:32 +08:00
xiguiw
9de71c6e8e Update Nginx default set (#821)
Nginx default file size is 1MB, change to 10GB.

Signed-off-by: Wang, Xigui <xigui.wang@intel.com>
2024-10-23 15:15:29 +08:00
Sihan Chen
c801a1f5dc fix whisper hpu and long form issue (#819) 2024-10-23 13:46:17 +08:00
Sihan Chen
02c3dfe3bf fix pydantic BaseModel in/out in dataflow (#818)
* fix protocol in/out supported types

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-23 10:22:13 +08:00
Chendi.Xue
3473bfb38b [Agent] support custom prompt (#798)
* habana_main issue fixed, now use original dockerfile

Signed-off-by: Chendi.Xue <chendi.xue@intel.com>

* Enable custom_prompt

Signed-off-by: Chendi.Xue <chendi.xue@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Update README

Signed-off-by: Chendi.Xue <chendi.xue@intel.com>

---------

Signed-off-by: Chendi.Xue <chendi.xue@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-22 19:26:32 +08:00
lvliang-intel
3a166c1833 Update nginx config template (#815)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-10-22 16:10:37 +08:00
XinyuYe-Intel
52c1826fb9 Add image2image microservice (#794)
* added image2image microservice.

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>

* added UT.

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-22 15:49:58 +08:00
chen, suyue
6518c0f046 upgrade setuptools version to fix CVE-2024-6345 (#806)
Signed-off-by: chensuyue <suyue.chen@intel.com>
2024-10-22 15:02:48 +08:00
Hoong Tee, Yeoh
1987d39f31 feedback_management: Fix mongo dependencies version (#814)
Remove fixed version for motor packages

Signed-off-by: Yeoh, Hoong Tee <hoong.tee.yeoh@intel.com>
2024-10-22 15:02:13 +08:00
Chun Tao
aea7364d4e Move Wav2Lip model download commands from Dockerfiles to entrypoint.sh (#811)
* Update audioqna gateway to print text, in gateway.py

Signed-off-by: Chun Tao <chun.tao@intel.com>

* updates needed for demo

Signed-off-by: Chun Tao <chun.tao@intel.com>

* original pr content

Signed-off-by: Chun Tao <chun.tao@intel.com>

* Revert "updates needed for demo"

This reverts commit f0c7a026305ace410610c9dba771699e13dde8ea.

Signed-off-by: Chun Tao <chun.tao@intel.com>

* remove improper images

Signed-off-by: Chun Tao <chun.tao@intel.com>

* Addressed some comments on previous pr

Signed-off-by: Chun Tao <chun.tao@intel.com>

* Add Dockerfile for cpu support

Signed-off-by: Chun Tao <chun.tao@intel.com>

* CODEOWNER: Update comp CODEOWNER (#757)

Signed-off-by: Yeoh, Hoong Tee <hoong.tee.yeoh@intel.com>
Signed-off-by: Chun Tao <chun.tao@intel.com>

* Add stable diffusion microservice (#729)

* add stable diffusion microservice.

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>

* added test.

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>

* changed output to images bytes data

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* unified inference and wrapper into one microservice.

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix test.

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>

---------

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Chun Tao <chun.tao@intel.com>

* Compatible with different platforms. (#766)

* Compatible with different platforms.

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Fix issue.

Signed-off-by: ZePan110 <ze.pan@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Fix issue

Signed-off-by: ZePan110 <ze.pan@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: ZePan110 <ze.pan@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Chun Tao <chun.tao@intel.com>

* Optimize path and link validity check. (#745)

Signed-off-by: ZePan110 <ze.pan@intel.com>
Signed-off-by: Chun Tao <chun.tao@intel.com>

* Add timeout for ut test (#773)

Signed-off-by: chensuyue <suyue.chen@intel.com>
Signed-off-by: Chun Tao <chun.tao@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

Signed-off-by: Chun Tao <chun.tao@intel.com>

* test hyperlink

Signed-off-by: Chun Tao <chun.tao@intel.com>

* test hyperlink

Signed-off-by: Chun Tao <chun.tao@intel.com>

* test hyperlink issue

Signed-off-by: Chun Tao <chun.tao@intel.com>

* test hyperlink issue

Signed-off-by: Chun Tao <chun.tao@intel.com>

* put back hyperlinks in readme

Signed-off-by: Chun Tao <chun.tao@intel.com>

* remove possible error hyperlink

Signed-off-by: Chun Tao <chun.tao@intel.com>

* put hyperlink back

Signed-off-by: Chun Tao <chun.tao@intel.com>

* major update to use FastAPI for wav2lip, and structure component format

Signed-off-by: Chun Tao <chun.tao@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Add dockerfiles in animation-compose-cd.yaml

Signed-off-by: Chun Tao <chun.tao@intel.com>

* Fix end of file issue in animation-compose-cd.yaml

Signed-off-by: Chun Tao <chun.tao@intel.com>

* Fix Docker deployment on Xeon

Signed-off-by: Chun Tao <chun.tao@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add versioning for all pip packages

Signed-off-by: Chun Tao <chun.tao@intel.com>

* e2e test script for animation

Signed-off-by: Chun Tao <chun.tao@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update e2e test script

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update e2e test script

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update readme

Signed-off-by: Chun Tao <chun.tao@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update

Signed-off-by: Chun Tao <chun.tao@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update gateway

Signed-off-by: Chun Tao <chun.tao@intel.com>

* udpate gateway

Signed-off-by: Chun Tao <chun.tao@intel.com>

* Fix AVATAR_CHATBOT

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update gateway

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update

Signed-off-by: Chun Tao <chun.tao@intel.com>

* test

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update gateway

Signed-off-by: Chun Tao <chun.tao@intel.com>

* fix max_tokens in AvatarChatbot gateway

Signed-off-by: Chun Tao <chun.tao@intel.com>

* test

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update

Signed-off-by: Chun Tao <chun.tao@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Model download script moved from Dockerfiles to Docker entrypoint

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update paths

Signed-off-by: Chun Tao <chun.tao@intel.com>

* Correct paths in readme

Signed-off-by: Chun Tao <chun.tao@intel.com>

* revert changes to audioqna gateway

Signed-off-by: Chun Tao <chun.tao@intel.com>

* longer wait time after docker run

Signed-off-by: Chun Tao <chun.tao@intel.com>

* add mount volume in test scripts

Signed-off-by: Chun Tao <chun.tao@intel.com>

* add volume mount in test scripts

Signed-off-by: Chun Tao <chun.tao@intel.com>

* udpate test script

Signed-off-by: Chun Tao <chun.tao@intel.com>

---------

Signed-off-by: Chun Tao <chun.tao@intel.com>
Signed-off-by: Yeoh, Hoong Tee <hoong.tee.yeoh@intel.com>
Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>
Signed-off-by: ZePan110 <ze.pan@intel.com>
Signed-off-by: chensuyue <suyue.chen@intel.com>
Co-authored-by: Hoong Tee, Yeoh <hoong.tee.yeoh@intel.com>
Co-authored-by: XinyuYe-Intel <xinyu.ye@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: ZePan110 <ze.pan@intel.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com>
2024-10-22 10:32:43 +08:00
JoshuaL3000
f36ca8086e Fix llm not recognizing pydantic tool definition for dict types (#782)
* Update src/tools.py pydantic imports to use v1 from v2

Signed-off-by: JoshuaL3000 <joshua.jian.ern.liew@intel.com>
2024-10-21 17:33:25 +08:00
bjzhjing
7e085a4bc1 retriever/milvus: Update README.md (#789)
Signed-off-by: Cathy Zhang <cathy.zhang@intel.com>
Co-authored-by: lvliang-intel <liang1.lv@intel.com>
2024-10-21 17:32:26 +08:00
Sihan Chen
70644465ee add optimization params to dockerfile (#804) 2024-10-21 17:30:43 +08:00
Chun Tao
c26d37e709 Initiate "animation" component (#775)
* updates needed for demo

Signed-off-by: Chun Tao <chun.tao@intel.com>

* original pr content

Signed-off-by: Chun Tao <chun.tao@intel.com>

* Revert "updates needed for demo"

This reverts commit f0c7a026305ace410610c9dba771699e13dde8ea.

Signed-off-by: Chun Tao <chun.tao@intel.com>

* remove improper images

Signed-off-by: Chun Tao <chun.tao@intel.com>

* Addressed some comments on previous pr

Signed-off-by: Chun Tao <chun.tao@intel.com>

* Add Dockerfile for cpu support

Signed-off-by: Chun Tao <chun.tao@intel.com>

* CODEOWNER: Update comp CODEOWNER (#757)

Signed-off-by: Yeoh, Hoong Tee <hoong.tee.yeoh@intel.com>
Signed-off-by: Chun Tao <chun.tao@intel.com>

* Add stable diffusion microservice (#729)

* add stable diffusion microservice.

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>

* added test.

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>

* changed output to images bytes data

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* unified inference and wrapper into one microservice.

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix test.

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>

---------

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Chun Tao <chun.tao@intel.com>

* Compatible with different platforms. (#766)

* Compatible with different platforms.

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Fix issue.

Signed-off-by: ZePan110 <ze.pan@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Fix issue

Signed-off-by: ZePan110 <ze.pan@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: ZePan110 <ze.pan@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Chun Tao <chun.tao@intel.com>

* Optimize path and link validity check. (#745)

Signed-off-by: ZePan110 <ze.pan@intel.com>
Signed-off-by: Chun Tao <chun.tao@intel.com>

* Add timeout for ut test (#773)

Signed-off-by: chensuyue <suyue.chen@intel.com>
Signed-off-by: Chun Tao <chun.tao@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

Signed-off-by: Chun Tao <chun.tao@intel.com>

* test hyperlink

Signed-off-by: Chun Tao <chun.tao@intel.com>

* test hyperlink

Signed-off-by: Chun Tao <chun.tao@intel.com>

* test hyperlink issue

Signed-off-by: Chun Tao <chun.tao@intel.com>

* test hyperlink issue

Signed-off-by: Chun Tao <chun.tao@intel.com>

* put back hyperlinks in readme

Signed-off-by: Chun Tao <chun.tao@intel.com>

* remove possible error hyperlink

Signed-off-by: Chun Tao <chun.tao@intel.com>

* put hyperlink back

Signed-off-by: Chun Tao <chun.tao@intel.com>

* major update to use FastAPI for wav2lip, and structure component format

Signed-off-by: Chun Tao <chun.tao@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Add dockerfiles in animation-compose-cd.yaml

Signed-off-by: Chun Tao <chun.tao@intel.com>

* Fix end of file issue in animation-compose-cd.yaml

Signed-off-by: Chun Tao <chun.tao@intel.com>

* Fix Docker deployment on Xeon

Signed-off-by: Chun Tao <chun.tao@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add versioning for all pip packages

Signed-off-by: Chun Tao <chun.tao@intel.com>

* e2e test script for animation

Signed-off-by: Chun Tao <chun.tao@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update e2e test script

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update e2e test script

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update readme

Signed-off-by: Chun Tao <chun.tao@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update

Signed-off-by: Chun Tao <chun.tao@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update gateway

Signed-off-by: Chun Tao <chun.tao@intel.com>

* udpate gateway

Signed-off-by: Chun Tao <chun.tao@intel.com>

* Fix AVATAR_CHATBOT

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update gateway

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update

Signed-off-by: Chun Tao <chun.tao@intel.com>

* test

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update gateway

Signed-off-by: Chun Tao <chun.tao@intel.com>

* fix max_tokens in AvatarChatbot gateway

Signed-off-by: Chun Tao <chun.tao@intel.com>

* test

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update

Signed-off-by: Chun Tao <chun.tao@intel.com>

* update

Signed-off-by: Chun Tao <chun.tao@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: Chun Tao <chun.tao@intel.com>
Signed-off-by: Yeoh, Hoong Tee <hoong.tee.yeoh@intel.com>
Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>
Signed-off-by: ZePan110 <ze.pan@intel.com>
Signed-off-by: chensuyue <suyue.chen@intel.com>
Co-authored-by: Hoong Tee, Yeoh <hoong.tee.yeoh@intel.com>
Co-authored-by: XinyuYe-Intel <xinyu.ye@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: ZePan110 <ze.pan@intel.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com>
2024-10-21 15:52:24 +08:00
Sihan Chen
f6c3a82f9a fix audio/wav response handling (#802)
* fix audio output

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-18 16:07:27 +08:00
lvliang-intel
1a807407eb Fix nginx streaming without caching the response (#797)
* Fix nginx streaming without caching the response

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* add gzip off

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

---------

Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-10-17 20:41:45 +08:00
XinyaoWa
9a00a3eac9 Support Chinese for Docsum (#799)
* support Chinese for DocSum

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* refine readme for docsum

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com>
2024-10-17 14:58:40 +08:00
lvliang-intel
2f0d78e504 Add dataprep endpoint in nginx microservice (#795)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-10-16 17:43:53 +08:00
Harsha Ramayanam
c6dfeb0a2d Changes to comps/llms/text-generation/README.md (#783)
* Removed redundant commands

Signed-off-by: Harsha Ramayanam <harsha.ramayanam@intel.com>

* Changes to README and minor code improvements

Signed-off-by: Harsha Ramayanam <harsha.ramayanam@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Changes added per review

Signed-off-by: Harsha Ramayanam <harsha.ramayanam@intel.com>

---------

Signed-off-by: Harsha Ramayanam <harsha.ramayanam@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-14 17:34:39 -07:00
Hoong Tee, Yeoh
a719b61813 [rerank]: Refine documentation for rerank comps (#758)
* [rerank]: Refine documentation for rerank comps

Restructure and refine README documentation for rerank components.

Signed-off-by: Yeoh, Hoong Tee <hoong.tee.yeoh@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* rerank-doc: Break up README lines for viewing experience

Signed-off-by: Yeoh, Hoong Tee <hoong.tee.yeoh@intel.com>

---------

Signed-off-by: Yeoh, Hoong Tee <hoong.tee.yeoh@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com>
2024-10-14 15:35:33 +08:00
Sihan Chen
6da7db9ea4 Add GPT-SoVITS microservice (#784)
* add gpt-sovits microservice

* add readme

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix

* fix eol

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-12 10:44:46 +08:00
Daniel De León
5bb4046b34 Add WildGuard Guardrail Microservice (#710)
* add wildguard microservice

Signed-off-by: Daniel Deleon <daniel.de.leon@intel.com>

* fix paths for wildguard dir

Signed-off-by: Daniel Deleon <daniel.de.leon@intel.com>

* add README

Signed-off-by: Daniel Deleon <daniel.de.leon@intel.com>

* add wildguard to guardrail README table

Signed-off-by: Daniel Deleon <daniel.de.leon@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* updates per code review

Signed-off-by: Daniel Deleon <daniel.de.leon@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update tokens

Signed-off-by: Daniel Deleon <daniel.de.leon@intel.com>

* fix endpoint for testing

Signed-off-by: Daniel Deleon <daniel.de.leon@intel.com>

* fixed formating

Signed-off-by: Daniel Deleon <daniel.de.leon@intel.com>

* add more description of wildguard to distinguish from llamaguard

Signed-off-by: Daniel Deleon <daniel.de.leon@intel.com>

---------

Signed-off-by: Daniel Deleon <daniel.de.leon@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Letong Han <106566639+letonghan@users.noreply.github.com>
Co-authored-by: Abolfazl Shahbazi <abolfazl.shahbazi@intel.com>
Co-authored-by: ZePan110 <ze.pan@intel.com>
2024-10-11 12:10:15 -07:00
Dan
367b3aa5db Remove unused imports across all comps (#762)
* Remove unused imports across all comps

Linting was configured to ignore unused imports.
Remove this accumulation of lint.

Signed-off-by: Dan Middleton <dan.middleton@intel.com>

* Explain why we ignore unused imports with ruff

In general unused imports should be removed, but ruff can't and probably
shouldn't automatically fix conditional imports. Consequently,
conditional imports will always be left over causing false
positive errors in CI.

Signed-off-by: Dan Middleton <dan.middleton@intel.com>

* Revert pinecone import changes

Signed-off-by: Dan Middleton <dan.middleton@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: Dan Middleton <dan.middleton@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-11 22:11:59 +08:00
bjzhjing
84374a5710 Refactor milvus dataprep and retriever (#728)
* milvus: Refactor embedding settings for mivlus dataprep and retriever

Milvus dataprep and retriever leverage the same embedding enpoints, but
the embedding-related code is somewhat messed up, unify the namings and
logic to improve code readability and facilitate user-friendly
configuration.

Signed-off-by: Cathy Zhang <cathy.zhang@intel.com>

* MOSEC: Rename EMB_MODEL env as MOSEC_EMBEDDING_MODEL

Signed-off-by: Cathy Zhang <cathy.zhang@intel.com>

* milvus/dataprep: Update README for milvus dataprep

Signed-off-by: Cathy Zhang <cathy.zhang@intel.com>

* Add OCR package for Milvus dataprep

Signed-off-by: Cathy Zhang <cathy.zhang@intel.com>

* Update Milvus dataprep test script

This is to fix the CI issue for MILVUS environment variable name is
update.

Signed-off-by: Cathy Zhang <cathy.zhang@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: Cathy Zhang <cathy.zhang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-09 23:57:19 -07:00
Abolfazl Shahbazi
9e85b35b0b Remove proxy definition from within Dockerfiles (#778)
Signed-off-by: Abolfazl Shahbazi <abolfazl.shahbazi@intel.com>
2024-10-10 12:00:32 +08:00
Abolfazl Shahbazi
251907d494 Remove 'vim' from Dockerfiles (#776)
Signed-off-by: Abolfazl Shahbazi <abolfazl.shahbazi@intel.com>
2024-10-10 11:37:35 +08:00
Abolfazl Shahbazi
4f5010cfbb add 'no-cache-dir' to all pip installs (#777)
* add 'no-cache-dir' to all pip installs

Signed-off-by: Abolfazl Shahbazi <abolfazl.shahbazi@intel.com>

* Fix a malformed line break

Signed-off-by: Abolfazl Shahbazi <abolfazl.shahbazi@intel.com>

---------

Signed-off-by: Abolfazl Shahbazi <abolfazl.shahbazi@intel.com>
2024-10-10 11:09:47 +08:00
lvliang-intel
f3746dc85c Async support for some microservices (#763)
* Async support for some microservices

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix issues

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* fix issues

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* fix import issue

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add dependency library

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* fix issue

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* roll back pinecone change

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-10 09:51:27 +08:00
XinyuYe-Intel
5d0c436704 Add stable diffusion microservice (#729)
* add stable diffusion microservice.

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>

* added test.

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>

* changed output to images bytes data

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* unified inference and wrapper into one microservice.

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix test.

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>

---------

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-08 14:13:52 +08:00
Abolfazl Shahbazi
dfd89affe6 Misc docker updates (#761)
Signed-off-by: Abolfazl Shahbazi <abolfazl.shahbazi@intel.com>
2024-10-08 08:49:47 +08:00
David Kinder
6a1263b52b doc: fix heading level (#759)
Signed-off-by: David B. Kinder <dbkinder1@msn.com>
Co-authored-by: David B. Kinder <dbkinder1@msn.com>
2024-10-07 10:03:04 +08:00
Krishna Murti
2c0e43c318 🐛 bugfix : Retrieval on llm-vllm service not working (#756)
* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* 🐛 bugfix: Retrieval on llm-vllm service not working

Signed-off-by: Krishna Murti <krishna.murti@intel.com>

* removed LLMParamDocs instantiation

Signed-off-by: Krishna Murti <krishna.murti@intel.com>

* applied patch for recent updates for llm-vllm

Signed-off-by: Krishna Murti <krishna.murti@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: Krishna Murti <krishna.murti@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-03 09:46:04 +05:30
Chendi.Xue
4638c1d45d Enable vllm for Agent (#752)
* Update Agent vllm client codes and test

Signed-off-by: Chendi.Xue <chendi.xue@intel.com>

* Update README with vllm related update

Signed-off-by: Chendi.Xue <chendi.xue@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* remove unnecessary env

Signed-off-by: Chendi.Xue <chendi.xue@intel.com>

* support plan_execute with vllm

Signed-off-by: Chendi.Xue <chendi.xue@intel.com>

* Update README and test llama vllm support

Signed-off-by: Chendi.Xue <chendi.xue@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update vllm_port to 8086 avoid conflict

Signed-off-by: Chendi.Xue <chendi.xue@intel.com>

---------

Signed-off-by: Chendi.Xue <chendi.xue@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-01 10:44:14 -07:00
Hoong Tee, Yeoh
c612a7f333 [text-to-sql]: Refine and minor code refactor (#755)
* Minor code refactor

Remove unused import, unused variables and enhance docstring of code.

Signed-off-by: Yeoh, Hoong Tee <hoong.tee.yeoh@intel.com>

* text-to-sql: Refine documentation

Signed-off-by: Yeoh, Hoong Tee <hoong.tee.yeoh@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* [text-to-sql]: Restructure README documentation

Signed-off-by: Yeoh, Hoong Tee <hoong.tee.yeoh@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: Yeoh, Hoong Tee <hoong.tee.yeoh@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-01 09:00:36 +05:30
lvliang-intel
534c227a6e Support Llama3.2 vision and vision guard model (#753)
* Support llama3.2 models

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* fix issues

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* update code and doc

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* add llama vision guard support

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* add llama guard prompt format utils

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* add tp support

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* add wheel

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* fix accuracy issue

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* update tp service code

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* update dockerfile

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* support lvm tp serving

Signed-off-by: letonghan <letong.han@intel.com>

* update dockerfile

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* add run tp script

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* fix max_new_tokens

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* update run tp script

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* refine code and doc

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* install transformers from local wheel

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* update  code using official transformers

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* remove unnecessary code

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* remove blank line

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* fix precommit issues

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* fix cd issue

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Signed-off-by: letonghan <letong.han@intel.com>
Co-authored-by: letonghan <letong.han@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-30 17:27:41 +09:00
Dan
405a2fc68a Remove unused imports from reranks (#751)
* Remove unused imports

Signed-off-by: Dan Middleton <dan.middleton@intel.com>

* Remove unused imports

Signed-off-by: Dan Middleton <dan.middleton@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: Dan Middleton <dan.middleton@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-30 09:37:15 +08:00