Commit Graph

698 Commits

Author SHA1 Message Date
ZePan110
dca337d90b Add empty list check (#914)
* Add outputs.

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Add empty list check

Signed-off-by: ZePan110 <ze.pan@intel.com>

* test CI.

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Remove test files

Signed-off-by: ZePan110 <ze.pan@intel.com>

* remove debug code

Signed-off-by: chensuyue <suyue.chen@intel.com>

---------

Signed-off-by: ZePan110 <ze.pan@intel.com>
Signed-off-by: chensuyue <suyue.chen@intel.com>
Co-authored-by: chensuyue <suyue.chen@intel.com>
2024-11-18 22:46:00 +08:00
Chun Tao
2a98120edf Add "--no-verbose" flag to wget download commands in entrypoint (#909)
Signed-off-by: Chun Tao <chun.tao@intel.com>
2024-11-18 11:12:35 +08:00
lvliang-intel
8e148a3924 Add env for pass down model id in ChatQnA gateway (#906)
* Pass down model id for ChatQnA

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update logic

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-16 10:13:52 +08:00
Melanie Hart Buehler
c823157428 Fix units of incorrect caption timestamps (#907)
Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com>
2024-11-15 11:12:17 -08:00
Sihan Chen
d547872c9c add zero-shot vc readme (#904) 2024-11-15 15:10:39 +08:00
XinyaoWa
e1475acb55 vllm hpu fix version for bug fix (#903)
* vllm test

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* fix vllm hpu version to fix bug

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* refine readme

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* fix vllm version

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* update vllm ut model

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* revert agent

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* fix bug

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

---------

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
2024-11-15 15:10:27 +08:00
Mustafa
d211cb2dbd Docsum Gateway Fix (#902)
* update gateway

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* update the gateway

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* update the gateway

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: Mustafa <mustafa.cetin@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-15 11:14:50 +08:00
Melanie Hart Buehler
405a632b31 Bugfix for follow-up query with a .png image (#900)
* MultimodalQnA bugfix for follow-up query with a .png image

Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Abolfazl Shahbazi <12436063+ashahba@users.noreply.github.com>
2024-11-14 15:42:22 -08:00
rbrugaro
0163ea6f4e trim input to TGI, moved clustering and summarization to dataprep and store in DB (#893)
* trim input to TGI, moved clustering and summarization to dataprep and DB store

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* removed inspect_db causing error in precommit

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add HF token to dataprep container because tokenizer is used now

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* updated READMEs to reflect latest changes

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* bug fix all files are ingested and graph extracted first followed by 1 cluster call for full graph in database

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update README based on fix for multifile

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* Changes to make graphrag ui work

Signed-off-by: theresa <theresa.shan@intel.com>

* fix bug build communities done once at end of ingestion

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* minor fixes

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* README fixes

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

---------

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>
Signed-off-by: theresa <theresa.shan@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: theresa <theresa.shan@intel.com>
2024-11-14 15:29:23 -08:00
lvliang-intel
517a5b04a8 Fix LLM special token issue (#895)
* Fix LLM special token issue

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* update code

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* update logic

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* update vllm llm

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

---------

Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: ZePan110 <ze.pan@intel.com>
2024-11-14 21:26:15 +08:00
lkk
32bcde4528 fix history content from agent memory. (#899)
* fix history content from agent memory.

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-14 21:26:01 +08:00
ZePan110
0dbf57751b Standardize the naming format of images (#898)
Signed-off-by: ZePan110 <ze.pan@intel.com>
2024-11-14 18:22:27 +08:00
XinyaoWa
7bf1953c23 Embedding compatible with OpenAI API (#892)
* Embedding TEI Langchain compatible with OpenAI API

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* TextDoc support list

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* support tei llama index openai compatible API

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* support mosec langchain openai compatible API

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* update UT for embedding tests

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix ut bug

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* support embedding predictionguard  openai compatible API

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* support embedding multimodal clip OpenAI compatible API

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* fix bug

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* enable debug mode for embedding UT

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com>
Co-authored-by: ZePan110 <ze.pan@intel.com>
2024-11-14 09:29:36 +08:00
Eero Tamminen
441882419a Minor simplication to ServiceOrchestrator code (#889)
* Drop dump_outputs() method that obfuscates the code

dump_outputs() method in ServiceOrchestrator:
* Is not real method (does not use self)
* Adds a member to a dict instead of "dump"ing (drop or output) something
* Obfuscates how schedule() method return value is constructed, and
* Makes calling code unnecessary longer

Similar method in "ServiceOrchestratorWithYaml" is reasonable except
for the name, but drop also that for consistency.

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>

* Apply pylint simplification suggestion to execute()

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>

---------

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>
Co-authored-by: Sihan Chen <39623753+Spycsh@users.noreply.github.com>
2024-11-13 23:44:11 +08:00
sgurunat
e3812a7417 Multiple models and remote service support for langchain vLLM text-generation (#887)
* Multiple models support for langchain vLLM text-generation

Signed-off-by: sgurunat <gurunath.s@intel.com>

* Add authentication support for langchain vLLM text-generation remote endpoints

Signed-off-by: sgurunat <gurunath.s@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: sgurunat <gurunath.s@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-13 21:58:19 +08:00
ZePan110
a1712035a4 Block links that require real person verification (#897)
Signed-off-by: ZePan110 <ze.pan@intel.com>
2024-11-13 21:43:20 +08:00
sgurunat
e879366cf8 Multiple models support for LLM TGI (#835)
* Update gateway and docarray from mega and proto services to have model field for ChatQnAGateway and LLMParams respectively

* Add load_model_configs method in utils.py to validate and load the model_configs

* Update llms text-generation tgi file (llm.py) to support multiple models. Uses load_model_configs method from utils

* Update llms text-generation tgi template to add different templates for different models

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fixed llm_endpoint empty string issue on error scenario

Signed-off-by: sgurunat <gurunath.s@intel.com>

* Function to get llm_endpoint and keep the code clean

Signed-off-by: sgurunat <gurunath.s@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: sgurunat <gurunath.s@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-13 17:11:43 +08:00
ZePan110
9e471a9ecc Block links that require real person verification (#896)
Signed-off-by: ZePan110 <ze.pan@intel.com>
2024-11-13 16:42:57 +08:00
sgurunat
550325d8cb vLLM support for DocSum (#885)
* Add model parameter for DocSumGateway in gateway.py file

Signed-off-by: sgurunat <gurunath.s@intel.com>

* Add langchain vllm support for DocSum along with authentication support for vllm endpoints

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Updated docker_compose_llm.yaml and README file with vLLM information

Signed-off-by: sgurunat <gurunath.s@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Updated docsum-vllm Dockerfile into llm-compose-cd.yaml under github workflows

Signed-off-by: sgurunat <gurunath.s@intel.com>

* Updated llm-compose.yaml file to include vllm sumarization docker build

Signed-off-by: sgurunat <gurunath.s@intel.com>

---------

Signed-off-by: sgurunat <gurunath.s@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: lvliang-intel <liang1.lv@intel.com>
2024-11-13 15:20:15 +08:00
sgurunat
f5c60f10b1 vLLM support for FAQGen (#884)
* Add model parameter for FaqGenGateway in gateway.py file

Signed-off-by: sgurunat <gurunath.s@intel.com>

* Add langchain vllm support for FaqGen along with authentication support for vllm endpoints

Signed-off-by: sgurunat <gurunath.s@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Updated docker_compose_llm.yaml and README file with vLLM information

Signed-off-by: sgurunat <gurunath.s@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Updated faq-vllm Dockerfile into llm-compose-cd.yaml under github workflows

Signed-off-by: sgurunat <gurunath.s@intel.com>

* Updated llm-compose.yaml file to include vllm faqgen build

Signed-off-by: sgurunat <gurunath.s@intel.com>

---------

Signed-off-by: sgurunat <gurunath.s@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-13 14:17:49 +08:00
Mustafa
baafa402c2 Add support for Audio and Video summarization to Docsum (#865)
* v2a services

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* add a2t - llm

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* update whisper serve

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* updates

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* add data service

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* gateway

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* clean gateway & orchestrator

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* updates

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* updates

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* adding functional tests

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* updates

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* updates

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* updates read me file

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* name changes

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* update readme file

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* update readme file

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* update readme file

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* update readme file

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* update readme file

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* update max token option

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* update the test files

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* readme updtes

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* readme updtes

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* clean code

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* update dataprep-compose-cd.yaml file

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* merge and sync

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* merge and sync gateway

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* adding the copyright header

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* update the end of file char

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update gateway

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* update gateway-docsum

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix test files

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* fix test files

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* merge & sync

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* Update dataprep-compose.yaml

* update test

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* update dataprep-compose

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

---------

Signed-off-by: Mustafa <mustafa.cetin@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Abolfazl Shahbazi <12436063+ashahba@users.noreply.github.com>
Co-authored-by: ZePan110 <ze.pan@intel.com>
2024-11-12 21:51:45 -08:00
Eero Tamminen
3b106c82ef Replace HTTP "inprogress" gauge with megaservice "request_pending" one (#864)
* Add "megaservice_request_pending" metric

Unlike other megaservice ServiceOrchestrator metrics, this covers (can
cover) also non-streaming requests, as suggested in PR review.

This does not have issues Prometheus-fastapi-instrumentator
"inprogress" metric did:
* Extra instances which have to be differentiated e.g. for CI
* Rely on name -> suffix coming through obscure kwargs calls

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>

* Remove HTTP "inprogress" gauge as redundant

Now that ServiceOrchestrator provides pending metric.

Reverts the "inprogress" metric part of commit a6998a1dbd.

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>

* Document megaservice metrics

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-13 09:52:24 +08:00
ZePan110
2d0eea90d2 quick fix (#894)
Signed-off-by: ZePan110 <ze.pan@intel.com>
2024-11-13 09:41:02 +08:00
Abolfazl Shahbazi
f1594cb54f Fix missing end of file chars (#874)
Signed-off-by: Abolfazl Shahbazi <12436063+ashahba@users.noreply.github.com>
2024-11-13 09:34:03 +08:00
lkk
e39b08f3d0 agent short & long term memory with langgraph. (#851)
* draft a demo code for memory.

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add agent short-term memory with langgraph checkpoint.

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add save long-term memory func.

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add save long-term memory func.

* add timeout for llm response.

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix ut with adding -e HABANA_VISIBLE_DEVICES=all.

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-12 17:28:37 +08:00
sgurunat
24b9f03f48 vLLM support for Codegen (#886)
* Add model parameter for CodeGenGateway in gateway.py file

Signed-off-by: sgurunat <gurunath.s@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: sgurunat <gurunath.s@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-12 15:23:31 +08:00
ZePan110
23c99c1170 Combine CI/CD docker compose. (#861)
* Combine CI/CD docker compose yaml files.

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Fix path issue in script

Signed-off-by: ZePan110 <ze.pan@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Combine CI/CD docker compose yaml files.

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Fix path issue in script

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Fix path

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Combine CI/CD for .github/workflows/_comps-workflow.yml and .github/workflows/_get-image-list.yml

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Combine compose yaml file

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Remove CD file

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Add vllm-fork to CI.

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Fix issue

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Add CICD mode

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Add embedding-reranking-local to compose.yaml

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Change descriptions

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Fix script issue.

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Add new image.

Signed-off-by: ZePan110 <ze.pan@intel.com>

* remove useless image from llms-compose.yaml

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Change folder name.

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Standardize the format of image names.

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Add commit print

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Standardize the format of image names

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Add vllm-arc to compose file.

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Restore test_agent_langchain_on_intel_hpu.sh

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Remove useless comments

Signed-off-by: ZePan110 <ze.pan@intel.com>

---------

Signed-off-by: ZePan110 <ze.pan@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-12 15:15:21 +08:00
XinyuYe-Intel
37f35140cc Add DPO support in finetuning microservice (#857)
* added dpo support.

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>

* make dpo trainer compatible with newest transformers.

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>

* added ut for dpo.

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>

* added training successfulness check in finetuning ut.

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* updated broken link.

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>

---------

Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: ZePan110 <ze.pan@intel.com>
2024-11-12 11:35:03 +08:00
qgao007
9a50131d69 Enable bash scr to to be path-independent using $0 to address ERROR: failed to solve: failed to read dockerfile: open Dockerfile.intel_hpu: no such file or director when following README (#808)
Signed-off-by: Qun Gao <qun.gao@intel.com>
Co-authored-by: ZePan110 <ze.pan@intel.com>
Co-authored-by: Abolfazl Shahbazi <12436063+ashahba@users.noreply.github.com>
2024-11-08 12:04:06 -08:00
senhui2intel
52757b382c Enable Intel ARC gpu test for vllm openvino. (#856)
Signed-off-by: senhui2intel <senhui.zhang@intel.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com>
2024-11-08 22:38:27 +08:00
Neo Zhang Jianyu
09980b5355 opt CI to skip none MD and RST files (#873)
Signed-off-by: ZhangJianyu <zhang.jianyu@outlook.com>
2024-11-08 22:07:10 +08:00
Letong Han
75eb864d78 update llm endpoint validation commands (#869)
Signed-off-by: letonghan <letong.han@intel.com>
2024-11-08 19:45:06 +08:00
dolpher
ca6a4e3609 Remove health check log (#853)
Signed-off-by: Dolpher Du <dolpher.du@intel.com>
Co-authored-by: ZePan110 <ze.pan@intel.com>
2024-11-08 15:54:58 +08:00
Yao Qing
46ff36c008 Fixed the issue of asynchronous call failure for MosecEmbeddings (#871)
* Fixed the issue of asynchronous call failure for MosecEmbeddings

Signed-off-by: Yao, Qing <qing.yao@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Add import asyncio

Signed-off-by: Yao, Qing <qing.yao@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: Yao, Qing <qing.yao@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-08 15:54:16 +08:00
Neo Zhang Jianyu
ef507ce6fa fix doc format issue (#870)
Co-authored-by: ZhangJianyu <zhang.jianyu@outlook.com>
2024-11-08 14:58:35 +08:00
Li Gang
a2b9d95f86 Add vLLM ARC support with OpenVINO backend (#641)
* Add vllm Arc Dockerfile support

Support vllm inference on Intel ARC GPU

Signed-off-by: Li Gang <gang.g.li@intel.com>
Co-authored-by: Chen, Hu1 <hu1.chen@intel.com>

* Add vLLM ARC support

With vLLM official repo: https://github.com/vllm-project/vllm/
based on openvino backend

Dockerfile is based on Dockerfile.openvino
https://github.com/vllm-project/vllm/blob/main/Dockerfile.openvino
And add ARC support packages

Default mode: meta-llama/Llama-3.2-3B-Instruct to fit ARC A770 VRAM

Signed-off-by: Li Gang <gang.g.li@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Add README and .github workflow for vLLM ARC support

Signed-off-by: Li Gang <gang.g.li@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Update comps/llms/text-generation/vllm/langchain/README.md

Co-authored-by: Eero Tamminen <eero.t.tamminen@intel.com>

* Rename Dockerfile to meet Contribution Guidelines

Signed-off-by: Li Gang <gang.g.li@intel.com>

* Align image names as opea/vllm-arc:latest

Signed-off-by: Li Gang <gang.g.li@intel.com>

---------

Signed-off-by: Li Gang <gang.g.li@intel.com>
Co-authored-by: Chen, Hu1 <hu1.chen@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Eero Tamminen <eero.t.tamminen@intel.com>
2024-11-08 14:13:06 +08:00
XinyaoWa
617e119f67 Remove useless vllm ray (#859)
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
2024-11-08 13:04:19 +08:00
Isaac Ng
3401db2032 fix list_service method not returning expected response (#787) (#788)
Signed-off-by: isaacncz <isaac.ng@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com>
Co-authored-by: ZePan110 <ze.pan@intel.com>
2024-11-08 12:14:17 +08:00
Neo Zhang Jianyu
5eca5da368 Add CI case to check online doc building, not update online doc (#867)
Co-authored-by: ZhangJianyu <zhang.jianyu@outlook.com>
2024-11-08 11:56:46 +08:00
XinyaoWa
453ff726a6 support faqgen upload file in UI (#866)
* support faqgen upload file in UI

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-08 11:54:04 +08:00
Letong Han
78d8276325 [Dataprep] Fix Delete Bug (#863)
Signed-off-by: letonghan <letong.han@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-08 11:00:49 +08:00
Melanie Hart Buehler
29ef64269a MultimodalQnA Image and Audio Support Phase 1 (#852)
* Adds an endpoint for image ingestion

Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com>

* Combined image and video endpoint

Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com>

* Add test and update README

Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com>

* fixed variable name for embedding model (#1)

Signed-off-by: okhleif-IL <omar.khleif@intel.com>

* Fixed test script

Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com>

* Remove redundant function

Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com>

* get_videos, delete_videos --> get_files, delete_files (#3)

Signed-off-by: okhleif-IL <omar.khleif@intel.com>

* Updates test per review feedback

Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com>

* Fixed test

Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com>

* Add support for audio files multimodal data ingestion (#4)

* Add support for audio files multimodal data ingestion

Signed-off-by: dmsuehir <dina.s.jones@intel.com>

* Update function name

Signed-off-by: dmsuehir <dina.s.jones@intel.com>

---------

Signed-off-by: dmsuehir <dina.s.jones@intel.com>

* Change videos_with_transcripts to ingest_with_text

Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com>

* Add image support to video ingestion with transcript functionality

Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com>

* Update test and README

Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com>

* Updated for review suggestions

Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com>

* Add two tests for ingest_with_text

Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com>

* LVM TGI Gaudi update for prompts without images (#7)

* LVM Gaudi TGI update for prompts without images

Signed-off-by: dmsuehir <dina.s.jones@intel.com>

* Wording

Signed-off-by: dmsuehir <dina.s.jones@intel.com>

* Add a test

Signed-off-by: dmsuehir <dina.s.jones@intel.com>

---------

Signed-off-by: dmsuehir <dina.s.jones@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Change dummy image to be b64 encoded instead of the url (#9)

Signed-off-by: dmsuehir <dina.s.jones@intel.com>

* Updates based on review feedback (#10)

Signed-off-by: dmsuehir <dina.s.jones@intel.com>

* Test fix (#11)

Signed-off-by: dmsuehir <dina.s.jones@intel.com>

---------

Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com>
Signed-off-by: okhleif-IL <omar.khleif@intel.com>
Signed-off-by: dmsuehir <dina.s.jones@intel.com>
Co-authored-by: dmsuehir <dina.s.jones@intel.com>
Co-authored-by: Omar Khleif <omar.khleif@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Abolfazl Shahbazi <12436063+ashahba@users.noreply.github.com>
2024-11-08 10:19:46 +08:00
XinyaoWa
786cabe57d align vllm hpu version to latest vllm-fork (#860)
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
2024-11-07 14:14:58 +08:00
lvliang-intel
618f45bab1 Upgrade habana docker version to 1.18.0 (#854)
* Upgrade habana docker version to 1.18.0

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* fix issues

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* fix ci issue

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

---------

Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-11-07 11:28:48 +08:00
Sihan Chen
518cdfb6e3 add dynamic batching embedding/reranking (#774)
* draft static batching embedding/reranking on single gaudi card

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix

* resolve segfault, deadlock and other issues

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* narrow down default timeout

* add doockerfile

* fix hpu local microservice start

* openai format

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* configurable timeout

* lower timeout

* fix

* lower default timeout

* bf16

* log, pad max_len

* autocast, 128

* fix acc issue

* perf fallback with no acc drop

* revert no-padding ones

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix hpu graph wrapper

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add padding batch

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* habana 1.18

* static -> dynamic

* add UT, add param in_single_process

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add docker file

* fix case doc empty, and pass model id from env

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* CI

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com>
Co-authored-by: ZePan110 <ze.pan@intel.com>
2024-11-06 16:13:36 +08:00
Yao Qing
a8e5adc4d0 [Exporter Tool] Updated exporter tool for docker compose and k8s manifests. (#813)
* Support convert mega.yaml to docker compose yaml.

* Remove device option in opea mege exporter.

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* updated manifests exporter

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* updated manifests_exporter.py

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* updated mega.yaml & updated manifests_exporter

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* done

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* cleancode

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* cleancode and refactor to function

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* added UT for manifests

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Fix for UT.

* fixed the UT issue.

* merged to one file.

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Zhenzhong1 <zhenzhong.xu@intel.com>
2024-11-06 10:34:04 +08:00
Neo Zhang Jianyu
b8948f248f fix format issue (#855)
Co-authored-by: ZhangJianyu <zhang.jianyu@outlook.com>
2024-11-05 17:05:20 +08:00
Isaac Ng
c1c5798485 Add issue template (#785)
Signed-off-by: isaacncz <isaac.ng@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Rachel R <rroumeliotis@gmail.com>
2024-11-04 15:57:14 +08:00
Sihan Chen
acf07cd90d fix prometheus invalid metric name (#849)
* fix

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-04 12:00:02 +08:00
Eero Tamminen
a6998a1dbd Add E2E Promeheus metrics to applications (#845)
* Fix typos in BaseStatistics method names

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>

* Add HttpService "inprogress" (pending) request count metrics

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>

* Add E2E Prometheus metrics to ServiceOrchestrator

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>

* Fix: support metrics with multiple ServiceOrchestrator instances

Unlike apps, CI tests create multiple of them.

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>

* Fix: require named MicroService -> HTTPService instances

Creating multiple MicroService()s creates multiple HTTPService()s
which creates multiple Prometheus fastapi instrumentor instances.

While latter handled that fine for ChatQnA and normal HTTP metrics,
that was not the case for its "inprogress" metrics in CI.

Therefore MicroService constructor name argument is now mandatory, so
that it can be used to make "inprogress" metrics for HTTPService
instances unique.

PS. instrumentor requires HTTPService instance specific Starlette
instance, so it cannot be made singleton.

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>

* Fix: update test_token_generator()

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-04 09:58:23 +08:00