GenAIExamples

Author	SHA1	Message	Date
ZePan110	dca337d90b	Add empty list check (#914 ) * Add outputs. Signed-off-by: ZePan110 <ze.pan@intel.com> * Add empty list check Signed-off-by: ZePan110 <ze.pan@intel.com> * test CI. Signed-off-by: ZePan110 <ze.pan@intel.com> * Remove test files Signed-off-by: ZePan110 <ze.pan@intel.com> * remove debug code Signed-off-by: chensuyue <suyue.chen@intel.com> --------- Signed-off-by: ZePan110 <ze.pan@intel.com> Signed-off-by: chensuyue <suyue.chen@intel.com> Co-authored-by: chensuyue <suyue.chen@intel.com>	2024-11-18 22:46:00 +08:00
Chun Tao	2a98120edf	Add "--no-verbose" flag to wget download commands in entrypoint (#909 ) Signed-off-by: Chun Tao <chun.tao@intel.com>	2024-11-18 11:12:35 +08:00
lvliang-intel	8e148a3924	Add env for pass down model id in ChatQnA gateway (#906 ) * Pass down model id for ChatQnA Signed-off-by: lvliang-intel <liang1.lv@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * update logic Signed-off-by: lvliang-intel <liang1.lv@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-11-16 10:13:52 +08:00
Melanie Hart Buehler	c823157428	Fix units of incorrect caption timestamps (#907 ) Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com>	2024-11-15 11:12:17 -08:00
Sihan Chen	d547872c9c	add zero-shot vc readme (#904 )	2024-11-15 15:10:39 +08:00
XinyaoWa	e1475acb55	vllm hpu fix version for bug fix (#903 ) * vllm test Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> * fix vllm hpu version to fix bug Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> * refine readme Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> * fix vllm version Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> * update vllm ut model Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> * revert agent Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> * fix bug Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> --------- Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>	2024-11-15 15:10:27 +08:00
Mustafa	d211cb2dbd	Docsum Gateway Fix (#902 ) * update gateway Signed-off-by: Mustafa <mustafa.cetin@intel.com> * update the gateway Signed-off-by: Mustafa <mustafa.cetin@intel.com> * update the gateway Signed-off-by: Mustafa <mustafa.cetin@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: Mustafa <mustafa.cetin@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-11-15 11:14:50 +08:00
Melanie Hart Buehler	405a632b31	Bugfix for follow-up query with a .png image (#900 ) * MultimodalQnA bugfix for follow-up query with a .png image Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Abolfazl Shahbazi <12436063+ashahba@users.noreply.github.com>	2024-11-14 15:42:22 -08:00
rbrugaro	0163ea6f4e	trim input to TGI, moved clustering and summarization to dataprep and store in DB (#893 ) * trim input to TGI, moved clustering and summarization to dataprep and DB store Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * removed inspect_db causing error in precommit Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add HF token to dataprep container because tokenizer is used now Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * updated READMEs to reflect latest changes Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * bug fix all files are ingested and graph extracted first followed by 1 cluster call for full graph in database Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * update README based on fix for multifile Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * Changes to make graphrag ui work Signed-off-by: theresa <theresa.shan@intel.com> * fix bug build communities done once at end of ingestion Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * minor fixes Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * README fixes Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> --------- Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> Signed-off-by: theresa <theresa.shan@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: theresa <theresa.shan@intel.com>	2024-11-14 15:29:23 -08:00
lvliang-intel	517a5b04a8	Fix LLM special token issue (#895 ) * Fix LLM special token issue Signed-off-by: lvliang-intel <liang1.lv@intel.com> * update code Signed-off-by: lvliang-intel <liang1.lv@intel.com> * update logic Signed-off-by: lvliang-intel <liang1.lv@intel.com> * update vllm llm Signed-off-by: lvliang-intel <liang1.lv@intel.com> --------- Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: ZePan110 <ze.pan@intel.com>	2024-11-14 21:26:15 +08:00
lkk	32bcde4528	fix history content from agent memory. (#899 ) * fix history content from agent memory. * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-11-14 21:26:01 +08:00
ZePan110	0dbf57751b	Standardize the naming format of images (#898 ) Signed-off-by: ZePan110 <ze.pan@intel.com>	2024-11-14 18:22:27 +08:00
XinyaoWa	7bf1953c23	Embedding compatible with OpenAI API (#892 ) * Embedding TEI Langchain compatible with OpenAI API Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * TextDoc support list Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * support tei llama index openai compatible API Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * support mosec langchain openai compatible API Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> * update UT for embedding tests Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix ut bug Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * support embedding predictionguard openai compatible API Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> * support embedding multimodal clip OpenAI compatible API Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> * fix bug Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> * enable debug mode for embedding UT Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: chen, suyue <suyue.chen@intel.com> Co-authored-by: ZePan110 <ze.pan@intel.com>	2024-11-14 09:29:36 +08:00
Eero Tamminen	441882419a	Minor simplication to ServiceOrchestrator code (#889 ) * Drop dump_outputs() method that obfuscates the code dump_outputs() method in ServiceOrchestrator: * Is not real method (does not use self) * Adds a member to a dict instead of "dump"ing (drop or output) something * Obfuscates how schedule() method return value is constructed, and * Makes calling code unnecessary longer Similar method in "ServiceOrchestratorWithYaml" is reasonable except for the name, but drop also that for consistency. Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com> * Apply pylint simplification suggestion to execute() Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com> --------- Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com> Co-authored-by: Sihan Chen <39623753+Spycsh@users.noreply.github.com>	2024-11-13 23:44:11 +08:00
sgurunat	e3812a7417	Multiple models and remote service support for langchain vLLM text-generation (#887 ) * Multiple models support for langchain vLLM text-generation Signed-off-by: sgurunat <gurunath.s@intel.com> * Add authentication support for langchain vLLM text-generation remote endpoints Signed-off-by: sgurunat <gurunath.s@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: sgurunat <gurunath.s@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-11-13 21:58:19 +08:00
ZePan110	a1712035a4	Block links that require real person verification (#897 ) Signed-off-by: ZePan110 <ze.pan@intel.com>	2024-11-13 21:43:20 +08:00
sgurunat	e879366cf8	Multiple models support for LLM TGI (#835 ) * Update gateway and docarray from mega and proto services to have model field for ChatQnAGateway and LLMParams respectively * Add load_model_configs method in utils.py to validate and load the model_configs * Update llms text-generation tgi file (llm.py) to support multiple models. Uses load_model_configs method from utils * Update llms text-generation tgi template to add different templates for different models * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fixed llm_endpoint empty string issue on error scenario Signed-off-by: sgurunat <gurunath.s@intel.com> * Function to get llm_endpoint and keep the code clean Signed-off-by: sgurunat <gurunath.s@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: sgurunat <gurunath.s@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-11-13 17:11:43 +08:00
ZePan110	9e471a9ecc	Block links that require real person verification (#896 ) Signed-off-by: ZePan110 <ze.pan@intel.com>	2024-11-13 16:42:57 +08:00
sgurunat	550325d8cb	vLLM support for DocSum (#885 ) * Add model parameter for DocSumGateway in gateway.py file Signed-off-by: sgurunat <gurunath.s@intel.com> * Add langchain vllm support for DocSum along with authentication support for vllm endpoints * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Updated docker_compose_llm.yaml and README file with vLLM information Signed-off-by: sgurunat <gurunath.s@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Updated docsum-vllm Dockerfile into llm-compose-cd.yaml under github workflows Signed-off-by: sgurunat <gurunath.s@intel.com> * Updated llm-compose.yaml file to include vllm sumarization docker build Signed-off-by: sgurunat <gurunath.s@intel.com> --------- Signed-off-by: sgurunat <gurunath.s@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: lvliang-intel <liang1.lv@intel.com>	2024-11-13 15:20:15 +08:00
sgurunat	f5c60f10b1	vLLM support for FAQGen (#884 ) * Add model parameter for FaqGenGateway in gateway.py file Signed-off-by: sgurunat <gurunath.s@intel.com> * Add langchain vllm support for FaqGen along with authentication support for vllm endpoints Signed-off-by: sgurunat <gurunath.s@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Updated docker_compose_llm.yaml and README file with vLLM information Signed-off-by: sgurunat <gurunath.s@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Updated faq-vllm Dockerfile into llm-compose-cd.yaml under github workflows Signed-off-by: sgurunat <gurunath.s@intel.com> * Updated llm-compose.yaml file to include vllm faqgen build Signed-off-by: sgurunat <gurunath.s@intel.com> --------- Signed-off-by: sgurunat <gurunath.s@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-11-13 14:17:49 +08:00
Mustafa	baafa402c2	Add support for Audio and Video summarization to Docsum (#865 ) * v2a services Signed-off-by: Mustafa <mustafa.cetin@intel.com> * add a2t - llm Signed-off-by: Mustafa <mustafa.cetin@intel.com> * update whisper serve Signed-off-by: Mustafa <mustafa.cetin@intel.com> * updates Signed-off-by: Mustafa <mustafa.cetin@intel.com> * add data service Signed-off-by: Mustafa <mustafa.cetin@intel.com> * gateway Signed-off-by: Mustafa <mustafa.cetin@intel.com> * clean gateway & orchestrator Signed-off-by: Mustafa <mustafa.cetin@intel.com> * updates Signed-off-by: Mustafa <mustafa.cetin@intel.com> * updates Signed-off-by: Mustafa <mustafa.cetin@intel.com> * adding functional tests Signed-off-by: Mustafa <mustafa.cetin@intel.com> * updates Signed-off-by: Mustafa <mustafa.cetin@intel.com> * updates Signed-off-by: Mustafa <mustafa.cetin@intel.com> * updates read me file Signed-off-by: Mustafa <mustafa.cetin@intel.com> * name changes Signed-off-by: Mustafa <mustafa.cetin@intel.com> * update readme file Signed-off-by: Mustafa <mustafa.cetin@intel.com> * update readme file Signed-off-by: Mustafa <mustafa.cetin@intel.com> * update readme file Signed-off-by: Mustafa <mustafa.cetin@intel.com> * update readme file Signed-off-by: Mustafa <mustafa.cetin@intel.com> * update readme file Signed-off-by: Mustafa <mustafa.cetin@intel.com> * update max token option Signed-off-by: Mustafa <mustafa.cetin@intel.com> * update the test files Signed-off-by: Mustafa <mustafa.cetin@intel.com> * readme updtes Signed-off-by: Mustafa <mustafa.cetin@intel.com> * readme updtes Signed-off-by: Mustafa <mustafa.cetin@intel.com> * clean code Signed-off-by: Mustafa <mustafa.cetin@intel.com> * update dataprep-compose-cd.yaml file Signed-off-by: Mustafa <mustafa.cetin@intel.com> * merge and sync Signed-off-by: Mustafa <mustafa.cetin@intel.com> * merge and sync gateway Signed-off-by: Mustafa <mustafa.cetin@intel.com> * adding the copyright header Signed-off-by: Mustafa <mustafa.cetin@intel.com> * update the end of file char Signed-off-by: Mustafa <mustafa.cetin@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * update gateway Signed-off-by: Mustafa <mustafa.cetin@intel.com> * update gateway-docsum Signed-off-by: Mustafa <mustafa.cetin@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix test files Signed-off-by: Mustafa <mustafa.cetin@intel.com> * fix test files Signed-off-by: Mustafa <mustafa.cetin@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * merge & sync Signed-off-by: Mustafa <mustafa.cetin@intel.com> * Update dataprep-compose.yaml * update test Signed-off-by: Mustafa <mustafa.cetin@intel.com> * update dataprep-compose Signed-off-by: Mustafa <mustafa.cetin@intel.com> --------- Signed-off-by: Mustafa <mustafa.cetin@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Abolfazl Shahbazi <12436063+ashahba@users.noreply.github.com> Co-authored-by: ZePan110 <ze.pan@intel.com>	2024-11-12 21:51:45 -08:00
Eero Tamminen	3b106c82ef	Replace HTTP "inprogress" gauge with megaservice "request_pending" one (#864 ) * Add "megaservice_request_pending" metric Unlike other megaservice ServiceOrchestrator metrics, this covers (can cover) also non-streaming requests, as suggested in PR review. This does not have issues Prometheus-fastapi-instrumentator "inprogress" metric did: * Extra instances which have to be differentiated e.g. for CI * Rely on name -> suffix coming through obscure kwargs calls Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com> * Remove HTTP "inprogress" gauge as redundant Now that ServiceOrchestrator provides pending metric. Reverts the "inprogress" metric part of commit `a6998a1dbd`. Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com> * Document megaservice metrics Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-11-13 09:52:24 +08:00
ZePan110	2d0eea90d2	quick fix (#894 ) Signed-off-by: ZePan110 <ze.pan@intel.com>	2024-11-13 09:41:02 +08:00
Abolfazl Shahbazi	f1594cb54f	Fix missing end of file chars (#874 ) Signed-off-by: Abolfazl Shahbazi <12436063+ashahba@users.noreply.github.com>	2024-11-13 09:34:03 +08:00
lkk	e39b08f3d0	agent short & long term memory with langgraph. (#851 ) * draft a demo code for memory. * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add agent short-term memory with langgraph checkpoint. * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add save long-term memory func. * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add save long-term memory func. * add timeout for llm response. * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix ut with adding -e HABANA_VISIBLE_DEVICES=all. --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-11-12 17:28:37 +08:00
sgurunat	24b9f03f48	vLLM support for Codegen (#886 ) * Add model parameter for CodeGenGateway in gateway.py file Signed-off-by: sgurunat <gurunath.s@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: sgurunat <gurunath.s@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-11-12 15:23:31 +08:00
ZePan110	23c99c1170	Combine CI/CD docker compose. (#861 ) * Combine CI/CD docker compose yaml files. Signed-off-by: ZePan110 <ze.pan@intel.com> * Fix path issue in script Signed-off-by: ZePan110 <ze.pan@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Combine CI/CD docker compose yaml files. Signed-off-by: ZePan110 <ze.pan@intel.com> * Fix path issue in script Signed-off-by: ZePan110 <ze.pan@intel.com> * Fix path Signed-off-by: ZePan110 <ze.pan@intel.com> * Combine CI/CD for .github/workflows/_comps-workflow.yml and .github/workflows/_get-image-list.yml Signed-off-by: ZePan110 <ze.pan@intel.com> * Combine compose yaml file Signed-off-by: ZePan110 <ze.pan@intel.com> * Remove CD file Signed-off-by: ZePan110 <ze.pan@intel.com> * Add vllm-fork to CI. Signed-off-by: ZePan110 <ze.pan@intel.com> * Fix issue Signed-off-by: ZePan110 <ze.pan@intel.com> * Add CICD mode Signed-off-by: ZePan110 <ze.pan@intel.com> * Add embedding-reranking-local to compose.yaml Signed-off-by: ZePan110 <ze.pan@intel.com> * Change descriptions Signed-off-by: ZePan110 <ze.pan@intel.com> * Fix script issue. Signed-off-by: ZePan110 <ze.pan@intel.com> * Add new image. Signed-off-by: ZePan110 <ze.pan@intel.com> * remove useless image from llms-compose.yaml Signed-off-by: ZePan110 <ze.pan@intel.com> * Change folder name. Signed-off-by: ZePan110 <ze.pan@intel.com> * Standardize the format of image names. Signed-off-by: ZePan110 <ze.pan@intel.com> * Add commit print Signed-off-by: ZePan110 <ze.pan@intel.com> * Standardize the format of image names Signed-off-by: ZePan110 <ze.pan@intel.com> * Add vllm-arc to compose file. Signed-off-by: ZePan110 <ze.pan@intel.com> * Restore test_agent_langchain_on_intel_hpu.sh Signed-off-by: ZePan110 <ze.pan@intel.com> * Remove useless comments Signed-off-by: ZePan110 <ze.pan@intel.com> --------- Signed-off-by: ZePan110 <ze.pan@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-11-12 15:15:21 +08:00
XinyuYe-Intel	37f35140cc	Add DPO support in finetuning microservice (#857 ) * added dpo support. Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com> * make dpo trainer compatible with newest transformers. Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com> * added ut for dpo. Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com> * added training successfulness check in finetuning ut. Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated broken link. Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com> --------- Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ZePan110 <ze.pan@intel.com>	2024-11-12 11:35:03 +08:00
qgao007	9a50131d69	Enable bash scr to to be path-independent using $0 to address ERROR: failed to solve: failed to read dockerfile: open Dockerfile.intel_hpu: no such file or director when following README (#808 ) Signed-off-by: Qun Gao <qun.gao@intel.com> Co-authored-by: ZePan110 <ze.pan@intel.com> Co-authored-by: Abolfazl Shahbazi <12436063+ashahba@users.noreply.github.com>	2024-11-08 12:04:06 -08:00
senhui2intel	52757b382c	Enable Intel ARC gpu test for vllm openvino. (#856 ) Signed-off-by: senhui2intel <senhui.zhang@intel.com> Co-authored-by: chen, suyue <suyue.chen@intel.com>	2024-11-08 22:38:27 +08:00
Neo Zhang Jianyu	09980b5355	opt CI to skip none MD and RST files (#873 ) Signed-off-by: ZhangJianyu <zhang.jianyu@outlook.com>	2024-11-08 22:07:10 +08:00
Letong Han	75eb864d78	update llm endpoint validation commands (#869 ) Signed-off-by: letonghan <letong.han@intel.com>	2024-11-08 19:45:06 +08:00
dolpher	ca6a4e3609	Remove health check log (#853 ) Signed-off-by: Dolpher Du <dolpher.du@intel.com> Co-authored-by: ZePan110 <ze.pan@intel.com>	2024-11-08 15:54:58 +08:00
Yao Qing	46ff36c008	Fixed the issue of asynchronous call failure for MosecEmbeddings (#871 ) * Fixed the issue of asynchronous call failure for MosecEmbeddings Signed-off-by: Yao, Qing <qing.yao@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Add import asyncio Signed-off-by: Yao, Qing <qing.yao@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: Yao, Qing <qing.yao@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-11-08 15:54:16 +08:00
Neo Zhang Jianyu	ef507ce6fa	fix doc format issue (#870 ) Co-authored-by: ZhangJianyu <zhang.jianyu@outlook.com>	2024-11-08 14:58:35 +08:00
Li Gang	a2b9d95f86	Add vLLM ARC support with OpenVINO backend (#641 ) * Add vllm Arc Dockerfile support Support vllm inference on Intel ARC GPU Signed-off-by: Li Gang <gang.g.li@intel.com> Co-authored-by: Chen, Hu1 <hu1.chen@intel.com> * Add vLLM ARC support With vLLM official repo: https://github.com/vllm-project/vllm/ based on openvino backend Dockerfile is based on Dockerfile.openvino https://github.com/vllm-project/vllm/blob/main/Dockerfile.openvino And add ARC support packages Default mode: meta-llama/Llama-3.2-3B-Instruct to fit ARC A770 VRAM Signed-off-by: Li Gang <gang.g.li@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Add README and .github workflow for vLLM ARC support Signed-off-by: Li Gang <gang.g.li@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update comps/llms/text-generation/vllm/langchain/README.md Co-authored-by: Eero Tamminen <eero.t.tamminen@intel.com> * Rename Dockerfile to meet Contribution Guidelines Signed-off-by: Li Gang <gang.g.li@intel.com> * Align image names as opea/vllm-arc:latest Signed-off-by: Li Gang <gang.g.li@intel.com> --------- Signed-off-by: Li Gang <gang.g.li@intel.com> Co-authored-by: Chen, Hu1 <hu1.chen@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Eero Tamminen <eero.t.tamminen@intel.com>	2024-11-08 14:13:06 +08:00
XinyaoWa	617e119f67	Remove useless vllm ray (#859 ) Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>	2024-11-08 13:04:19 +08:00
Isaac Ng	3401db2032	fix list_service method not returning expected response (#787 ) (#788 ) Signed-off-by: isaacncz <isaac.ng@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: chen, suyue <suyue.chen@intel.com> Co-authored-by: ZePan110 <ze.pan@intel.com>	2024-11-08 12:14:17 +08:00
Neo Zhang Jianyu	5eca5da368	Add CI case to check online doc building, not update online doc (#867 ) Co-authored-by: ZhangJianyu <zhang.jianyu@outlook.com>	2024-11-08 11:56:46 +08:00
XinyaoWa	453ff726a6	support faqgen upload file in UI (#866 ) * support faqgen upload file in UI Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-11-08 11:54:04 +08:00
Letong Han	78d8276325	[Dataprep] Fix Delete Bug (#863 ) Signed-off-by: letonghan <letong.han@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-11-08 11:00:49 +08:00
Melanie Hart Buehler	29ef64269a	MultimodalQnA Image and Audio Support Phase 1 (#852 ) * Adds an endpoint for image ingestion Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com> * Combined image and video endpoint Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com> * Add test and update README Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com> * fixed variable name for embedding model (#1) Signed-off-by: okhleif-IL <omar.khleif@intel.com> * Fixed test script Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com> * Remove redundant function Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com> * get_videos, delete_videos --> get_files, delete_files (#3) Signed-off-by: okhleif-IL <omar.khleif@intel.com> * Updates test per review feedback Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com> * Fixed test Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com> * Add support for audio files multimodal data ingestion (#4) * Add support for audio files multimodal data ingestion Signed-off-by: dmsuehir <dina.s.jones@intel.com> * Update function name Signed-off-by: dmsuehir <dina.s.jones@intel.com> --------- Signed-off-by: dmsuehir <dina.s.jones@intel.com> * Change videos_with_transcripts to ingest_with_text Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com> * Add image support to video ingestion with transcript functionality Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com> * Update test and README Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com> * Updated for review suggestions Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com> * Add two tests for ingest_with_text Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com> * LVM TGI Gaudi update for prompts without images (#7) * LVM Gaudi TGI update for prompts without images Signed-off-by: dmsuehir <dina.s.jones@intel.com> * Wording Signed-off-by: dmsuehir <dina.s.jones@intel.com> * Add a test Signed-off-by: dmsuehir <dina.s.jones@intel.com> --------- Signed-off-by: dmsuehir <dina.s.jones@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Change dummy image to be b64 encoded instead of the url (#9) Signed-off-by: dmsuehir <dina.s.jones@intel.com> * Updates based on review feedback (#10) Signed-off-by: dmsuehir <dina.s.jones@intel.com> * Test fix (#11) Signed-off-by: dmsuehir <dina.s.jones@intel.com> --------- Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com> Signed-off-by: okhleif-IL <omar.khleif@intel.com> Signed-off-by: dmsuehir <dina.s.jones@intel.com> Co-authored-by: dmsuehir <dina.s.jones@intel.com> Co-authored-by: Omar Khleif <omar.khleif@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Abolfazl Shahbazi <12436063+ashahba@users.noreply.github.com>	2024-11-08 10:19:46 +08:00
XinyaoWa	786cabe57d	align vllm hpu version to latest vllm-fork (#860 ) Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>	2024-11-07 14:14:58 +08:00
lvliang-intel	618f45bab1	Upgrade habana docker version to 1.18.0 (#854 ) * Upgrade habana docker version to 1.18.0 Signed-off-by: lvliang-intel <liang1.lv@intel.com> * fix issues Signed-off-by: lvliang-intel <liang1.lv@intel.com> * fix ci issue Signed-off-by: lvliang-intel <liang1.lv@intel.com> --------- Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2024-11-07 11:28:48 +08:00
Sihan Chen	518cdfb6e3	add dynamic batching embedding/reranking (#774 ) * draft static batching embedding/reranking on single gaudi card * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix * resolve segfault, deadlock and other issues * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * narrow down default timeout * add doockerfile * fix hpu local microservice start * openai format * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * configurable timeout * lower timeout * fix * lower default timeout * bf16 * log, pad max_len * autocast, 128 * fix acc issue * perf fallback with no acc drop * revert no-padding ones * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix hpu graph wrapper * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add padding batch * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * habana 1.18 * static -> dynamic * add UT, add param in_single_process * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add docker file * fix case doc empty, and pass model id from env * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * CI --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: chen, suyue <suyue.chen@intel.com> Co-authored-by: ZePan110 <ze.pan@intel.com>	2024-11-06 16:13:36 +08:00
Yao Qing	a8e5adc4d0	[Exporter Tool] Updated exporter tool for docker compose and k8s manifests. (#813 ) * Support convert mega.yaml to docker compose yaml. * Remove device option in opea mege exporter. * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated manifests exporter * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated manifests_exporter.py * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated mega.yaml & updated manifests_exporter * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * done * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * cleancode * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * cleancode and refactor to function * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added UT for manifests * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix for UT. * fixed the UT issue. * merged to one file. * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Zhenzhong1 <zhenzhong.xu@intel.com>	2024-11-06 10:34:04 +08:00
Neo Zhang Jianyu	b8948f248f	fix format issue (#855 ) Co-authored-by: ZhangJianyu <zhang.jianyu@outlook.com>	2024-11-05 17:05:20 +08:00
Isaac Ng	c1c5798485	Add issue template (#785 ) Signed-off-by: isaacncz <isaac.ng@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Rachel R <rroumeliotis@gmail.com>	2024-11-04 15:57:14 +08:00
Sihan Chen	acf07cd90d	fix prometheus invalid metric name (#849 ) * fix * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-11-04 12:00:02 +08:00
Eero Tamminen	a6998a1dbd	Add E2E Promeheus metrics to applications (#845 ) * Fix typos in BaseStatistics method names Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com> * Add HttpService "inprogress" (pending) request count metrics Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com> * Add E2E Prometheus metrics to ServiceOrchestrator Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com> * Fix: support metrics with multiple ServiceOrchestrator instances Unlike apps, CI tests create multiple of them. Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com> * Fix: require named MicroService -> HTTPService instances Creating multiple MicroService()s creates multiple HTTPService()s which creates multiple Prometheus fastapi instrumentor instances. While latter handled that fine for ChatQnA and normal HTTP metrics, that was not the case for its "inprogress" metrics in CI. Therefore MicroService constructor name argument is now mandatory, so that it can be used to make "inprogress" metrics for HTTPService instances unique. PS. instrumentor requires HTTPService instance specific Starlette instance, so it cannot be made singleton. Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com> * Fix: update test_token_generator() Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-11-04 09:58:23 +08:00

698 Commits