Commit Graph

683 Commits

Author SHA1 Message Date
XinyaoWa
5aba3b25cf Support Long context for DocSum (#981)
* docsum four

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* support 4 modes for docsum

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* fix

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* fix bug

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* refine for docsum tgi

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* add docsum for ut and vllm

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* fix bug

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* fix bug

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix ut bug

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* fix ut bug

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* set default value

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

---------

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-12-17 14:09:49 +08:00
Letong Han
f3aaaebf5a [Reorg] Remove redundant file in retrievers/redis (#1016)
Signed-off-by: letonghan <letong.han@intel.com>
2024-12-17 12:01:13 +08:00
lkk
ce1faf6ae1 refine tgi doc with default openai format. (#1037) 2024-12-17 10:43:08 +08:00
lkk
c955e5e498 update tei embedding format. (#1035) 2024-12-16 14:54:32 +08:00
minmin-intel
46835f95da Revert "Add SQL agent strategy (#975)" (#1030)
This reverts commit c36c5032dc.

Co-authored-by: lkk <33276950+lkk12014402@users.noreply.github.com>
2024-12-14 12:09:14 +08:00
XinyaoWa
48ed589822 vllm comps support openai API ChatCompletionRequest (#1032)
* vllm support openai API

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* fix bug

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* fix bug

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* test_llms_text-generation_vllm_langchain_on_intel_hpu.sh

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* fix time

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* fix bug

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix bug

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

---------

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-12-13 17:56:24 +08:00
lkk
f5efaf1f18 remove examples gateway. (#979)
* remove examples gateway.

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* remove gateway.

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* refine service code.

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Update http_service.py

* remove gateway ut.

* remove gateway ut.

* fix conflict service name.

* Update http_service.py

* add handle message ut.

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* remove `multiprocessing.Process` start server code.

* fix ut.

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* remove multiprocessing and enhance ut for coverage.

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com>
2024-12-13 09:31:11 +08:00
minmin-intel
c36c5032dc Add SQL agent strategy (#975)
* initial code for sql agent llama

Signed-off-by: minmin-intel <minmin.hou@intel.com>

* add test for sql agent

Signed-off-by: minmin-intel <minmin.hou@intel.com>

* update sql agent test

Signed-off-by: minmin-intel <minmin.hou@intel.com>

* fix bugs and use vllm to test sql agent

Signed-off-by: minmin-intel <minmin.hou@intel.com>

* add tag-bench test and google search tool

Signed-off-by: minmin-intel <minmin.hou@intel.com>

* test sql agent with hints

Signed-off-by: minmin-intel <minmin.hou@intel.com>

* fix bugs for sql agent with hints and update test

Signed-off-by: minmin-intel <minmin.hou@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add readme for sql agent and fix ci bugs

Signed-off-by: minmin-intel <minmin.hou@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add sql agent using openai models

Signed-off-by: minmin-intel <minmin.hou@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix bugs in sql agent openai

Signed-off-by: minmin-intel <minmin.hou@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* make wait time longer for sql agent microservice to be ready

Signed-off-by: minmin-intel <minmin.hou@intel.com>

* update readme

Signed-off-by: minmin-intel <minmin.hou@intel.com>

* fix test bug

Signed-off-by: minmin-intel <minmin.hou@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* skip planexec with vllm due to vllm-gaudi bug

Signed-off-by: minmin-intel <minmin.hou@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* debug ut issue

Signed-off-by: minmin-intel <minmin.hou@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* use vllm for all uts

Signed-off-by: minmin-intel <minmin.hou@intel.com>

* debug ci issue

Signed-off-by: minmin-intel <minmin.hou@intel.com>

* change vllm port

Signed-off-by: minmin-intel <minmin.hou@intel.com>

* update ut

Signed-off-by: minmin-intel <minmin.hou@intel.com>

* remove tgi server

Signed-off-by: minmin-intel <minmin.hou@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* align vllm port

Signed-off-by: minmin-intel <minmin.hou@intel.com>

---------

Signed-off-by: minmin-intel <minmin.hou@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-12-11 10:55:55 -08:00
Letong Han
6acefae785 [LLM] Modify Params to Support Falcon3 Model (#1027)
* modify params to support falcon3 model

---------

Signed-off-by: letonghan <letong.han@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Zhenzhong Xu <zhenzhong.xu@intel.com>
2024-12-11 11:35:05 +08:00
Liang Lv
c409ef9fcc Add Component base class for code refactoring (#983)
* Add Component base class

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add controller class

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add ut

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-12-10 13:20:16 +08:00
Wang, Kai Lawrence
ddd372d3e4 Remove enforce-eager to enable HPU graphs for better vLLM perf (#954)
* remove enforce-eager to enable HPU graphs

Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>

* Increase the llm max timeout in ci for fully warmup

Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>

---------

Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
2024-12-10 13:19:56 +08:00
kkrishTa
5ed041bded Feature/elasticsearch vector store integration - Infosys (#972)
* Feature/elastic

Elasticsearch vectorstore, dataprep and retriever

---------

Co-authored-by: Adarsh <reachaadi@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Liang Lv <liang1.lv@intel.com>
2024-12-10 09:40:44 +08:00
Yao Qing
fbf3017afb Revert mosec embedding microservice to to use synchronous interface. (#971)
* Revert mosec embedding microservice to  to use synchronous interface.

Signed-off-by: Yao, Qing <qing.yao@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Add dependency.

Signed-off-by: Yao, Qing <qing.yao@intel.com>

---------

Signed-off-by: Yao, Qing <qing.yao@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-12-06 13:18:51 +08:00
Eero Tamminen
5663e16821 Exclude yield/reply time from first token latency metric (#973)
While metrics are OK for small number of requests, when megaservice
is handling many (hundreds of) _parallel_ requests, it was reporting
clearly (~10%) larger first token latency, than the client receiving
the tokens from the megaservice.

Getting the time before token is yielded, means that reported first
token latency can be slightly shorter than it actually is. However,
testing with ChatQnA shows latencies to be clearly closer to ones seen
by the client (within couple of percent) and typically smaller (i.e.
logical).

PS. Doing the metrics timing after yielding the token, meant that also
time for sending the reply to the client and waiting that to complete,
was included to the token time.  I suspect that with lot of parallel
requests, processing often had switched to other megaservice request
processing threads, and getting control back to yielding thread for
timing, could be delayed much longer than sending the response to
client took.

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>
2024-12-06 11:08:57 +08:00
dependabot[bot]
3328ea3ab2 Bump aiohttp from 3.10.10 to 3.10.11 in /comps/animation/wav2lip (#966)
Bumps [aiohttp](https://github.com/aio-libs/aiohttp) from 3.10.10 to 3.10.11.
- [Release notes](https://github.com/aio-libs/aiohttp/releases)
- [Changelog](https://github.com/aio-libs/aiohttp/blob/master/CHANGES.rst)
- [Commits](https://github.com/aio-libs/aiohttp/compare/v3.10.10...v3.10.11)

---
updated-dependencies:
- dependency-name: aiohttp
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-12-04 09:36:53 +08:00
VincyZhang
766c757f3b add dangerous cmd check (#955)
* add dangerous cmd check

Signed-off-by: Wenxin Zhang <wenxin.zhang@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* clean code

Signed-off-by: Wenxin Zhang <wenxin.zhang@intel.com>

---------

Signed-off-by: Wenxin Zhang <wenxin.zhang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-12-03 22:33:26 +08:00
Neo Zhang Jianyu
0772494f62 add label automatically when create issue (#960)
Co-authored-by: ZhangJianyu <zhang.jianyu@outlook.com>
2024-12-02 14:48:17 +08:00
ZePan110
9d6d7b8195 Check image and service names in compose.yaml (#951)
* WIP

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Chack image and service names in compose.yaml

Signed-off-by: ZePan110 <ze.pan@intel.com>

* merge pr-check-duplicated-image.yml to pr-dockerfile-path-scan.yaml

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Remove .github/workflows/pr-check-duplicated-image.yml

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Unblocking txt files from push-image-build.yml

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Fix name error

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Split pr-dockerfile-path-scan.yaml to .github/workflows/pr-dockerfile-path-scan.yaml and .github/workflows/pr-link-path-scan.yaml and change the mask of .github/workflows/push-image-build.yml

Signed-off-by: ZePan110 <ze.pan@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: ZePan110 <ze.pan@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-28 13:39:52 +08:00
Sihan Chen
9a0d91a5c8 Enhance asr/tts tests (#952)
* add tests

* validate tests

* refactor log
2024-11-28 10:01:17 +08:00
ZePan110
145f3fb886 Add dependence (#950)
Signed-off-by: ZePan110 <ze.pan@intel.com>
2024-11-27 14:19:36 +08:00
Lianhao Lu
0e94eecbb7 CI: Add check for conflict image build definition (#944)
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>
2024-11-27 11:28:47 +08:00
ZePan110
c5b8cdd8c7 Fix build issue (#946)
Signed-off-by: ZePan110 <ze.pan@intel.com>
2024-11-26 19:15:07 +08:00
Sihan Chen
c3948ad531 openai compatible for asr/tts (#929)
* openai compatible for asr/tts

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add dep

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-25 09:55:48 +08:00
Mustafa
bbca7fd57b bug fix - moviepy version update (#942)
* initial commit with fix

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* initial commit with fix

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

---------

Signed-off-by: Mustafa <mustafa.cetin@intel.com>
2024-11-22 09:38:24 -08:00
berkecanrizai
7569e7c9b6 update pathway vector store, fix tests (#940)
Signed-off-by: Berke <berkecanrizai1@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-22 09:48:58 +08:00
chen, suyue
6c2b466372 Fix image scan _get-image-list.yml (#941)
* fix corner issue

Signed-off-by: chensuyue <suyue.chen@intel.com>

* for test

Signed-off-by: chensuyue <suyue.chen@intel.com>

* fix issue

Signed-off-by: chensuyue <suyue.chen@intel.com>

* fix images

Signed-off-by: chensuyue <suyue.chen@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: chensuyue <suyue.chen@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-22 09:46:33 +08:00
Isaac Ng
3dcc1e0a78 fix issue template bug (#939)
Signed-off-by: isaacncz <isaac.ng@intel.com>
2024-11-21 17:09:15 +08:00
chen, suyue
997749136d bump version into 1.1 (#930)
Signed-off-by: chensuyue <suyue.chen@intel.com>
2024-11-21 14:15:22 +08:00
Chaunte W. Lacewell
750e501bee [Bug] Fix VDMS retriever and apply fix to VDMS dataprep (#928)
* Update requirements to pin protobuf version and fix grpc conflict, and limit vdms version

Signed-off-by: Lacewell, Chaunte W <chaunte.w.lacewell@intel.com>

* Update fix by removing grpcio pin and pinning opentelemetry-proto to 1.23.0

Signed-off-by: Lacewell, Chaunte W <chaunte.w.lacewell@intel.com>

---------

Signed-off-by: Lacewell, Chaunte W <chaunte.w.lacewell@intel.com>
2024-11-21 11:05:05 +08:00
chen, suyue
9b1e322624 fix the image name (#918)
Signed-off-by: chensuyue <suyue.chen@intel.com>
2024-11-20 11:00:01 +08:00
ZePan110
85818c97af Fix CD issues (#917)
* Add PREDICTIONGUARD_API_KEY

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Increase timeout

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Fix log name issue

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Remove useless code.

Signed-off-by: ZePan110 <ze.pan@intel.com>

---------

Signed-off-by: ZePan110 <ze.pan@intel.com>
2024-11-20 09:34:40 +08:00
ZePan110
f19cf083d1 Rename image names XXX-hpu to XXX-gaudi (#911)
Signed-off-by: ZePan110 <ze.pan@intel.com>
2024-11-19 22:06:55 +08:00
Letong Han
1bfc4306fd Fix Dataprep Upload Link issue (#913)
* fix html content loading problem

Signed-off-by: letonghan <letong.han@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Add empty list check (#914)

* Add outputs.

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Add empty list check

Signed-off-by: ZePan110 <ze.pan@intel.com>

* test CI.

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Remove test files

Signed-off-by: ZePan110 <ze.pan@intel.com>

* remove debug code

Signed-off-by: chensuyue <suyue.chen@intel.com>

---------

Signed-off-by: ZePan110 <ze.pan@intel.com>
Signed-off-by: chensuyue <suyue.chen@intel.com>
Co-authored-by: chensuyue <suyue.chen@intel.com>

* Fix hardware tag retrieval issue (#916)

Signed-off-by: ZePan110 <ze.pan@intel.com>

* fix html content loading problem

Signed-off-by: letonghan <letong.han@intel.com>

* fix milvus connection issue

Signed-off-by: letonghan <letong.han@intel.com>

* update parse_html function for all dbs

Signed-off-by: letonghan <letong.han@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: letonghan <letong.han@intel.com>
Signed-off-by: ZePan110 <ze.pan@intel.com>
Signed-off-by: chensuyue <suyue.chen@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: ZePan110 <ze.pan@intel.com>
Co-authored-by: chensuyue <suyue.chen@intel.com>
2024-11-19 14:22:30 +08:00
minmin-intel
1cf27817aa fix retriever and reranker to process chat completion request (#915)
* fix retriever and reranker to process chat completion request

Signed-off-by: minmin-intel <minmin.hou@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: minmin-intel <minmin.hou@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-19 13:54:59 +08:00
ZePan110
8121602bac Fix hardware tag retrieval issue (#916)
Signed-off-by: ZePan110 <ze.pan@intel.com>
2024-11-19 09:23:28 +08:00
ZePan110
dca337d90b Add empty list check (#914)
* Add outputs.

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Add empty list check

Signed-off-by: ZePan110 <ze.pan@intel.com>

* test CI.

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Remove test files

Signed-off-by: ZePan110 <ze.pan@intel.com>

* remove debug code

Signed-off-by: chensuyue <suyue.chen@intel.com>

---------

Signed-off-by: ZePan110 <ze.pan@intel.com>
Signed-off-by: chensuyue <suyue.chen@intel.com>
Co-authored-by: chensuyue <suyue.chen@intel.com>
2024-11-18 22:46:00 +08:00
Chun Tao
2a98120edf Add "--no-verbose" flag to wget download commands in entrypoint (#909)
Signed-off-by: Chun Tao <chun.tao@intel.com>
2024-11-18 11:12:35 +08:00
lvliang-intel
8e148a3924 Add env for pass down model id in ChatQnA gateway (#906)
* Pass down model id for ChatQnA

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update logic

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-16 10:13:52 +08:00
Melanie Hart Buehler
c823157428 Fix units of incorrect caption timestamps (#907)
Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com>
2024-11-15 11:12:17 -08:00
Sihan Chen
d547872c9c add zero-shot vc readme (#904) 2024-11-15 15:10:39 +08:00
XinyaoWa
e1475acb55 vllm hpu fix version for bug fix (#903)
* vllm test

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* fix vllm hpu version to fix bug

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* refine readme

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* fix vllm version

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* update vllm ut model

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* revert agent

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* fix bug

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

---------

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
2024-11-15 15:10:27 +08:00
Mustafa
d211cb2dbd Docsum Gateway Fix (#902)
* update gateway

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* update the gateway

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* update the gateway

Signed-off-by: Mustafa <mustafa.cetin@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: Mustafa <mustafa.cetin@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-15 11:14:50 +08:00
Melanie Hart Buehler
405a632b31 Bugfix for follow-up query with a .png image (#900)
* MultimodalQnA bugfix for follow-up query with a .png image

Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Abolfazl Shahbazi <12436063+ashahba@users.noreply.github.com>
2024-11-14 15:42:22 -08:00
rbrugaro
0163ea6f4e trim input to TGI, moved clustering and summarization to dataprep and store in DB (#893)
* trim input to TGI, moved clustering and summarization to dataprep and DB store

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* removed inspect_db causing error in precommit

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add HF token to dataprep container because tokenizer is used now

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* updated READMEs to reflect latest changes

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* bug fix all files are ingested and graph extracted first followed by 1 cluster call for full graph in database

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update README based on fix for multifile

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* Changes to make graphrag ui work

Signed-off-by: theresa <theresa.shan@intel.com>

* fix bug build communities done once at end of ingestion

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* minor fixes

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* README fixes

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

---------

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>
Signed-off-by: theresa <theresa.shan@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: theresa <theresa.shan@intel.com>
2024-11-14 15:29:23 -08:00
lvliang-intel
517a5b04a8 Fix LLM special token issue (#895)
* Fix LLM special token issue

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* update code

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* update logic

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

* update vllm llm

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

---------

Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: ZePan110 <ze.pan@intel.com>
2024-11-14 21:26:15 +08:00
lkk
32bcde4528 fix history content from agent memory. (#899)
* fix history content from agent memory.

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-14 21:26:01 +08:00
ZePan110
0dbf57751b Standardize the naming format of images (#898)
Signed-off-by: ZePan110 <ze.pan@intel.com>
2024-11-14 18:22:27 +08:00
XinyaoWa
7bf1953c23 Embedding compatible with OpenAI API (#892)
* Embedding TEI Langchain compatible with OpenAI API

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* TextDoc support list

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* support tei llama index openai compatible API

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* support mosec langchain openai compatible API

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* update UT for embedding tests

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix ut bug

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* support embedding predictionguard  openai compatible API

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* support embedding multimodal clip OpenAI compatible API

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* fix bug

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* enable debug mode for embedding UT

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com>
Co-authored-by: ZePan110 <ze.pan@intel.com>
2024-11-14 09:29:36 +08:00
Eero Tamminen
441882419a Minor simplication to ServiceOrchestrator code (#889)
* Drop dump_outputs() method that obfuscates the code

dump_outputs() method in ServiceOrchestrator:
* Is not real method (does not use self)
* Adds a member to a dict instead of "dump"ing (drop or output) something
* Obfuscates how schedule() method return value is constructed, and
* Makes calling code unnecessary longer

Similar method in "ServiceOrchestratorWithYaml" is reasonable except
for the name, but drop also that for consistency.

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>

* Apply pylint simplification suggestion to execute()

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>

---------

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>
Co-authored-by: Sihan Chen <39623753+Spycsh@users.noreply.github.com>
2024-11-13 23:44:11 +08:00
sgurunat
e3812a7417 Multiple models and remote service support for langchain vLLM text-generation (#887)
* Multiple models support for langchain vLLM text-generation

Signed-off-by: sgurunat <gurunath.s@intel.com>

* Add authentication support for langchain vLLM text-generation remote endpoints

Signed-off-by: sgurunat <gurunath.s@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: sgurunat <gurunath.s@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-13 21:58:19 +08:00