Liang Lv
0f7e5a37ac
Adapt code for dataprep microservice refactor (#1408)
...
https://github.com/opea-project/GenAIComps/pull/1153
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2025-01-20 20:37:03 +08:00
xiguiw
2d5898244c
Enhance health check in GenAIExample docker-compose (#1410)
...
Fix service launch issue
1. Update Gaudi TGI image from 2.0.6 to 2.3.1
2. Change the hpu-gaudi TGI health check condition.
Signed-off-by: Wang, Xigui <xigui.wang@intel.com>
2025-01-20 20:13:13 +08:00
Wang, Kai Lawrence
742cb6ddd3
[ChatQnA] Switch to vLLM as default llm backend on Xeon (#1403)
...
Switch from TGI to vLLM as the default LLM serving backend on Xeon for the ChatQnA example to improve performance.
https://github.com/opea-project/GenAIExamples/issues/1213
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
2025-01-17 20:48:19 +08:00
Wang, Kai Lawrence
00e9da9ced
[ChatQnA] Switch to vLLM as default llm backend on Gaudi (#1404)
...
Switch from TGI to vLLM as the default LLM serving backend on Gaudi for the ChatQnA example to improve performance.
https://github.com/opea-project/GenAIExamples/issues/1213
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
2025-01-17 20:46:38 +08:00
Letong Han
4cabd55778
Refactor Retrievers related Examples (#1387)
...
Delete redundant retrievers docker image in docker_images_list.md.
Refactor Retrievers related Examples READMEs.
Change all comps/retrievers/xxx/xxx/Dockerfile paths to comps/retrievers/src/Dockerfile.
Fix the Examples CI issues of PR opea-project/GenAIComps#1138.
Signed-off-by: letonghan <letong.han@intel.com>
2025-01-16 14:21:48 +08:00
xiguiw
698a06edbf
[DOC] Fix document issue (#1395)
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com>
2025-01-16 11:30:07 +08:00
Liang Lv
3ca78867eb
Update example code for embedding dependency moving to 3rd_party (#1368)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2025-01-10 15:36:58 +08:00
Louie Tsai
81022355a7
Enable OpenTelemetry Tracing for ChatQnA TGI serving on Gaudi (#1316)
...
Signed-off-by: Tsai, Louie <louie.tsai@intel.com>
2025-01-08 17:20:13 -08:00
Liang Lv
b3c405a5f6
Adapt example code for guardrails refactor (#1360)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Signed-off-by: chensuyue <suyue.chen@intel.com>
2025-01-08 14:35:23 +08:00
ZePan110
ed2b8ed983
Exclude dockerfile under tests and exclude check Dockerfile under tests. (#1354)
...
Signed-off-by: ZePan110 <ze.pan@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-01-07 09:05:01 +08:00
ZePan110
aa5c91d7ee
Check duplicated dockerfile (#1289)
...
Signed-off-by: ZePan110 <ze.pan@intel.com>
2025-01-06 17:30:12 +08:00
chen, suyue
5c7a5bd850
Update Code and README for GenAIComps Refactor (#1285)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Signed-off-by: chensuyue <suyue.chen@intel.com>
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Signed-off-by: letonghan <letong.han@intel.com>
Signed-off-by: ZePan110 <ze.pan@intel.com>
Signed-off-by: WenjiaoYue <redacted>
2025-01-02 20:03:26 +08:00
Ying Hu
597f17b979
Update set_env.sh to fix LOGFLAG warning (#1319)
2024-12-30 10:54:26 +08:00
Wang, Kai Lawrence
4c01e14642
[ChatQnA] Remove enforce-eager to enable HPU graphs for better vLLM perf (#1210)
...
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
2024-12-10 13:19:15 +08:00
pallavijaini0525
3a371ac102
Updated the Pinecone readme to reflect the new structure (#1222)
...
Signed-off-by: Pallavi Jaini <pallavi.jaini@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-12-05 10:04:09 +08:00
Sihan Chen
907b30b7fe
Refactor service names (#1199)
2024-11-28 10:01:31 +08:00
Wang, Kai Lawrence
ac470421d0
Update the llm backend ports (#1172)
...
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
2024-11-22 09:20:09 +08:00
ZePan110
8808b51e42
Rename image name XXX-hpu to XXX-gaudi (#1154)
...
Signed-off-by: ZePan110 <ze.pan@intel.com>
2024-11-19 22:18:41 +08:00
Louie Tsai
152adf8012
Maintain version info for docker_compose yaml files across releases (#1141)
...
Signed-off-by: Tsai, Louie <louie.tsai@intel.com>
2024-11-17 22:39:41 -08:00
Louie Tsai
00d9bb6128
Enable vLLM Profiling for ChatQnA on Gaudi (#1128)
...
Signed-off-by: Tsai, Louie <louie.tsai@intel.com>
2024-11-14 15:46:33 -08:00
lvliang-intel
9ff7df9202
Use fixed version of TEI Gaudi for stability (#1101)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: Malini Bhandaru <malini.bhandaru@intel.com>
2024-11-13 10:45:50 -08:00
chen, suyue
393367e9f1
Fix remaining issue of tgi version update (#1121)
...
Signed-off-by: chensuyue <suyue.chen@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-13 15:42:42 +08:00
Louie Tsai
7adbba6add
Enable vLLM Profiling for ChatQnA (#1124)
2024-11-13 11:26:31 +08:00
pallavijaini0525
0d52c2f003
Pinecone update to Readme and docker compose for ChatQnA (#540)
...
Signed-off-by: pallavi jaini <pallavi.jaini@intel.com>
Signed-off-by: AI Workloads <aigoldrush1@g2-r3-2.iind.intel.com>
Signed-off-by: Pallavi Jaini <pallavi,jaini@intel.com>
Signed-off-by: Pallavi Jaini <pallavi.jaini@intel.com>
Signed-off-by: root <root@test-pjaini.535545281608.us-region-2.idcservice.net>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: AI Workloads <aigoldrush1@g2-r3-2.iind.intel.com>
Co-authored-by: Pallavi Jaini <pallavi,jaini@intel.com>
Co-authored-by: root <root@test-pjaini.535545281608.us-region-2.idcservice.net>
Co-authored-by: chen, suyue <suyue.chen@intel.com>
2024-11-13 09:32:37 +08:00
lvliang-intel
1ff85f6a85
Upgrade TGI Gaudi version to v2.0.6 (#1088)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com>
2024-11-12 14:38:22 +08:00
Letong Han
aa314f6757
[Readme] Update ChatQnA Readme for LLM Endpoint (#1086)
...
Signed-off-by: letonghan <letong.han@intel.com>
2024-11-11 13:53:06 +08:00
XinyaoWa
40386d9bd6
remove vllm-on-ray (#1084)
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
2024-11-08 13:01:48 +08:00
lvliang-intel
4635a927fa
Make embedding run on CPU for aligning with Gaudi performance benchmark (#1057)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com>
2024-11-07 17:39:34 +08:00
XinyaoWa
e9b164505e
align vllm hpu version to latest vllm-fork (#1061)
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
2024-11-07 14:08:56 +08:00
Arthur Leung
6263b517b9
[Doc] Add steps to deploy opea services using minikube (#1058)
...
Signed-off-by: Arthur Leung <arcyleung@gmail.com>
Co-authored-by: Arthur Leung <arcyleung@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-07 13:57:34 +08:00
xiguiw
a0921f127f
[Doc] Fix broken build instruction (#1063)
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com>
2024-11-05 13:35:12 +08:00
lvliang-intel
0306c620b5
Update TGI CPU image to latest official release 2.4.0 (#1035)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-04 11:28:43 +08:00
xiguiw
95b58b51fa
Fix AIPC docker container network issue (#1021)
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com>
2024-10-25 10:46:57 +08:00
Louie Tsai
a10b4a1f1d
Address request from Issue #971 (#1018)
2024-10-23 23:57:52 -07:00
RuijingGuo
def39cfcdc
setup ollama service in aipc docker compose (#1008)
...
Signed-off-by: Guo Ruijing <ruijing.guo@intel.com>
2024-10-23 14:22:48 +08:00
lvliang-intel
0eedbbfce0
Update aipc ollama docker compose and readme (#984)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com>
2024-10-22 10:30:47 +08:00
lvliang-intel
9438d392b4
Update README for some minor issues (#1000)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-10-22 10:30:18 +08:00
lvliang-intel
3c164f3aa2
Make rerank run on gaudi for hpu docker compose (#980)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-10-18 21:49:36 +08:00
lvliang-intel
256b58c07e
Replace environment variables with service name for ChatQnA (#977)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-10-18 11:31:24 +08:00
ylg
37c74b232c
Update ChatQnA yaml and set retriever's TEI_EMBEDDING_ENDPOINT (#953)
...
Signed-off-by: longguang.yue <bigclouds@163.com>
2024-10-17 16:58:47 +08:00
lvliang-intel
c930bea172
Add missing nginx microservice and fix frontend test (#951)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-10-16 13:29:31 +08:00
xiguiw
b056ce6617
[Doc] Update ChatQnA AIPC README (#935)
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-12 11:04:53 +08:00
xiguiw
773c32b38b
Fix AIPC retriever and UI error (#933)
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com>
2024-10-11 13:35:27 +08:00
lvliang-intel
619d941047
Set no wrapper ChatQnA as default (#891)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-11 13:30:45 +08:00
sri-intel
2de1bfc5bb
Bug fix for issue #881 (#882)
...
Signed-off-by: srinarayan-srikanthan <srinarayan.srikanthan@intel.com>
2024-09-27 13:06:02 +08:00
xiguiw
9d0b49c2d6
[doc] Update AIPC document (#874)
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-26 14:28:16 +08:00
Steve Zhang
954a22051b
Make all xeon tgi image version consistent (#851)
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-24 11:19:37 +08:00
lvliang-intel
3fb60608b3
Use official tei gaudi image and update tgi gaudi version (#810)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-23 17:52:56 +08:00
Letong Han
c35fe0b429
[Doc] Update ChatQnA README for Nginx Docker Image (#862)
...
Signed-off-by: letonghan <letong.han@intel.com>
2024-09-23 12:25:30 +09:00
Letong Han
7eaab93d0b
[Doc] Refine ChatQnA README (#855)
...
Signed-off-by: letonghan <letong.han@intel.com>
2024-09-20 11:20:20 +08:00