Louie Tsai
7adbba6add
Enable vLLM Profiling for ChatQnA (#1124)
2024-11-13 11:26:31 +08:00
pallavijaini0525
0d52c2f003
Pinecone update to Readme and docker compose for ChatQnA (#540)
...
Signed-off-by: pallavi jaini <pallavi.jaini@intel.com>
Signed-off-by: AI Workloads <aigoldrush1@g2-r3-2.iind.intel.com>
Signed-off-by: Pallavi Jaini <pallavi,jaini@intel.com>
Signed-off-by: Pallavi Jaini <pallavi.jaini@intel.com>
Signed-off-by: root <root@test-pjaini.535545281608.us-region-2.idcservice.net>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: AI Workloads <aigoldrush1@g2-r3-2.iind.intel.com>
Co-authored-by: Pallavi Jaini <pallavi,jaini@intel.com>
Co-authored-by: root <root@test-pjaini.535545281608.us-region-2.idcservice.net>
Co-authored-by: chen, suyue <suyue.chen@intel.com>
2024-11-13 09:32:37 +08:00
lvliang-intel
1ff85f6a85
Upgrade TGI Gaudi version to v2.0.6 (#1088)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com>
2024-11-12 14:38:22 +08:00
bjzhjing
f7a7f8aa3f
Fix typo (#1117)
...
Signed-off-by: Cathy Zhang <cathy.zhang@intel.com>
2024-11-12 09:54:05 +08:00
lvliang-intel
e3187be819
Update ChatQnA manifests using always pull image policy (#1100)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-11-11 14:37:14 +08:00
Sihan Chen
abd9d12937
Fix non stream case (#1115)
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-11 14:18:42 +08:00
bjzhjing
a7353bbaa4
Refine performance directory (#1017)
...
Signed-off-by: Cathy Zhang <cathy.zhang@intel.com>
2024-11-11 13:58:46 +08:00
Letong Han
aa314f6757
[Readme] Update ChatQnA Readme for LLM Endpoint (#1086)
...
Signed-off-by: letonghan <letong.han@intel.com>
2024-11-11 13:53:06 +08:00
Wang, Kai Lawrence
f7026773b8
[ChatQnA] Fix the no_proxy setting for gpu example (#1078)
...
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
2024-11-08 22:27:51 +08:00
XinyaoWa
40386d9bd6
remove vllm-on-ray (#1084)
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
2024-11-08 13:01:48 +08:00
lvliang-intel
4635a927fa
Make embedding run on CPU for aligning with Gaudi performance benchmark (#1057)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com>
2024-11-07 17:39:34 +08:00
XinyaoWa
e9b164505e
align vllm hpu version to latest vllm-fork (#1061)
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
2024-11-07 14:08:56 +08:00
Arthur Leung
6263b517b9
[Doc] Add steps to deploy opea services using minikube (#1058)
...
Signed-off-by: Arthur Leung <arcyleung@gmail.com>
Co-authored-by: Arthur Leung <arcyleung@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-07 13:57:34 +08:00
Wang, Kai Lawrence
944ae47948
[ChatQnA] Fix the service connection issue on GPU and modify the emb backend (#1059)
...
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
2024-11-06 10:22:21 +08:00
xiguiw
a0921f127f
[Doc] Fix broken build instruction (#1063)
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com>
2024-11-05 13:35:12 +08:00
lvliang-intel
0306c620b5
Update TGI CPU image to latest official release 2.4.0 (#1035)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-04 11:28:43 +08:00
lkk
3372b9d480
update accuracy embedding endpoint for no wrapper (#1056)
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-04 09:18:49 +08:00
XinyaoWa
c65d7d40fb
fix vllm output in chatqna (#1038)
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
2024-11-01 09:26:57 +08:00
chen, suyue
0f5a9c4a5e
Fix ChatQnA manifest test issue on Xeon (#1044)
...
Signed-off-by: chensuyue <suyue.chen@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-31 14:23:17 +08:00
lvliang-intel
7197286a14
Fix ChatQnA manifest default port issue (#1033)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-10-30 11:52:04 +08:00
xiguiw
95b58b51fa
Fix AIPC docker container network issue (#1021)
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com>
2024-10-25 10:46:57 +08:00
Louie Tsai
a10b4a1f1d
Address request from Issue#971 (#1018)
2024-10-23 23:57:52 -07:00
RuijingGuo
def39cfcdc
setup ollama service in aipc docker compose (#1008)
...
Signed-off-by: Guo Ruijing <ruijing.guo@intel.com>
2024-10-23 14:22:48 +08:00
lvliang-intel
0eedbbfce0
Update aipc ollama docker compose and readme (#984)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com>
2024-10-22 10:30:47 +08:00
lvliang-intel
9438d392b4
Update README for some minor issues (#1000)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-10-22 10:30:18 +08:00
lvliang-intel
3c164f3aa2
Make rerank run on gaudi for hpu docker compose (#980)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-10-18 21:49:36 +08:00
CharleneHu-42
7669c42085
Update ChatQnA README to add benchmark launcher (#958)
...
Signed-off-by: CharleneHu-42 <yabai.hu@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Yi Yao <yi.a.yao@intel.com>
2024-10-18 13:33:20 +08:00
lvliang-intel
256b58c07e
Replace environment variables with service name for ChatQnA (#977)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-10-18 11:31:24 +08:00
ylg
37c74b232c
Update ChatQnA yaml and set retriever's TEI_EMBEDDING_ENDPOINT (#953)
...
Signed-off-by: longguang.yue <bigclouds@163.com>
2024-10-17 16:58:47 +08:00
Sihan Chen
4a265abb73
Fix top_n rerank docs (#976)
2024-10-17 15:49:16 +08:00
Sihan Chen
b0487fe92b
fix chatqna accuracy issue with incorrect penalty (#974)
2024-10-17 15:48:44 +08:00
chen, suyue
eeced9b31c
Enhance CI/CD image build (#961)
...
Signed-off-by: chensuyue <suyue.chen@intel.com>
2024-10-17 14:33:58 +08:00
WenjiaoYue
b377c2b8f8
Update manifest ui containerPort (#952)
...
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-17 09:42:55 +08:00
lvliang-intel
c930bea172
Add missing nginx microservice and fix frontend test (#951)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-10-16 13:29:31 +08:00
lvliang-intel
778afb50ac
Clean no wrapper image in performance benchmark manifests (#955)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-10-15 18:21:53 +08:00
lkk
088ab98f31
update examples accuracy (#941)
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-14 13:20:50 +08:00
xiguiw
b056ce6617
[Doc] Update ChatQnA AIPC README (#935)
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-12 11:04:53 +08:00
xiguiw
773c32b38b
Fix AIPC retriever and UI error (#933)
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com>
2024-10-11 13:35:27 +08:00
lvliang-intel
619d941047
Set no wrapper ChatQnA as default (#891)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-11 13:30:45 +08:00
Abolfazl Shahbazi
b71a12d424
Remove 'vim' from Dockerfiles (#924)
...
Signed-off-by: Abolfazl Shahbazi <abolfazl.shahbazi@intel.com>
2024-10-10 18:24:31 -07:00
feng-intel
ae10712fe8
doc: Update ChatQnA/benchmark/performance doc (#930)
2024-10-10 16:30:40 +08:00
pallavijaini0525
e2f9037344
Added the K8s yaml for vLLM support (#917)
...
Signed-off-by: desaidhr <dhruv.desai@intel.com>
Co-authored-by: desaidhr <dhruv.desai@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-10 11:08:07 +08:00
Zhenzhong1
d16c80e493
[ChatQnA] manage your own ChatQnA pipelines. (#878)
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-30 17:01:44 +09:00
sri-intel
2de1bfc5bb
Bug fix for issue #881 (#882)
...
Signed-off-by: srinarayan-srikanthan <srinarayan.srikanthan@intel.com>
2024-09-27 13:06:02 +08:00
sri-intel
75df2c9979
docker install instruction for csp (#843)
...
Signed-off-by: sri <srinarayan.srikanthan@intel.com>
Signed-off-by: srinarayan-srikanthan <srinarayan.srikanthan@intel.com>
2024-09-27 13:00:10 +08:00
jotpalch
bd32b03e3c
Doc: Update folder path to correct location in "Deploy ChatQnA in Kubernetes" (#875)
2024-09-26 14:38:22 +08:00
xiguiw
9d0b49c2d6
[doc] Update AIPC document (#874)
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-26 14:28:16 +08:00
David Kinder
99c10933b4
doc: fix doc heading (#873)
...
Signed-off-by: David B. Kinder <david.b.kinder@intel.com>
2024-09-26 12:33:57 +09:00
Zhenzhong1
c1038d2193
[ChatQnA] Deploy ChatQnA for benchmarking with different configurations. (#870)
2024-09-25 16:47:44 +08:00
lvliang-intel
33b9d4e421
Remove redundant code and update tgi version (#871)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-09-25 15:33:33 +08:00