Zhenzhong1
d6b04b3405
benchmark helmcharts (#995)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-21 11:13:24 +08:00
lvliang-intel
3c164f3aa2
Make rerank run on gaudi for hpu docker compose (#980)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-10-18 21:49:36 +08:00
CharleneHu-42
7669c42085
Update ChatQnA README to add benchmark launcher (#958)
Signed-off-by: CharleneHu-42 <yabai.hu@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Yi Yao <yi.a.yao@intel.com>
2024-10-18 13:33:20 +08:00
lvliang-intel
256b58c07e
Replace environment variables with service name for ChatQnA (#977)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-10-18 11:31:24 +08:00
ylg
37c74b232c
Update ChatQnA yaml and set retriever's TEI_EMBEDDING_ENDPOINT (#953)
Signed-off-by: longguang.yue <bigclouds@163.com>
2024-10-17 16:58:47 +08:00
Sihan Chen
4a265abb73
Fix top_n rerank docs (#976)
2024-10-17 15:49:16 +08:00
Sihan Chen
b0487fe92b
fix chatqna accuracy issue with incorrect penalty (#974)
2024-10-17 15:48:44 +08:00
chen, suyue
eeced9b31c
Enhance CI/CD image build (#961)
Signed-off-by: chensuyue <suyue.chen@intel.com>
2024-10-17 14:33:58 +08:00
WenjiaoYue
b377c2b8f8
Update manifest ui containerPort (#952)
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-17 09:42:55 +08:00
lvliang-intel
c930bea172
Add missing nginx microservice and fix frontend test (#951)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-10-16 13:29:31 +08:00
lvliang-intel
778afb50ac
Clean no wrapper image in performance benchmark manifests (#955)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-10-15 18:21:53 +08:00
lkk
088ab98f31
update examples accuracy (#941)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-14 13:20:50 +08:00
xiguiw
b056ce6617
[Doc] Update ChatQnA AIPC README (#935)
Signed-off-by: Wang, Xigui <xigui.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-12 11:04:53 +08:00
xiguiw
773c32b38b
Fix AIPC retriever and UI error (#933)
Signed-off-by: Wang, Xigui <xigui.wang@intel.com>
2024-10-11 13:35:27 +08:00
lvliang-intel
619d941047
Set no wrapper ChatQnA as default (#891)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-11 13:30:45 +08:00
Abolfazl Shahbazi
b71a12d424
Remove 'vim' from Dockerfiles (#924)
Signed-off-by: Abolfazl Shahbazi <abolfazl.shahbazi@intel.com>
2024-10-10 18:24:31 -07:00
feng-intel
ae10712fe8
doc: Update ChatQnA/benchmark/performance doc (#930)
2024-10-10 16:30:40 +08:00
pallavijaini0525
e2f9037344
Added the K8s yaml for vLLM support (#917)
Signed-off-by: desaidhr <dhruv.desai@intel.com>
Co-authored-by: desaidhr <dhruv.desai@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-10 11:08:07 +08:00
Zhenzhong1
d16c80e493
[ChatQnA] manage your own ChatQnA pipelines. (#878)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-30 17:01:44 +09:00
sri-intel
2de1bfc5bb
Bug fix for issue #881 (#882)
Signed-off-by: srinarayan-srikanthan <srinarayan.srikanthan@intel.com>
2024-09-27 13:06:02 +08:00
sri-intel
75df2c9979
docker install instruction for csp (#843)
Signed-off-by: sri <srinarayan.srikanthan@intel.com>
Signed-off-by: srinarayan-srikanthan <srinarayan.srikanthan@intel.com>
2024-09-27 13:00:10 +08:00
jotpalch
bd32b03e3c
Doc: Update folder path to correct location in "Deploy ChatQnA in Kubernetes" (#875)
2024-09-26 14:38:22 +08:00
xiguiw
9d0b49c2d6
[doc] Update AIPC document (#874)
Signed-off-by: Wang, Xigui <xigui.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-26 14:28:16 +08:00
David Kinder
99c10933b4
doc: fix doc heading (#873)
Signed-off-by: David B. Kinder <david.b.kinder@intel.com>
2024-09-26 12:33:57 +09:00
Zhenzhong1
c1038d2193
[ChatQnA] Deploy ChatQnA for benchmarking with different configurations. (#870)
2024-09-25 16:47:44 +08:00
lvliang-intel
33b9d4e421
Remove redundant code and update tgi version (#871)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-09-25 15:33:33 +08:00
David Kinder
3e796ba73d
doc: fix missing references to README.md (#860)
Signed-off-by: David B. Kinder <david.b.kinder@intel.com>
2024-09-24 21:40:42 +08:00
Steve Zhang
954a22051b
Make all xeon tgi image version consistent (#851)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-24 11:19:37 +08:00
lvliang-intel
3fb60608b3
Use official tei gaudi image and update tgi gaudi version (#810)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-23 17:52:56 +08:00
Letong Han
c35fe0b429
[Doc] Update ChatQnA README for Nginx Docker Image (#862)
Signed-off-by: letonghan <letong.han@intel.com>
2024-09-23 12:25:30 +09:00
lvliang-intel
28f5e4a268
Add docker based benchmark instructions for ChatQnA (#859)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-23 10:14:44 +08:00
Letong Han
7eaab93d0b
[Doc] Refine ChatQnA README (#855)
Signed-off-by: letonghan <letong.han@intel.com>
2024-09-20 11:20:20 +08:00
Neo Zhang Jianyu
bc817700b9
refactor the network port setting for AWS (#849)
Co-authored-by: ZhangJianyu <zhang.jianyu@outlook.com>
2024-09-19 21:58:56 +08:00
lvliang-intel
bd811bd622
Add validate microservice details link (#852)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2024-09-19 21:54:32 +08:00
WenjiaoYue
05f9828e77
Add nginx and UI to the ChatQnA manifest (#848)
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-19 21:04:12 +08:00
Letong Han
6c364487d3
[ChatQnA] Add Nginx in Docker Compose and README (#850)
Signed-off-by: letonghan <letong.han@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-19 20:39:58 +08:00
ZePan110
21e215c5d5
Refine code scan output and remove opea_release_data.md. (#844)
Signed-off-by: ZePan110 <ze.pan@intel.com>
2024-09-19 17:34:55 +08:00
lkk
f04f061f8c
move evaluation scripts (#842)
Co-authored-by: root <root@idc708073.jf.intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-19 15:59:13 +08:00
XinyaoWa
2f03a3a894
Align parameters for "max_token, repetition_penalty,presence_penalty,frequency_penalty" (#726)
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-19 14:15:25 +08:00
Letong Han
372d78c2ac
[Doc] Refine READMEs (#841)
Signed-off-by: letonghan <letong.han@intel.com>
2024-09-19 13:25:40 +08:00
Zhenzhong1
933c3d3445
[ChatQnA] Update OOB with wrapper manifests. (#823)
2024-09-19 11:03:10 +08:00
kevinintel
3b70fb0d42
Refine the quick start of ChatQnA (#828)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-18 22:23:22 +08:00
kevinintel
e0b3b579a3
[Doc] doc improvement (#811)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-18 15:21:28 +08:00
chen, suyue
e5affb93ab
update V1.0 benchmark manifest (#822)
Co-authored-by: Zhenzhong1 <zhenzhong.xu@intel.com>
2024-09-18 10:36:33 +08:00
lvliang-intel
bceacdc804
Fix README issues (#817)
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-18 09:50:17 +08:00
Louie Tsai
375ea7a90c
Improve ChatQnA flowchat according to feedback (#736)
Signed-off-by: Tsai, Louie <louie.tsai@intel.com>
2024-09-16 18:29:13 -07:00
Sihan Chen
06696c8e58
[ChatQnA] Add no_wrapper benchmarking and update legacy manifests (#767)
Co-authored-by: Zhenzhong1 <zhenzhong.xu@intel.com>
2024-09-14 16:17:15 +08:00
lkk
ba17031198
add tgi bf16 setup on CPU k8s. (#795)
Co-authored-by: root <root@idc708073.jf.intel.com>
Co-authored-by: Eero Tamminen <eero.t.tamminen@intel.com>
2024-09-13 19:55:57 +08:00
ZhaoqiongZ
f990f7966e
update doc according to comments (#805)
Signed-off-by: Zheng, Zhaoqiong <zhaoqiong.zheng@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-13 19:55:33 +08:00
Ying Hu
87e51d5c36
Update README.md of pdf file (#804)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-13 17:14:34 +08:00