chen, suyue
c546d96e98
downgrade tei version from 1.6 to 1.5, fix the chatqna perf regression ( #1886 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-04-25 23:00:36 +08:00
Xiaotian Chen
1bd56af994
Update TGI image versions ( #1625 )
...
Signed-off-by: xiaotia3 <xiaotian.chen@intel.com >
2025-04-01 11:27:51 +08:00
XinyaoWa
6d24c1c77a
Merge FaqGen into ChatQnA ( #1654 )
...
1. Delete FaqGen
2. Refactor FaqGen into ChatQnA, serving as an LLM selection.
3. Combine all ChatQnA related Dockerfile into one
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
2025-03-20 17:40:00 +08:00
Liang Lv
0f7e5a37ac
Adapt code for dataprep microservice refactor ( #1408 )
...
https://github.com/opea-project/GenAIComps/pull/1153
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2025-01-20 20:37:03 +08:00
Ying Hu
91ff520baa
Update README.md for add K8S cluster link for Gaudi ( #1380 )
2025-01-13 09:33:58 +08:00
XinyaoWa
464e2d3125
Rename streaming to stream to align with OpenAI API ( #1332 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
2025-01-06 13:25:55 +08:00
chen, suyue
1f29eca288
fix chatqna benchmark without rerank config issue ( #1341 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
2025-01-06 09:16:20 +08:00
bjzhjing
7d9b34cf5e
Chatqna/benchmark: Remove the deprecated directory ( #1261 )
...
Signed-off-by: Cathy Zhang <cathy.zhang@intel.com >
2024-12-19 10:51:01 +08:00
Letong Han
545aa571bf
[ChatQnA] Update Benchmark E2E Parameters ( #1200 )
...
Signed-off-by: letonghan <letong.han@intel.com >
2024-11-27 17:11:11 +08:00
bjzhjing
ef2047b070
Adjustments for helm release change ( #1173 )
...
Signed-off-by: Cathy Zhang <cathy.zhang@intel.com >
2024-11-21 14:14:27 +08:00
chen, suyue
f70d9c3853
chatqna benchmark for v1.1 release ( #1120 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
Signed-off-by: Cathy Zhang <cathy.zhang@intel.com >
2024-11-19 22:57:25 +08:00
lvliang-intel
9ff7df9202
Use fixed version of TEI Gaudi for stability ( #1101 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: Malini Bhandaru <malini.bhandaru@intel.com >
2024-11-13 10:45:50 -08:00
lvliang-intel
1ff85f6a85
Upgrade TGI Gaudi version to v2.0.6 ( #1088 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: chen, suyue <suyue.chen@intel.com >
2024-11-12 14:38:22 +08:00
bjzhjing
f7a7f8aa3f
Fix typo ( #1117 )
...
Signed-off-by: Cathy Zhang <cathy.zhang@intel.com >
2024-11-12 09:54:05 +08:00
bjzhjing
a7353bbaa4
Refine performance directory ( #1017 )
...
Signed-off-by: Cathy Zhang <cathy.zhang@intel.com >
2024-11-11 13:58:46 +08:00
lkk
3372b9d480
update accuracy embedding endpoint for no wrapper ( #1056 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-04 09:18:49 +08:00
CharleneHu-42
7669c42085
Update ChatQnA README to add benchmark launcher ( #958 )
...
Signed-off-by: CharleneHu-42 <yabai.hu@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Yi Yao <yi.a.yao@intel.com >
2024-10-18 13:33:20 +08:00
lvliang-intel
778afb50ac
Clean no wrapper image in performance benchmark manifests ( #955 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2024-10-15 18:21:53 +08:00
lkk
088ab98f31
update examples accuracy ( #941 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-14 13:20:50 +08:00
lvliang-intel
619d941047
Set no wrapper ChatQnA as default ( #891 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-11 13:30:45 +08:00
feng-intel
ae10712fe8
doc: Update ChatQnA/benchmark/performance doc ( #930 )
2024-10-10 16:30:40 +08:00
Zhenzhong1
d16c80e493
[ChatQnA] manage your own ChatQnA pipelines. ( #878 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-30 17:01:44 +09:00
David Kinder
99c10933b4
doc: fix doc heading ( #873 )
...
Signed-off-by: David B. Kinder <david.b.kinder@intel.com >
2024-09-26 12:33:57 +09:00
Zhenzhong1
c1038d2193
[ChatQnA] Deploy ChatQnA for benchmarking with different configurations. ( #870 )
2024-09-25 16:47:44 +08:00
lvliang-intel
33b9d4e421
Remove redundant code and update tgi version ( #871 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2024-09-25 15:33:33 +08:00
David Kinder
3e796ba73d
doc: fix missing references to README.md ( #860 )
...
Signed-off-by: David B. Kinder <david.b.kinder@intel.com >
2024-09-24 21:40:42 +08:00
lvliang-intel
3fb60608b3
Use official tei gaudi image and update tgi gaudi version ( #810 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-23 17:52:56 +08:00
lvliang-intel
28f5e4a268
Add docker based benchmark instructions for ChatQnA ( #859 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-23 10:14:44 +08:00
ZePan110
21e215c5d5
Refine code scan output and remove opea_release_data.md. ( #844 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2024-09-19 17:34:55 +08:00
lkk
f04f061f8c
move evaluation scripts ( #842 )
...
Co-authored-by: root <root@idc708073.jf.intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-19 15:59:13 +08:00
XinyaoWa
2f03a3a894
Align parameters for "max_token, repetition_penalty,presence_penalty,frequency_penalty" ( #726 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-19 14:15:25 +08:00
Zhenzhong1
933c3d3445
[ChatQnA] Update OOB with wrapper manifests. ( #823 )
2024-09-19 11:03:10 +08:00
chen, suyue
e5affb93ab
update V1.0 benchmark manifest ( #822 )
...
Co-authored-by: Zhenzhong1 <zhenzhong.xu@intel.com >
2024-09-18 10:36:33 +08:00
Sihan Chen
06696c8e58
[ChatQnA] Add no_wrapper benchmarking and update legacy manifests ( #767 )
...
Co-authored-by: Zhenzhong1 <zhenzhong.xu@intel.com >
2024-09-14 16:17:15 +08:00
Zhenzhong1
36fb9a987d
[ChatQnA] Update benchmarking manifests ( #766 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-10 11:07:48 +08:00
David Kinder
67394b88fa
doc: fix headings and indenting ( #748 )
...
* doc: fix headings and indenting
* only one H1 header (for title) is allowed
* fix indenting under ordered lists
Signed-off-by: David B. Kinder <david.b.kinder@intel.com >
2024-09-06 12:59:33 +08:00
chen, suyue
947936ed7b
Update v0.9 RAG release data ( #747 )
...
* run both xeon and gaudi when both hardware are detected
Signed-off-by: chensuyue <suyue.chen@intel.com >
* add v0.9 RAG release data
Signed-off-by: chensuyue <suyue.chen@intel.com >
* update system summary
Signed-off-by: chensuyue <suyue.chen@intel.com >
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
---------
Signed-off-by: chensuyue <suyue.chen@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-06 09:55:55 +08:00
Zhenzhong1
ac3486038c
[ChatQnA] update OOB & Tuned manifests ( #738 )
...
* update OOB manifests
* update tgi parameters
* update OOB manifests for w/o rerank
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update tgi parameters
* update tgi parameters for v0.9 w/o rerank
* update OOB manifests 2.0.4->2.0.1 for w/o rerank
* update tuned manifests
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update tuned manifests
---------
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-05 22:23:51 +08:00
chen, suyue
e0bc5f2a4d
update logs from standard cd perf workflow ( #733 )
...
Signed-off-by: chensuyue <suyue.chen@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-05 14:58:24 +08:00
Zhenzhong1
6730b242cc
[ChatQnA] Update retrieval & dataprep manifests ( #717 )
...
* modify tgi hyperparameters
* upgrade tgi 2.0.1 to 2.0.4
* Update dataprep-microservice_run.yaml
* Update retrieval-microservice_run.yaml
* Update retrieval-microservice_run.yaml
* Update dataprep-microservice_run.yaml
* Update dataprep-microservice_run.yaml
* Update dataprep-microservice_run.yaml
* Update retrieval-microservice_run.yaml
* Update retrieval-microservice_run.yaml
2024-09-04 19:50:46 +08:00
Letong Han
4a51874e4d
update readme for w/o rerank ( #731 )
...
Signed-off-by: letonghan <letong.han@intel.com >
2024-09-04 15:01:30 +08:00
Letong Han
55d287dfcf
update readme to fix input length ( #720 )
...
Signed-off-by: letonghan <letong.han@intel.com >
2024-09-03 19:01:28 +08:00
Zhenzhong1
3563f5db6b
[ChatQnA]Update manifests ( #716 )
...
* update manifests for v0.9
2024-09-03 15:24:54 +08:00
David Kinder
c6d811ab11
doc: remove invalid code block language ( #705 )
...
Signed-off-by: David B. Kinder <david.b.kinder@intel.com >
2024-09-03 10:10:56 +08:00
lvliang-intel
af21e94a29
Add benchmark README for ChatQnA ( #662 )
...
* Add benchmark README for ChatQnA
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add benchmark.yaml
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update yaml path
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
* fix preci issue
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update title
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
---------
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-08-26 22:39:57 +08:00
Zhenzhong1
e6b4fff05c
Update the number of microservice replicas for OPEA v0.9 ( #645 )
2024-08-22 09:48:47 +08:00
Zhenzhong1
ba78b4c994
update manifests for v0.9 ( #632 )
...
* update model HF TOKEN variables & reranking name for v0.9
2024-08-20 14:35:14 +08:00
Ying Chun Guo
4fd3517f23
update benchmark manifest to fix errors ( #626 )
...
Signed-off-by: Yingchun Guo <yingchun.guo@intel.com >
2024-08-19 21:59:26 +08:00
Zhenzhong1
08f57fa54a
update manifests for v0.9 ( #623 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-08-19 15:55:04 +08:00
lkk
02a15366bc
update chatqna dataprep-redis port ( #557 )
...
Co-authored-by: sdp <sdp@srf797636.jf.intel.com >
2024-08-08 22:23:32 +08:00