GenAIExamples

Author	SHA1	Message	Date
chen, suyue	c546d96e98	downgrade tei version from 1.6 to 1.5, fix the chatqna perf regression (#1886 ) Signed-off-by: chensuyue <suyue.chen@intel.com>	2025-04-25 23:00:36 +08:00
Xiaotian Chen	1bd56af994	Update TGI image versions (#1625 ) Signed-off-by: xiaotia3 <xiaotian.chen@intel.com>	2025-04-01 11:27:51 +08:00
XinyaoWa	6d24c1c77a	Merge FaqGen into ChatQnA (#1654 ) 1. Delete FaqGen 2. Refactor FaqGen into ChatQnA, serve as a LLM selection. 3. Combine all ChatQnA related Dockerfile into one Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>	2025-03-20 17:40:00 +08:00
Liang Lv	0f7e5a37ac	Adapt code for dataprep microservice refactor (#1408 ) https://github.com/opea-project/GenAIComps/pull/1153 Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2025-01-20 20:37:03 +08:00
Ying Hu	91ff520baa	Update README.md for add K8S cluster link for Gaudi (#1380 )	2025-01-13 09:33:58 +08:00
XinyaoWa	464e2d3125	Rename streaming to stream to align with OpenAI API (#1332 ) Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>	2025-01-06 13:25:55 +08:00
chen, suyue	1f29eca288	fix chatqna benchmark without rerank config issue (#1341 ) Signed-off-by: chensuyue <suyue.chen@intel.com>	2025-01-06 09:16:20 +08:00
bjzhjing	7d9b34cf5e	Chatqna/benchmark: Remove the deprecated directory (#1261 ) Signed-off-by: Cathy Zhang <cathy.zhang@intel.com>	2024-12-19 10:51:01 +08:00
Letong Han	545aa571bf	[ChatQnA] Update Benchmark E2E Parameters (#1200 ) Signed-off-by: letonghan <letong.han@intel.com>	2024-11-27 17:11:11 +08:00
bjzhjing	ef2047b070	Adjustments for helm release change (#1173 ) Signed-off-by: Cathy Zhang <cathy.zhang@intel.com>	2024-11-21 14:14:27 +08:00
chen, suyue	f70d9c3853	chatqna benchmark for v1.1 release (#1120 ) Signed-off-by: chensuyue <suyue.chen@intel.com> Signed-off-by: Cathy Zhang <cathy.zhang@intel.com>	2024-11-19 22:57:25 +08:00
lvliang-intel	9ff7df9202	Use fixed version of TEI Gaudi for stability (#1101 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: Malini Bhandaru <malini.bhandaru@intel.com>	2024-11-13 10:45:50 -08:00
lvliang-intel	1ff85f6a85	Upgrade TGI Gaudi version to v2.0.6 (#1088 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: chen, suyue <suyue.chen@intel.com>	2024-11-12 14:38:22 +08:00
bjzhjing	f7a7f8aa3f	Fix typo (#1117 ) Signed-off-by: Cathy Zhang <cathy.zhang@intel.com>	2024-11-12 09:54:05 +08:00
bjzhjing	a7353bbaa4	Refine performance directory (#1017 ) Signed-off-by: Cathy Zhang <cathy.zhang@intel.com>	2024-11-11 13:58:46 +08:00
lkk	3372b9d480	update accuracy embedding endpoint for no wrapper (#1056 ) Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-11-04 09:18:49 +08:00
CharleneHu-42	7669c42085	Update ChatQnA README to add benchmark launcher (#958 ) Signed-off-by: CharleneHu-42 <yabai.hu@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Yi Yao <yi.a.yao@intel.com>	2024-10-18 13:33:20 +08:00
lvliang-intel	778afb50ac	Clean no wrapper image in performance benchmark manifests (#955 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2024-10-15 18:21:53 +08:00
lkk	088ab98f31	update examples accuracy (#941 ) Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-10-14 13:20:50 +08:00
lvliang-intel	619d941047	Set no wrapper ChatQnA as default (#891 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-10-11 13:30:45 +08:00
feng-intel	ae10712fe8	doc: Update ChatQnA/benchmark/performance doc (#930 )	2024-10-10 16:30:40 +08:00
Zhenzhong1	d16c80e493	[ChatQnA] manage your own ChatQnA pipelines. (#878 ) Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-09-30 17:01:44 +09:00
David Kinder	99c10933b4	doc: fix doc heading (#873 ) Signed-off-by: David B. Kinder <david.b.kinder@intel.com>	2024-09-26 12:33:57 +09:00
Zhenzhong1	c1038d2193	[ChatQnA] Deploy ChatQnA for benchmarking with different configurations. (#870 )	2024-09-25 16:47:44 +08:00
lvliang-intel	33b9d4e421	Remove redundant code and update tgi version (#871 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com>	2024-09-25 15:33:33 +08:00
David Kinder	3e796ba73d	doc: fix missing references to README.md (#860 ) Signed-off-by: David B. Kinder <david.b.kinder@intel.com>	2024-09-24 21:40:42 +08:00
lvliang-intel	3fb60608b3	Use official tei gaudi image and update tgi gaudi version (#810 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-09-23 17:52:56 +08:00
lvliang-intel	28f5e4a268	Add docker based benchmark instructions for ChatQnA (#859 ) Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-09-23 10:14:44 +08:00
ZePan110	21e215c5d5	Refine code scan output and remove opea_release_data.md. (#844 ) Signed-off-by: ZePan110 <ze.pan@intel.com>	2024-09-19 17:34:55 +08:00
lkk	f04f061f8c	move evaluation scripts (#842 ) Co-authored-by: root <root@idc708073.jf.intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-09-19 15:59:13 +08:00
XinyaoWa	2f03a3a894	Align parameters for "max_token, repetition_penalty,presence_penalty,frequency_penalty" (#726 ) Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-09-19 14:15:25 +08:00
Zhenzhong1	933c3d3445	[ChatQnA] Update OOB with wrapper manifests. (#823 )	2024-09-19 11:03:10 +08:00
chen, suyue	e5affb93ab	update V1.0 benchmark manifest (#822 ) Co-authored-by: Zhenzhong1 <zhenzhong.xu@intel.com>	2024-09-18 10:36:33 +08:00
Sihan Chen	06696c8e58	[ChatQnA] Add no_wrapper benchmarking and update legacy manifests (#767 ) Co-authored-by: Zhenzhong1 <zhenzhong.xu@intel.com>	2024-09-14 16:17:15 +08:00
Zhenzhong1	36fb9a987d	[ChatQnA] Update benchmarking manifests (#766 ) Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-09-10 11:07:48 +08:00
David Kinder	67394b88fa	doc: fix headings and indenting (#748 ) * doc: fix headings and indenting * only one H1 header (for title) is allowed * fix indenting under ordered lists Signed-off-by: David B. Kinder <david.b.kinder@intel.com>	2024-09-06 12:59:33 +08:00
chen, suyue	947936ed7b	Update v0.9 RAG release data (#747 ) * run both xeon and gaudi when both hardware detect Signed-off-by: chensuyue <suyue.chen@intel.com> * add v0.9 RAG release data Signed-off-by: chensuyue <suyue.chen@intel.com> * update system summary Signed-off-by: chensuyue <suyue.chen@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: chensuyue <suyue.chen@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-09-06 09:55:55 +08:00
Zhenzhong1	ac3486038c	[ChatQnA] udate OOB & Tuned manifests (#738 ) * update OOB manifests * update tgi parameters * update OOB manifests for w/o rerank * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * update tgi parameters * update tgi parameters for v0.9 w/o rerank * update OOB manifests 2.0.4->2.0.1 for w/o rerank * update tuned manifests * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * update tuned manifests --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-09-05 22:23:51 +08:00
chen, suyue	e0bc5f2a4d	update logs from standard cd perf workflow (#733 ) Signed-off-by: chensuyue <suyue.chen@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-09-05 14:58:24 +08:00
Zhenzhong1	6730b242cc	[ChatQnA] Update retrieval & dataprep manifests (#717 ) * modify tgi hyperparameters * upgrade tgi 2.0.1 to 2.0.4 * Update dataprep-microservice_run.yaml * Update retrieval-microservice_run.yaml * Update retrieval-microservice_run.yaml * Update dataprep-microservice_run.yaml * Update dataprep-microservice_run.yaml * Update dataprep-microservice_run.yaml * Update retrieval-microservice_run.yaml * Update retrieval-microservice_run.yaml	2024-09-04 19:50:46 +08:00
Letong Han	4a51874e4d	update readme for w/o rerank (#731 ) Signed-off-by: letonghan <letong.han@intel.com>	2024-09-04 15:01:30 +08:00
Letong Han	55d287dfcf	update readme to fix input length (#720 ) Signed-off-by: letonghan <letong.han@intel.com>	2024-09-03 19:01:28 +08:00
Zhenzhong1	3563f5db6b	[ChatQnA]Update manifests (#716 ) * update manifests for v0.9	2024-09-03 15:24:54 +08:00
David Kinder	c6d811ab11	doc: remove invalid code block language (#705 ) Signed-off-by: David B. Kinder <david.b.kinder@intel.com>	2024-09-03 10:10:56 +08:00
lvliang-intel	af21e94a29	Add benchmark README for ChatQnA (#662 ) * Add benchmark README for ChatQnA Signed-off-by: lvliang-intel <liang1.lv@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add benchmark.yaml Signed-off-by: lvliang-intel <liang1.lv@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * update yaml path Signed-off-by: lvliang-intel <liang1.lv@intel.com> * fix preci issue Signed-off-by: lvliang-intel <liang1.lv@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * update title Signed-off-by: lvliang-intel <liang1.lv@intel.com> --------- Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-08-26 22:39:57 +08:00
Zhenzhong1	e6b4fff05c	Update the number of microservice replicas for OPEA v0.9 (#645 )	2024-08-22 09:48:47 +08:00
Zhenzhong1	ba78b4c994	update manifests for v0.9 (#632 ) * update model HF TOKEN variables & reranking name for v0.9	2024-08-20 14:35:14 +08:00
Ying Chun Guo	4fd3517f23	update benchmark manifest to fix errors (#626 ) Signed-off-by: Yingchun Guo <yingchun.guo@intel.com>	2024-08-19 21:59:26 +08:00
Zhenzhong1	08f57fa54a	update manifests for v0.9 (#623 ) Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-08-19 15:55:04 +08:00
lkk	02a15366bc	update chatqna dataprep-redis port (#557 ) Co-authored-by: sdp <sdp@srf797636.jf.intel.com>	2024-08-08 22:23:32 +08:00

51 Commits