chyundunovDatamonsters
bb7a675665
ChatQnA - refactoring README.md for deploy application on ROCm ( #1857 )
...
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com >
Signed-off-by: Chingis Yundunov <c.yundunov@datamonsters.com >
Co-authored-by: Chingis Yundunov <YundunovCN@sibedge.com >
Co-authored-by: Artem Astafev <a.astafev@datamonsters.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-04-25 08:52:24 +08:00
chyundunovDatamonsters
c50dfb2510
Adding files to deploy ChatQnA application on ROCm vLLM ( #1560 )
...
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com >
2025-04-03 17:19:26 +08:00
Xiaotian Chen
1bd56af994
Update TGI image versions ( #1625 )
...
Signed-off-by: xiaotia3 <xiaotian.chen@intel.com >
2025-04-01 11:27:51 +08:00
XinyaoWa
6d24c1c77a
Merge FaqGen into ChatQnA ( #1654 )
...
1. Delete FaqGen
2. Refactor FaqGen into ChatQnA, serve as a LLM selection.
3. Combine all ChatQnA related Dockerfile into one
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
2025-03-20 17:40:00 +08:00
Artem Astafev
6abf7652e8
Fix ChatQnA ROCm compose Readme file and absolute path for ROCM CI test ( #1159 )
...
Signed-off-by: Artem Astafev <a.astafev@datamonsters.com >
2025-02-27 15:26:45 +08:00
Wang, Kai Lawrence
3d3ac59bfb
[ChatQnA] Update the default LLM to llama3-8B on cpu/gpu/hpu ( #1430 )
...
Update the default LLM to llama3-8B on cpu/nvgpu/amdgpu/gaudi for docker-compose deployment to avoid the potential model serving issue or the missing chat-template issue using neural-chat-7b.
Slow serving issue of neural-chat-7b on ICX: #1420
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com >
2025-01-20 22:47:56 +08:00
Liang Lv
0f7e5a37ac
Adapt code for dataprep microservice refactor ( #1408 )
...
https://github.com/opea-project/GenAIComps/pull/1153
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2025-01-20 20:37:03 +08:00
Letong Han
4cabd55778
Refactor Retrievers related Examples ( #1387 )
...
Delete redundant retrievers docker image in docker_images_list.md.
Refactor Retrievers related Examples READMEs.
Change all of the comps/retrievers/xxx/xxx/Dockerfile path into comps/retrievers/src/Dockerfile.
Fix the Examples CI issues of PR opea-project/GenAIComps#1138 .
Signed-off-by: letonghan <letong.han@intel.com >
2025-01-16 14:21:48 +08:00
Liang Lv
3ca78867eb
Update example code for embedding dependency moving to 3rd_party ( #1368 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2025-01-10 15:36:58 +08:00
chen, suyue
5c7a5bd850
Update Code and README for GenAIComps Refactor ( #1285 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Signed-off-by: chensuyue <suyue.chen@intel.com >
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
Signed-off-by: letonghan <letong.han@intel.com >
Signed-off-by: ZePan110 <ze.pan@intel.com >
Signed-off-by: WenjiaoYue <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>
2025-01-02 20:03:26 +08:00
Wang, Kai Lawrence
ac470421d0
Update the llm backend ports ( #1172 )
...
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com >
2024-11-22 09:20:09 +08:00
Artem Astafev
6d3a017609
Add compose example for ChatQnA AMD ROCm deployment ( #1122 )
...
Signed-off-by: Artem Astafev <a.astafev@datamonsters.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-15 17:24:06 +08:00