sri-intel
c63e2cd067
Remote inference support for examples in Productivity suite ( #1818 )
...
Signed-off-by: Srinarayan Srikanthan <srinarayan.srikanthan@intel.com >
2025-04-18 14:36:57 +08:00
pre-commit-ci[bot]
094ca7aefe
[pre-commit.ci] pre-commit autoupdate ( #1771 )
...
Signed-off-by: Sun, Xuehao <xuehao.sun@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Sun, Xuehao <xuehao.sun@intel.com >
Co-authored-by: Abolfazl Shahbazi <12436063+ashahba@users.noreply.github.com >
2025-04-09 11:51:57 -07:00
XinyaoWa
6d24c1c77a
Merge FaqGen into ChatQnA ( #1654 )
...
1. Delete FaqGen
2. Refactor FaqGen into ChatQnA, serve as a LLM selection.
3. Combine all ChatQnA related Dockerfile into one
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
2025-03-20 17:40:00 +08:00
Spycsh
ce38a84372
Revert chatqna async and enhance tests ( #1598 )
...
align with opea-project/GenAIComps#1354
2025-03-03 23:03:44 +08:00
XinyaoWa
78f8ae524d
Fix async in chatqna bug ( #1589 )
...
Algin async with comps: related PR: opea-project/GenAIComps#1300
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
2025-02-27 23:32:29 +08:00
Eero Tamminen
23a77df302
Fix "OpenAI" & "response" spelling ( #1561 )
2025-02-25 12:45:21 +08:00
Wang, Kai Lawrence
3d3ac59bfb
[ChatQnA] Update the default LLM to llama3-8B on cpu/gpu/hpu ( #1430 )
...
Update the default LLM to llama3-8B on cpu/nvgpu/amdgpu/gaudi for docker-compose deployment to avoid the potential model serving issue or the missing chat-template issue using neural-chat-7b.
Slow serving issue of neural-chat-7b on ICX: #1420
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com >
2025-01-20 22:47:56 +08:00
XinyaoWa
464e2d3125
Rename streaming to stream to align with OpenAI API ( #1332 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
2025-01-06 13:25:55 +08:00
lkk
2af1ea0f8e
remove examples gateway. ( #1243 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-12-13 15:16:11 +08:00
lkk
bde285dfce
move examples gateway ( #992 )
...
Co-authored-by: root <root@idc708073.jf.intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Sihan Chen <39623753+Spycsh@users.noreply.github.com >
2024-12-06 14:40:25 +08:00
Sihan Chen
abd9d12937
Fix non stream case ( #1115 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-11-11 14:18:42 +08:00
XinyaoWa
c65d7d40fb
fix vllm output in chatqna ( #1038 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
2024-11-01 09:26:57 +08:00
lvliang-intel
0eedbbfce0
Update aipc ollama docker compose and readme ( #984 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com >
2024-10-22 10:30:47 +08:00
lvliang-intel
256b58c07e
Replace environment variables with service name for ChatQnA ( #977 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2024-10-18 11:31:24 +08:00
Sihan Chen
4a265abb73
Fix top_n rerank docs ( #976 )
2024-10-17 15:49:16 +08:00
Sihan Chen
b0487fe92b
fix chatqna accuracy issue with incorrect penalty ( #974 )
2024-10-17 15:48:44 +08:00
lvliang-intel
619d941047
Set no wrapper ChatQnA as default ( #891 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-11 13:30:45 +08:00
XinyaoWa
d73129cbf0
Refactor folder to support different vendors ( #743 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
Signed-off-by: chensuyue <suyue.chen@intel.com >
2024-09-10 23:27:19 +08:00
Tian, Feng
169fe96332
GenAIExample code structure reorg ( #207 )
...
Signed-off-by: Tian, Feng <feng.tian@intel.com >
Signed-off-by: chensuyue <suyue.chen@intel.com >
2024-05-30 00:13:49 +08:00
lvliang-intel
a6b3caf128
Refactor example code ( #183 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Signed-off-by: Yue, Wenjiao <wenjiao.yue@intel.com >
Signed-off-by: chensuyue <suyue.chen@intel.com >
2024-05-24 13:32:14 +08:00