ZePan110
581e954a8d
Integrate ChatQnA set_env to ut scripts and add README.md for UT scripts. ( #1971 )
...
Signed-off-by: ZePan110 <ze.pan@intel.com >
2025-05-20 13:42:18 +08:00
Louie Tsai
e8cdf7d668
[ChatQnA] update to the latest Grafana Dashboard ( #1728 )
...
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
2025-04-03 12:14:55 -07:00
Louie Tsai
0736912c69
change gaudi node exporter from default one to 41612 ( #1702 )
...
Signed-off-by: Louie Tsai <louie.tsai@intel.com >
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
2025-03-20 21:38:24 -07:00
James Edwards
527b146a80
Add final README.md and set_env.sh script for quickstart review. Previous pull request was 1595. ( #1662 )
...
Signed-off-by: Edwards, James A <jaedwards@habana.ai >
Co-authored-by: Edwards, James A <jaedwards@habana.ai >
2025-03-14 16:05:01 -07:00
Louie Tsai
671dff7f51
[ChatQnA] Enable Prometheus and Grafana with telemetry docker compose file. ( #1623 )
...
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
2025-03-13 23:18:29 -07:00
chen, suyue
81b02bb947
Revert "HUGGINGFACEHUB_API_TOKEN environment is change to HF_TOKEN (#… ( #1521 )
...
Revert this PR since the test is not triggered properly due to the false merge of a WIP CI PR, 44a689b0bf , which block the CI test.
This change will be submitted in another PR.
2025-02-11 18:36:12 +08:00
xiguiw
45d5da2ddd
HUGGINGFACEHUB_API_TOKEN environment is change to HF_TOKEN ( #1503 )
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com >
2025-02-09 20:33:06 +08:00
Liang Lv
9adf7a6af0
Add support for latest deepseek models on Gaudi ( #1491 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2025-02-05 08:30:04 +08:00
Wang, Kai Lawrence
3d3ac59bfb
[ChatQnA] Update the default LLM to llama3-8B on cpu/gpu/hpu ( #1430 )
...
Update the default LLM to llama3-8B on cpu/nvgpu/amdgpu/gaudi for docker-compose deployment to avoid the potential model serving issue or the missing chat-template issue using neural-chat-7b.
Slow serving issue of neural-chat-7b on ICX: #1420
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com >
2025-01-20 22:47:56 +08:00
Louie Tsai
81022355a7
Enable OpenTelemetry Tracing for ChatQnA TGI serving on Gaudi ( #1316 )
...
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
2025-01-08 17:20:13 -08:00
Ying Hu
597f17b979
Update set_env.sh to fix LOGFLAG warning ( #1319 )
2024-12-30 10:54:26 +08:00
Louie Tsai
152adf8012
maintain a version info for docker_compose yaml files among release ( #1141 )
...
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
2024-11-17 22:39:41 -08:00
lvliang-intel
256b58c07e
Replace environment variables with service name for ChatQnA ( #977 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
2024-10-18 11:31:24 +08:00
lvliang-intel
619d941047
Set no wrapper ChatQnA as default ( #891 )
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-11 13:30:45 +08:00
Letong Han
6c364487d3
[ChatQnA] Add Nginx in Docker Compose and README ( #850 )
...
Signed-off-by: letonghan <letong.han@intel.com >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-19 20:39:58 +08:00
XinyaoWa
d73129cbf0
Refactor folder to support different vendors ( #743 )
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com >
Signed-off-by: chensuyue <suyue.chen@intel.com >
2024-09-10 23:27:19 +08:00