[Readme] Update ChatQnA Readme for LLM Endpoint (#1086)

Signed-off-by: letonghan <letong.han@intel.com>
2024-11-11 13:53:06 +08:00
parent 3744bb8c1b
commit aa314f6757
4 changed files with 14 additions and 20 deletions
--- a/ChatQnA/docker_compose/nvidia/gpu/README.md
+++ b/ChatQnA/docker_compose/nvidia/gpu/README.md
@@ -238,9 +238,9 @@ docker compose up -d
   Then try the `cURL` command below to validate TGI.

   ```bash
-   curl http://${host_ip}:8008/generate \
+   curl http://${host_ip}:9009/v1/chat/completions \
     -X POST \
-     -d '{"inputs":"What is Deep Learning?","parameters":{"max_new_tokens":64, "do_sample": true}}' \
+     -d '{"model": "Intel/neural-chat-7b-v3-3", "messages": [{"role": "user", "content": "What is Deep Learning?"}], "max_tokens":17}' \
     -H 'Content-Type: application/json'
   ```
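
For scripted checks, the updated request can be wrapped so that the response is inspected instead of only printed. The following is a minimal sketch and not part of this commit. The helper names `build_payload` and `validate_llm` are hypothetical. It assumes that `host_ip` is exported as in the README, and that the endpoint returns an OpenAI-style chat completion body containing a `choices` field.

```bash
#!/usr/bin/env bash
# Hypothetical helpers for illustration; they are not defined by the ChatQnA repo.

# Build the chat-completions request body for a prompt and a token limit.
# The prompt is inserted verbatim, so it must not contain double quotes
# or backslashes (no JSON escaping is done here).
build_payload() {
  local prompt="$1" max_tokens="${2:-17}"
  printf '{"model": "Intel/neural-chat-7b-v3-3", "messages": [{"role": "user", "content": "%s"}], "max_tokens": %s}' \
    "$prompt" "$max_tokens"
}

# Send the request and succeed only if curl reports HTTP success and the
# body contains a "choices" field (assumed OpenAI-style response shape).
validate_llm() {
  local url="http://${host_ip}:9009/v1/chat/completions"
  curl --fail --silent --show-error --max-time 60 "$url" \
    -X POST \
    -H 'Content-Type: application/json' \
    -d "$(build_payload "What is Deep Learning?")" \
    | grep -q '"choices"'
}
```

After sourcing the file, `validate_llm && echo "LLM endpoint OK"` gives a pass/fail signal that is usable in a deployment script. If the request fails, `curl` prints its error and `grep` sees no matching body, so the function returns a non-zero status.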