[Readme] Update ChatQnA Readme for LLM Endpoint (#1086)

Signed-off-by: letonghan <letong.han@intel.com>
Letong Han
2024-11-11 13:53:06 +08:00
committed by GitHub
parent 3744bb8c1b
commit aa314f6757
4 changed files with 14 additions and 20 deletions

@@ -238,9 +238,9 @@ docker compose up -d
Then try the `cURL` command below to validate TGI.
```bash
-curl http://${host_ip}:8008/generate \
+curl http://${host_ip}:9009/v1/chat/completions \
 -X POST \
-  -d '{"inputs":"What is Deep Learning?","parameters":{"max_new_tokens":64, "do_sample": true}}' \
+  -d '{"model": "Intel/neural-chat-7b-v3-3", "messages": [{"role": "user", "content": "What is Deep Learning?"}], "max_tokens":17}' \
 -H 'Content-Type: application/json'
-H 'Content-Type: application/json'
```
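
For readers who prefer to script the check, the request to `/v1/chat/completions` could be sketched in Python roughly as follows. This is only an illustration, not part of the commit: the helper names `build_chat_request`, `extract_reply`, and `validate_tgi` are hypothetical, and the response layout (`choices[0].message.content`) assumes the endpoint returns an OpenAI-style chat completion body.

```python
import json
import urllib.request


def build_chat_request(host_ip, prompt,
                       model="Intel/neural-chat-7b-v3-3",
                       max_tokens=17, port=9009):
    """Build a POST request mirroring the cURL command (hypothetical helper)."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": max_tokens,
    }
    return urllib.request.Request(
        url=f"http://{host_ip}:{port}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def extract_reply(response_body):
    """Return the assistant text, assuming an OpenAI-style response body."""
    return response_body["choices"][0]["message"]["content"]


def validate_tgi(host_ip, prompt="What is Deep Learning?"):
    """Send the request and return the generated text (needs a running service)."""
    request = build_chat_request(host_ip, prompt)
    with urllib.request.urlopen(request, timeout=30) as response:
        return extract_reply(json.load(response))
```

Calling `validate_tgi(host_ip)` with the same `host_ip` value used in the cURL command should return a short generated string if the service is up; a connection error suggests the container is not yet ready or the port differs on your deployment.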