# Performance measurement tests with LangSmith

Prerequisite: Sign up for [LangSmith](https://www.langchain.com/langsmith) and obtain an API token.
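
The notebook reads the token from its configuration cell and the script takes it via the `-lt` flag, but it can be convenient to keep it available inside the container as environment variables. A minimal sketch, assuming the conventional LangSmith variable names (nothing in this directory requires them):

    # Conventional LangSmith environment variables (assumed convenience only;
    # the notebook and script here take the key explicitly).
    export LANGCHAIN_API_KEY="<your LangSmith API token>"
    export LANGCHAIN_TRACING_V2="true"
    export LANGCHAIN_ENDPOINT="https://api.smith.langchain.com"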

## Steps to run performance measurements with the `tgi_gaudi.ipynb` Jupyter notebook

1. This directory is mounted at `/test` in the `qna-rag-redis-server` container.
2. Make sure the Redis container and the LLM serving endpoint are up and running (a quick health-check sketch follows this list).
3. Enter the `qna-rag-redis-server` container and start the Jupyter notebook server (you can specify the IP address to bind to; Jupyter will listen on port 8888):

       docker exec -it qna-rag-redis-server bash
       cd /test
       jupyter notebook --allow-root --ip=X.X.X.X

4. Open the Jupyter server in your browser (http://X.X.X.X:8888) and open the `tgi_gaudi.ipynb` notebook.
5. Update all the configuration parameters in the second cell of the notebook.
6. Clear all the cells, then run all the cells.
7. The last cell calls `client.run_on_dataset()`, which runs the LangChain Q&A test and records the measurements on the LangSmith server. The URL for viewing the test results can be found in that cell's output.

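For step 2, a quick way to sanity-check the services is sketched below. The container name filter, host, and port are assumptions to adjust for your deployment; the request shown follows the standard TGI `/generate` API (a vLLM endpoint exposes a different, OpenAI-compatible API instead).

    # Check that a Redis container is running (container name pattern assumed).
    docker ps --filter "name=redis"

    # Check that the TGI serving endpoint answers; host and port are placeholders.
    curl -X POST http://<llm-serving-host>:<port>/generate \
      -H 'Content-Type: application/json' \
      -d '{"inputs": "Hello", "parameters": {"max_new_tokens": 16}}'
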
## Steps to run performance measurements with the `end_to_end_rag_test.py` Python script

1. This directory is mounted at `/test` in the `qna-rag-redis-server` container.
2. Make sure the Redis container and the LLM serving endpoint are up and running (see the health-check sketch above).
3. Enter the `qna-rag-redis-server` container and run the Python script (a filled-in example follows this list):

       docker exec -it qna-rag-redis-server bash
       cd /test
       python end_to_end_rag_test.py -l "<LLM model serving - TGI or VLLM>" -e <TEI embedding model serving> -m <LLM model name> -ht "<huggingface token>" -lt <langsmith api key> -dbs "<path to schema>" -dbu "<redis server URL>" -dbi "<DB Index name>" -d "<langsmith dataset name>"

4. Check the results on the LangSmith server.
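
For illustration only, a filled-in invocation might look like the sketch below. Every value is a hypothetical example (endpoints, model name, tokens, schema path, Redis URL, index and dataset names), and the exact format each flag expects (for example, whether `-l` and `-e` take an endpoint URL or a backend name) should be confirmed against the script's help output.

    # All values below are hypothetical examples -- replace them with your own.
    python end_to_end_rag_test.py \
      -l "http://localhost:8080" \
      -e "http://localhost:6006" \
      -m "Intel/neural-chat-7b-v3-3" \
      -ht "hf_xxxxxxxxxxxxxxxx" \
      -lt "ls__xxxxxxxxxxxxxxxx" \
      -dbs "schema.yml" \
      -dbu "redis://localhost:6379" \
      -dbi "rag-redis" \
      -d "rag_dataset"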