Files

pre-commit-ci[bot] 065222f29b [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

2024-10-22 08:19:15 +00:00

753 B

Raw Blame History

Benchmarking Deployment

This document guides you through deploying this example pipelines using Helm charts. Helm charts simplify managing Kubernetes applications by packaging configuration and resources.

Getting Started

Preparation

# on k8s-master node
cd GenAIExamples/{example_name}/benchmark/performance/helm_charts

# Replace the key of HUGGINGFACEHUB_API_TOKEN with your actual Hugging Face token:
# vim hpu_with_rerank.yaml or hpu_without_rerank.yaml
HUGGINGFACEHUB_API_TOKEN: hf_xxxxx

Deployment

# Options:
# --num_nodes choices=[1, 2, 4, 8]
# --mode choices=["tuned", "oob"]
# --workflow choices=["with_rerank", "without_rerank"]
python deployment.py --workflow=with_rerank --mode=tuned --num_nodes=1

753 B Raw Blame History

Benchmarking Deployment

Getting Started

Preparation

Deployment

753 B

Raw Blame History