llmd run_multiturn_benchmark

Runs a multi-turn benchmark job against the LLM inference service

Parameters

endpoint_url

Endpoint URL for the LLM inference service to benchmark

name

Name of the benchmark job
default value: multi-turn-benchmark

namespace

Namespace to run the benchmark job in (empty string auto-detects current namespace)

image

Container image for the benchmark
default value: quay.io/hayesphilip/multi-turn-benchmark

version

Version tag for the benchmark image
default value: 0.0.1

timeout

Timeout in seconds to wait for job completion
default value: 900

parallel

Number of parallel connections
default value: 9