llmd run_multiturn_benchmark

Runs a multi-turn benchmark job against the LLM inference service

Parameters

endpoint_url

  • Endpoint URL for the LLM inference service to benchmark

name

  • Name of the benchmark job

  • default value: multi-turn-benchmark

namespace

  • Namespace to run the benchmark job in (empty string auto-detects current namespace)

image

  • Container image for the benchmark

  • default value: quay.io/hayesphilip/multi-turn-benchmark

version

  • Version tag for the benchmark image

  • default value: 0.0.1

timeout

  • Timeout in seconds to wait for job completion

  • default value: 900

parallel

  • Number of parallel connections

  • default value: 9