kserve deploy_model
Deploy a KServe model
Parameters
namespace
The namespace in which the model should be deployed
runtime
Name of the runtime (standalone-tgis or vllm)
model_name
The name to give to the serving runtime
sr_name
The name of the ServingRuntime object
sr_kserve_image
The image of the Kserve serving runtime container
inference_service_name
The name to give to the inference service
inference_service_min_replicas
The minimum number of replicas. If none, the field is left unset.
type: Int
delete_others
If True, deletes the other serving runtime/inference services of the namespace
default value:
True
raw_deployment
If True, do not try to configure anything related to Serverless.