Triton Management Service is an application that helps users manage and orchestrate a fleet of Triton Inference Servers in a Kubernetes Cluster.
Contents
Getting Started
Key Concepts
- Model Repositories
- Image Allowlist
- Leases
- Autoscaling Leases
- Triton Pools & Quota Base Shared Tritons
- TMS Metrics
- TMS GRPC API Package
Reference
Release Notes