Sequence Length Distributions for Advanced Benchmarking
Sequence length distributions allow you to specify complex patterns of input sequence length (ISL) and output sequence length (OSL) pairs with different probabilities and optional variance. This enables benchmarking of multiple use cases such as summarization and Q&A on one endpoint.
Overview
The sequence distribution feature provides benchmarking of mixed workloads with different ISL and OSL pairings.
Basic Usage
Example command
Add variance to make workloads more realistic:
This creates:
- 70% of requests with ISL ~ Normal(64, 10), OSL ~ Normal(32, 8)
- 20% of requests with ISL ~ Normal(256, 40), OSL ~ Normal(128, 20)
- 10% of requests with ISL ~ Normal(1024, 100), OSL ~ Normal(512, 50)
Values are automatically clamped to be at least 1.
Supported Formats
1. Semicolon Format (Recommended)
Basic:
With standard deviations:
2. Bracket Format
Basic:
With standard deviations:
3. JSON Format
Basic:
With standard deviations:
Examples
Example Case: Chatbot Workload Simulation
Sample Output (Successful Run):