Fixed schedule benchmarking provides precise timing control by executing requests at specific timestamps. This mode is ideal for simulating exact traffic patterns, testing temporal performance characteristics, and reproducing time-sensitive scenarios.
Fixed schedule mode enables:
Fixed schedule files use JSONL format with timestamp-based entries:
Field Descriptions:
timestamp: Milliseconds from schedule start when request should be sentinput_length: Number of tokens in the input promptinput_text: Exact text to send in the request (provided instead of input_length)output_length: Maximum number of tokens in the response (optional)hash_ids: Hash block identifiers to simulate text reuse with 512-token blocks (optional)Sample Output (Successful Run):
Key Parameters:
--fixed-schedule-auto-offset: Automatically adjusts timestamps to start from 0Execute only a portion of the schedule using start and end offsets:
Sample Output (Successful Run):
Windowing Parameters:
--fixed-schedule-start-offset 2000: Start execution at 2000ms timestamp--fixed-schedule-end-offset 4000: End execution at 4000ms timestampWhen to Use Fixed Schedule Benchmarking: