Profile with SpecBench Dataset
AIPerf supports benchmarking using the SpecBench dataset, which contains diverse questions across writing, reasoning, math, and coding categories. This dataset is commonly used for evaluating speculative decoding methods.
This guide covers profiling OpenAI-compatible chat completions endpoints using the SpecBench public dataset.
Start a vLLM Server
Launch a vLLM server with a chat model:
Verify the server is ready:
Profile with SpecBench Dataset
AIPerf downloads the SpecBench JSONL file from GitHub and uses the first turn of each question as a single-turn prompt.
Sample Output (Successful Run):