> For clean Markdown of any page, append .md to the page URL.
> For a complete documentation index, see https://docs.nvidia.com/nemo/relay/llms.txt.
> For full documentation content, see https://docs.nvidia.com/nemo/relay/llms-full.txt.
> For AI client integration (Claude Code, Cursor, etc.), connect to the MCP server at https://docs.nvidia.com/nemo/relay/_mcp/server.

# Struct LlmCall Prediction

> Predictions for an LLM call at a given position in the call hierarchy.

Generated from `cargo doc --no-deps -p nemo-relay -p nemo-relay-adaptive -p nemo-relay-ffi`.

<pre />

Predictions for an LLM call at a given position in the call hierarchy.

## Fields

### `remaining_calls: PredictionMetrics`

How many more LLM calls are expected after this one.

### `interarrival_ms: PredictionMetrics`

Expected time in milliseconds until the next LLM call.

### `output_tokens: PredictionMetrics`

Expected output token count for this call.

### `latency_sensitivity: Option<u32>`

Auto-computed latency sensitivity score from profiler analysis. `None` means no profiling data available - fall back to default.

## Trait Implementations

### `impl Clone for LlmCallPrediction`

<pre />

#### `clone`

<pre />

#### `clone_from`

<pre />

### `impl Debug for LlmCallPrediction`

<pre />

#### `fmt`

<pre />

### `impl Default for LlmCallPrediction`

<pre />

#### `default`

<pre />

### `impl<'de> Deserialize<'de> for LlmCallPrediction`

<pre />

#### `deserialize`

<pre />

### `impl PartialEq for LlmCallPrediction`

<pre />

#### `eq`

<pre />

#### `ne`

<pre />

### `impl Serialize for LlmCallPrediction`

<pre />

#### `serialize`

<pre />

### `impl StructuralPartialEq for LlmCallPrediction`

<pre />