Struct LlmCall Prediction | NVIDIA NeMo Relay

Generated from cargo doc --no-deps -p nemo-relay -p nemo-relay-adaptive -p nemo-relay-pii-redaction -p nemo-relay-ffi -p nemo-relay-types -p nemo-relay-plugin -p nemo-relay-worker-proto -p nemo-relay-worker.

pub struct LlmCallPrediction {
    pub remaining_calls: PredictionMetrics,
    pub interarrival_ms: PredictionMetrics,
    pub output_tokens: PredictionMetrics,
    pub latency_sensitivity: Option<u32>,
}

Predictions for an LLM call at a given position in the call hierarchy.

Fields

`remaining_calls: PredictionMetrics`

How many more LLM calls are expected after this one.

`interarrival_ms: PredictionMetrics`

Expected time in milliseconds until the next LLM call.

`output_tokens: PredictionMetrics`

Expected output token count for this call.

`latency_sensitivity: Option<u32>`

Auto-computed latency sensitivity score from profiler analysis. None means no profiling data available - fall back to default.

Trait Implementations

`impl Clone for LlmCallPrediction`

impl Clone for LlmCallPrediction

`clone`

fn clone(&self) -> LlmCallPrediction

`clone_from`

fn clone_from(&mut self, source: &Self)

`impl Debug for LlmCallPrediction`

impl Debug for LlmCallPrediction

`fmt`

fn fmt(&self, f: &mut Formatter<'_>) -> Result

`impl Default for LlmCallPrediction`

impl Default for LlmCallPrediction

`default`

fn default() -> LlmCallPrediction

`impl<'de> Deserialize<'de> for LlmCallPrediction`

impl<'de> Deserialize<'de> for LlmCallPrediction

`deserialize`

fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
    __D: Deserializer<'de>,

`impl PartialEq for LlmCallPrediction`

impl PartialEq for LlmCallPrediction

`eq`

fn eq(&self, other: &LlmCallPrediction) -> bool

`ne`

fn ne(&self, other: &Rhs) -> bool

`impl Serialize for LlmCallPrediction`

impl Serialize for LlmCallPrediction

`serialize`

fn serialize<__S>(&self, __serializer: __S) -> Result<__S::Ok, __S::Error>where
    __S: Serializer,

`impl StructuralPartialEq for LlmCallPrediction`

impl StructuralPartialEq for LlmCallPrediction