Module pricing

Generated from cargo doc --no-deps -p nemo-relay -p nemo-relay-adaptive -p nemo-relay-ffi.

Data-driven LLM model pricing used to layer cost estimates onto usage.

Pricing is deliberately separate from response normalization so adding providers, aliases, or cache-accounting rules does not require editing AnnotatedLlmResponse.

Structs

ModelPricing: Per-token pricing for a model, expressed in USD per one million tokens.
PricingCatalog: Collection of model pricing entries.
PricingConfig: Runtime pricing resolver configuration.
PricingResolver: Ordered pricing lookup chain.
PromptCachePricing: Prompt-cache accounting rules for a model pricing entry.
TokenPricingRates: Token rates expressed as USD per one million tokens.
TokenRateTier: A token pricing tier selected by prompt/input token count.

Enums

CacheReadAccounting: How cache-read tokens relate to prompt token counts in provider usage.
PricingCatalogError: Errors produced while parsing or validating a pricing catalog.
PricingSourceConfig: Declarative pricing source supported by Relay configuration.
PricingUnit: Billing unit represented by a pricing entry.
RateScheduleApplication: How a selected rate-schedule tier applies to billable usage.
TokenRateSchedule: Data-driven token rate schedule for provider pricing with request thresholds.

Traits

PricingSource: Pluggable pricing source interface.

Functions

active_pricing_resolver: Returns the active process-wide pricing resolver.
attach_estimated_cost: Adds a model-pricing estimate to a normalized response when cost is missing.
attach_estimated_cost_for_provider: Adds a provider-aware model-pricing estimate to a normalized response when cost is missing.
estimate_cost: Estimates USD cost for a model/usage pair when pricing is known.
estimate_cost_for_provider: Estimates USD cost for a provider/model pair when pricing is known.
estimate_cost_with_catalog: Estimates USD cost using the provided catalog.
estimate_cost_with_provider: Estimates USD cost using the provided catalog and provider/model pair.
infer_model_provider: Infers a provider/route value for a decoded model.
pricing_for_model: Returns known pricing for a model ID.
pricing_for_provider: Returns known pricing for a provider/model pair.
reset_active_pricing_resolver: Restores the active process-wide pricing resolver to an empty resolver.
set_active_pricing_resolver: Replaces the active process-wide pricing resolver.

Structs

ModelPricing: Per-token pricing for a model, expressed in USD per one million tokens.

PricingCatalog: Collection of model pricing entries.

PricingConfig: Runtime pricing resolver configuration.

PricingResolver: Ordered pricing lookup chain.

PromptCachePricing: Prompt-cache accounting rules for a model pricing entry.

TokenPricingRates: Token rates expressed as USD per one million tokens.

TokenRateTier: A token pricing tier selected by prompt/input token count.

Enums

CacheReadAccounting: How cache-read tokens relate to prompt token counts in provider usage.

PricingCatalogError: Errors produced while parsing or validating a pricing catalog.

PricingSourceConfig: Declarative pricing source supported by Relay configuration.

PricingUnit: Billing unit represented by a pricing entry.

RateScheduleApplication: How a selected rate-schedule tier applies to billable usage.

TokenRateSchedule: Data-driven token rate schedule for provider pricing with request thresholds.

Functions

active_pricing_resolver: Returns the active process-wide pricing resolver.

attach_estimated_cost: Adds a model-pricing estimate to a normalized response when cost is missing.

attach_estimated_cost_for_provider: Adds a provider-aware model-pricing estimate to a normalized response when cost is missing.

estimate_cost: Estimates USD cost for a model/usage pair when pricing is known.

estimate_cost_for_provider: Estimates USD cost for a provider/model pair when pricing is known.

estimate_cost_with_catalog: Estimates USD cost using the provided catalog.

estimate_cost_with_provider: Estimates USD cost using the provided catalog and provider/model pair.

infer_model_provider: Infers a provider/route value for a decoded model.

pricing_for_model: Returns known pricing for a model ID.

pricing_for_provider: Returns known pricing for a provider/model pair.

reset_active_pricing_resolver: Restores the active process-wide pricing resolver to an empty resolver.

set_active_pricing_resolver: Replaces the active process-wide pricing resolver.