Module pricing
Generated from cargo doc --no-deps -p nemo-relay -p nemo-relay-adaptive -p nemo-relay-ffi.
Data-driven LLM model pricing used to layer cost estimates onto usage.
Pricing is deliberately separate from response normalization so adding providers, aliases, or cache-accounting rules does not require editing AnnotatedLlmResponse.
Structs
- ModelPricing: Per-token pricing for a model, expressed in USD per one million tokens.
- PricingCatalog: Collection of model pricing entries.
- PricingConfig: Runtime pricing resolver configuration.
- PricingResolver: Ordered pricing lookup chain.
- PromptCachePricing: Prompt-cache accounting rules for a model pricing entry.
- TokenPricingRates: Token rates expressed as USD per one million tokens.
- TokenRateTier: A token pricing tier selected by prompt/input token count.
Enums
- CacheReadAccounting: How cache-read tokens relate to prompt token counts in provider usage.
- PricingCatalogError: Errors produced while parsing or validating a pricing catalog.
- PricingSourceConfig: Declarative pricing source supported by Relay configuration.
- PricingUnit: Billing unit represented by a pricing entry.
- RateScheduleApplication: How a selected rate-schedule tier applies to billable usage.
- TokenRateSchedule: Data-driven token rate schedule for provider pricing with request thresholds.
Traits
- PricingSource: Pluggable pricing source interface.
Functions
- active_pricing_resolver: Returns the active process-wide pricing resolver.
- attach_estimated_cost: Adds a model-pricing estimate to a normalized response when cost is missing.
- attach_estimated_cost_for_provider: Adds a provider-aware model-pricing estimate to a normalized response when cost is missing.
- estimate_cost: Estimates USD cost for a model/usage pair when pricing is known.
- estimate_cost_for_provider: Estimates USD cost for a provider/model pair when pricing is known.
- estimate_cost_with_catalog: Estimates USD cost using the provided catalog.
- estimate_cost_with_provider: Estimates USD cost using the provided catalog and provider/model pair.
- infer_model_provider: Infers a provider/route value for a decoded model.
- pricing_for_model: Returns known pricing for a model ID.
- pricing_for_provider: Returns known pricing for a provider/model pair.
- reset_active_pricing_resolver: Restores the active process-wide pricing resolver to an empty resolver.
- set_active_pricing_resolver: Replaces the active process-wide pricing resolver.