nemo_relay.adaptive
Generated from python/nemo_relay/adaptive.pyi.
Module nemo_relay.adaptive.
Adaptive plugin configuration helpers.
Classes
ConfigDiagnostic(TypedDict)
One adaptive validation diagnostic.
ConfigReport(TypedDict)
Validation report for adaptive configuration.
ConfigPolicy
Policy for unsupported adaptive configuration.
Methods
to_dict
Serialize this policy to the canonical JSON object shape.
BackendSpec
Adaptive state backend selection.
Methods
in_memory
Return an in-memory adaptive backend spec.
redis
Return a Redis adaptive backend spec.
to_dict
Serialize this backend spec to the canonical JSON object shape.
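The factory-plus-to_dict pattern described for BackendSpec might look roughly like the sketch below. This reference does not show the real signatures or the canonical JSON shape, so the example uses a hypothetical stand-in class: the name StandInBackendSpec, the fields kind and url, and the url argument to redis are all assumptions made for illustration, not the nemo_relay API.

```python
import json
from dataclasses import dataclass
from typing import Any, Dict, Optional


@dataclass(frozen=True)
class StandInBackendSpec:
    """Hypothetical stand-in; NOT the real nemo_relay.adaptive.BackendSpec."""

    kind: str                  # assumed discriminator field
    url: Optional[str] = None  # assumed Redis connection setting

    @classmethod
    def in_memory(cls) -> "StandInBackendSpec":
        # Factory for a backend that keeps adaptive state in process memory.
        return cls(kind="in_memory")

    @classmethod
    def redis(cls, url: str) -> "StandInBackendSpec":
        # Factory for a Redis-backed spec; the url parameter is an assumption.
        return cls(kind="redis", url=url)

    def to_dict(self) -> Dict[str, Any]:
        # Emit only the fields that are set, so the result is JSON-ready.
        out: Dict[str, Any] = {"kind": self.kind}
        if self.url is not None:
            out["url"] = self.url
        return out


memory_spec = StandInBackendSpec.in_memory().to_dict()
redis_spec = StandInBackendSpec.redis("redis://localhost:6379/0").to_dict()

# Both results are plain dicts and serialize directly with the json module.
encoded = json.dumps(redis_spec, sort_keys=True)
```

Keeping construction in named factories and serialization in to_dict means callers never build the JSON object by hand. Check the stub file for the actual parameters before relying on any field name used here.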
StateConfig
Adaptive state configuration.
Methods
to_dict
Serialize this state config to the canonical JSON object shape.
TelemetryConfig
Built-in adaptive telemetry subscriber settings.
Methods
to_dict
Serialize this telemetry config to the canonical JSON object shape.
AdaptiveHintsConfig
Built-in adaptive hints injection settings.
Methods
to_dict
Serialize this adaptive-hints config to the canonical JSON object shape.
ToolParallelismConfig
Built-in adaptive tool scheduling settings.
Methods
to_dict
Serialize this tool-parallelism config to the canonical JSON object shape.
AcgStabilityThresholds
Prompt-stability classification thresholds for ACG.
Methods
to_dict
Serialize these ACG stability thresholds to the canonical JSON object shape.
AcgConfig
Adaptive Cache Governor settings.
Methods
to_dict
Serialize this ACG config to the canonical JSON object shape.
AdaptiveConfig
Canonical config document for the top-level adaptive component.
Methods
to_dict
Serialize this adaptive config to the canonical JSON object shape.
ComponentSpec
Top-level adaptive component wrapper.
Methods
to_dict
Serialize this component to the canonical plugin shape.
AdaptiveRuntime
Hosted adaptive runtime wrapper used by external framework integrations.
Methods
AdaptiveRuntime
Create an AdaptiveRuntime instance.
register
Register the configured adaptive features with NeMo Relay.
deregister
Deregister previously registered adaptive features.
shutdown
Deregister the runtime and release its owned resources.
wait_for_idle
Block until the adaptive telemetry drain has processed pending work.
report
Return the validation report associated with this runtime.
bind_scope
Bind this runtime’s ACG request rewriting to an active scope.
build_cache_request_facts
Build cache-diagnostics facts for an annotated request.
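The method descriptions above suggest a lifecycle of register, do work, wait_for_idle, then shutdown. The sketch below illustrates that ordering with a hypothetical stand-in class, since this reference does not list constructor arguments or say whether the real methods are synchronous. StandInRuntime, run_with_runtime, and the assumption that every method takes no arguments are illustrative only.

```python
from typing import Any, Callable, List


class StandInRuntime:
    """Hypothetical stand-in; NOT the real nemo_relay.adaptive.AdaptiveRuntime."""

    def __init__(self) -> None:
        self.registered = False
        self.closed = False
        self.calls: List[str] = []  # records call order for inspection

    def register(self) -> None:
        self.registered = True
        self.calls.append("register")

    def wait_for_idle(self) -> None:
        # The documented method blocks until pending telemetry is processed;
        # this stand-in only records that it was called.
        self.calls.append("wait_for_idle")

    def deregister(self) -> None:
        self.registered = False
        self.calls.append("deregister")

    def shutdown(self) -> None:
        # Mirrors the description: deregister, then release owned resources.
        if self.registered:
            self.deregister()
        self.closed = True
        self.calls.append("shutdown")


def run_with_runtime(runtime: StandInRuntime, work: Callable[[], Any]) -> Any:
    """Run work() between register and shutdown, even if work() raises."""
    runtime.register()
    try:
        result = work()
        runtime.wait_for_idle()  # let telemetry drain before tearing down
        return result
    finally:
        runtime.shutdown()


runtime = StandInRuntime()
result = run_with_runtime(runtime, lambda: "done")
```

Placing shutdown in a finally block ensures the runtime is deregistered and released when the work fails, which matters if the real runtime owns background resources such as a telemetry drain.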
Functions
validate_config
Validate an adaptive config document without constructing a runtime.
build_cache_telemetry_event
Build one canonical cache telemetry event from usage plus request facts.
set_latency_sensitivity
Set a request-local latency-sensitivity hint.
Type Aliases And Constants
ADAPTIVE_PLUGIN_KIND