Use Adaptive Hints when downstream model calls or provider adapters can safely receive guidance metadata from the adaptive runtime.
Adaptive hints register as LLM request intercepts. Lower numeric priority values run earlier in the intercept chain. The default priority is chosen relative to other middleware rather than as a standalone importance score.
plugins.toml ExampleThis configuration injects adaptive guidance into outgoing model requests while allowing later request intercepts to continue running.
Use plugin configuration when the application should let NeMo Relay own the Adaptive Hints request-intercept lifecycle.
Use the manual runtime API when an integration needs to own adaptive lifecycle directly instead of activating the top-level plugin component.
Disable break_chain unless the adaptive hint should be the final request
transform. Adjust priority only when adaptive hints need to run before or
after known application middleware.
Outgoing managed LLM requests receive adaptive hint metadata in the configured header and body location. The hints do not replace the application callback or change the returned value by themselves. Downstream code must explicitly interpret the metadata before behavior changes.
inject_body_path does not match the request shape expected by downstream
provider adapters.