For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
  • About NVIDIA NeMo Relay
    • Overview
    • Architecture
    • Ecosystem
    • Concepts
    • Release Notes
  • Getting Started
    • Agent Runtime Primer
    • Prerequisites
    • Installation
    • Configuration / Setup
    • Quick Start
  • NVIDIA NeMo Relay CLI
    • About
    • Basic Usage
    • Claude Code
    • Codex
    • Cursor
    • Hermes Agent
  • Supported Integrations
    • About
    • OpenClaw Plugin Guide
    • LangChain Integration Guide
    • LangGraph Integration Guide
    • Deep Agents Integration Guide
  • Instrument Applications
    • About
    • Adding Scopes and Marks
    • Instrument a Tool Call
    • Instrument an LLM Call
    • Add Middleware
    • Code Examples
  • Observability Plugin
    • About
    • Configuration
    • Agent Trajectory Interchange Format (ATIF)
    • Agent Trajectory Observability Format (ATOF)
    • OpenTelemetry
    • OpenInference
  • Adaptive Plugin
    • About
    • Configuration
    • Adaptive Cache Governor (ACG)
    • Adaptive Hints
  • NeMo Guardrails Plugin
    • About
    • Configuration
  • Integrate into Frameworks
    • About
    • Adding Scopes
    • Wrap Tool Calls
    • Wrap LLM Calls
    • Handle Non-Serializable Data
    • Using Codecs
    • Provider Codecs
    • Provider Response Codecs
    • Code Examples
  • Build Plugins
    • About
    • Define a Plugin
    • Validate Plugin Configuration
    • Plugin Configuration Files
    • Register Plugin Behavior
    • Design Plugin Configuration
    • NeMo Guardrails Example Plugin
    • Code Examples
  • Contribute
    • About
    • Development Setup
    • Workflow and Reviews
    • Testing and Documentation
  • Reference
    • APIs
      • Python Library Reference
      • Node.js Library Reference
      • Rust Library Reference
        • nemo-relay
          • api
            • event
            • llm
              • CreateLlmHandleParams
              • EndLlmHandleParams
              • LlmAttributes
              • LlmCallEndParams
              • LlmCallExecuteParams
              • LlmCallParams
              • LlmHandle
              • LlmRequest
              • LlmStreamCallExecuteParams
              • llm_call
              • llm_call_end
              • llm_call_execute
              • llm_conditional_execution
              • llm_request_intercepts
              • llm_stream_call_execute
            • registry
            • runtime
            • scope
            • subscriber
            • tool
          • codec
          • config_editor
          • error
          • json
          • observability
          • plugin
          • plugins
          • stream
          • editor_config
        • nemo-relay-adaptive
        • nemo-relay-ffi
    • Performance
  • Resources
    • Support and FAQs
    • Glossary
    • Troubleshooting Guide
    • Community
    • Legal
NVIDIANVIDIA
Developer-friendly docs for your API
Privacy Policy | Your Privacy Choices | Terms of Service | Accessibility | Corporate Policies | Product Security | Contact

Copyright © 2026, NVIDIA Corporation.

LogoLogo
On this page
  • Parameters
  • Returns
  • Errors
  • Notes
ReferenceAPIsRust Library Referencenemo-relayapillm

Function llm_call_end

||View as Markdown|
Previous

Function llm_call

Next

Function llm_call_execute

Generated from cargo doc --no-deps -p nemo-relay -p nemo-relay-adaptive -p nemo-relay-ffi.

pub fn llm_call_end(params: LlmCallEndParams<'_>) -> Result<()>

Finish a manual LLM lifecycle span.

This emits an LLM-end event for a handle previously returned by llm_call.

Parameters

  • handle: LLM handle to close.
  • response: Raw provider response associated with the end event.
  • data: Optional application payload retained for compatibility. The emitted end event data is the sanitized response unless it sanitizes to JSON null, in which case this payload is used.
  • metadata: Optional JSON metadata recorded on the end event.
  • annotated_response: Optional normalized response annotation produced by a response codec. When omitted and response_codec is supplied, the annotation is decoded from the sanitized end-event payload.
  • response_codec: Optional response codec used to produce a normalized response annotation from the sanitized end-event payload.
  • timestamp: Optional timestamp recorded on the emitted end event. When None, the runtime uses the current UTC time, or one microsecond after the handle start time if the current time is not later.

Returns

A Result that is Ok(()) when the end event has been emitted.

Errors

Returns an error when the runtime owner check fails, internal state cannot be read safely, or response codec decoding fails.

Notes

Sanitize-response guardrails affect only the emitted end-event payload, not the caller-owned response value.