
Copyright © 2026, NVIDIA Corporation.

NVIDIA NeMo Relay CLI

About

Use this section when you want the nemo-relay binary to observe local coding agent sessions through hooks, a passthrough LLM gateway, and NeMo Relay observability exporters.

The NeMo Relay CLI is installed by the nemo-relay-cli Cargo package. It can run supported coding agents through a managed local gateway, forward agent hook payloads into NeMo Relay lifecycle events, route OpenAI-compatible or Anthropic-compatible model traffic through the gateway, and diagnose local configuration.

Start Here When

Use these guides when you need to:

  • Observe Claude Code, Codex, Cursor, or Hermes Agent sessions locally.
  • Configure coding-agent hooks for NeMo Relay lifecycle events.
  • Route model-provider traffic through the local NeMo Relay gateway.
  • Export local sessions to Agent Trajectory Interchange Format (ATIF), Agent Trajectory Observability Format (ATOF) JSONL, OpenTelemetry, or OpenInference.
  • Diagnose hook loading, gateway routing, and exporter output.

If you are instrumenting an application or framework directly, use Instrument Applications or Integrate into Frameworks instead.
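When diagnosing exporter output, a quick structural check of a JSONL export can help confirm that records were written at all. The following is a minimal sketch, not part of the NeMo Relay CLI: the function name `summarize_jsonl` and the file path passed on the command line are hypothetical, and the sketch assumes only that the export contains one JSON value per line. It makes no assumption about the ATOF record schema.

```python
import json
import sys
from pathlib import Path


def summarize_jsonl(path):
    """Count parseable JSON records and list malformed line numbers.

    Hypothetical helper for illustration; it only checks that each
    non-blank line is valid JSON and does not inspect record fields.
    """
    records = 0
    malformed = []
    with Path(path).open(encoding="utf-8") as handle:
        for line_number, line in enumerate(handle, start=1):
            text = line.strip()
            if not text:
                continue  # blank lines are skipped, not counted as errors
            try:
                json.loads(text)
            except json.JSONDecodeError:
                malformed.append(line_number)
            else:
                records += 1
    return {"records": records, "malformed_lines": malformed}


if __name__ == "__main__":
    # The export path is supplied by the caller; no default location is assumed.
    if len(sys.argv) > 1:
        print(summarize_jsonl(sys.argv[1]))
    else:
        print("usage: python check_export.py <path-to-export.jsonl>")
```

A result with zero records or a non-empty `malformed_lines` list suggests that the exporter did not run or that the file was truncated. In either case, the agent-specific guides are the place to check hook loading and gateway routing.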

Agent Harness Support

Agent harness support in the NeMo Relay CLI is experimental and currently covers observability only.

| Agent | Observability | Security | Optimization | Notes |
| --- | --- | --- | --- | --- |
| Claude Code | ✅ Yes | ❌ No | ❌ No | Observability only; no known issues. |
| Codex | ✅ Yes | ❌ No | ❌ No | Observability only; some hooks needed for full feature coverage are missing. |
| Hermes Agent | ✅ Yes | ❌ No | ❌ No | Observability only; no known issues. |
| Cursor | ✅ Yes | ❌ No | ❌ No | Observability only; missing hooks under cursor-agent limit feature coverage. |

Guides

Use these guide links to move from CLI setup into agent-specific instructions.

  • Basic Usage explains gateway routes, transparent runs, shared configuration, hook forwarding, and runtime mapping.
  • Claude Code covers transparent Claude Code runs, Anthropic gateway routing, ATIF verification, and unsupported Claude application modes.
  • Codex covers transparent Codex CLI runs, local GUI/app caveats, model provider routing, and remote-task limits.
  • Cursor covers transparent Cursor runs, temporary hook patching, GUI and CLI smoke tests, and gateway routing limits.
  • Hermes Agent covers Hermes shell hook installation, dynamic gateway URL handling, session-finalize behavior, and hook consent caveats.

Start with Basic Usage, then use the guide for the coding agent that you want to observe.