NVIDIA NeMo Agent Toolkit Release Notes

This section contains the release notes for NeMo Agent Toolkit.

Release 1.5.0

Summary

This release expands the toolkit's runtime intelligence, framework-level performance acceleration, and production observability, and makes it easier to publish workflows to MCP ecosystems.

Migration notice: Release 1.5.0 includes packaging and compatibility refactors (meta-package changes, an eval package split, and import-path updates). Review the Migration Guide before upgrading.

  • Dynamo Runtime Intelligence: Automatically infer per-request latency sensitivity from agent profiles and apply runtime hints for cache control, load-aware routing, and priority-aware serving.

  • Agent Performance Primitives (APP): Introduce framework-agnostic performance primitives that accelerate graph-based agent frameworks such as LangChain, CrewAI, and Agno with parallel execution, speculative branching, and node-level priority routing.

  • LangSmith Native Integration: Observe end-to-end agent execution with native LangSmith tracing, run evaluation experiments, compare outcomes, and manage prompt versions across development and production workflows.

  • FastMCP Workflow Publishing: Publish NeMo Agent Toolkit workflows as MCP servers using the FastMCP runtime to simplify MCP-native deployment and integration.

Refer to the changelog for the complete list of changes.

Known Issues