NVIDIA NeMo Agent Toolkit Release Notes#
This section contains the release notes for NeMo Agent Toolkit.
Release 1.5.0#
Summary#
This release expands runtime intelligence, framework-level performance acceleration, and production observability in the toolkit, while making workflow publishing to MCP ecosystems easier.
Migration notice: Release 1.5.0 includes packaging and compatibility refactors (including meta-package changes, eval package split, and import-path updates). Review the Migration Guide before upgrading.
Dynamo Runtime Intelligence: Automatically infer per-request latency sensitivity from agent profiles and apply runtime hints for cache control, load-aware routing, and priority-aware serving.
Agent Performance Primitives (APP): Introduce framework-agnostic performance primitives that accelerate graph-based agent frameworks such as LangChain, CrewAI, and Agno with parallel execution, speculative branching, and node-level priority routing.
LangSmith Native Integration: Observe end-to-end agent execution with native LangSmith tracing, run evaluation experiments, compare outcomes, and manage prompt versions across development and production workflows.
FastMCP Workflow Publishing: Publish NeMo Agent Toolkit workflows as MCP servers using the FastMCP runtime to simplify MCP-native deployment and integration.
Refer to the changelog for the complete list of changes.
Known Issues#
Refer to NVIDIA/NeMo-Agent-Toolkit#issues for an up to date list of current issues.