NVIDIA NeMo Agent Toolkit Release Notes#
Release 1.2.1#
Summary#
This is a documentation only release, there are no code changes in this release.
Release 1.2.0#
Summary#
The NeMo Agent toolkit, formerly known as Agent Intelligence (AIQ) toolkit, has been renamed in this release to align with the NVIDIA NeMo family of products. This release also brings significant new capabilities and improvements across authentication, resource management, observability, and developer experience. The toolkit continues to offer backwards compatibility, making the transition seamless for existing users.
The following are the key features and improvements in this release:
Authentication for Tool Calling: Implement robust authentication mechanisms that enable secure and configurable access management for tool invocation within agent workflows.
Test Time Compute: Dynamically reallocate compute resources after model training, allowing agents to optimize reasoning, factual accuracy, and system robustness without retraining the base model.
Sizing Calculator: Estimate GPU cluster requirements to support your target number of users and desired response times, simplifying deployment planning and scaling.
Object Store Integration: Connect and manage data through supported object stores, improving agent extensibility and enabling advanced data workflows.
Enhanced Cursor Rules: Build new workflows or extend existing ones by leveraging cursor rules, making agent development faster and more flexible.
Interactive Notebooks: Access a suite of onboarding and example notebooks to accelerate agent workflow development, testing, and experimentation.
Observability Refactor: Onboard new observability and monitoring platforms more easily, and take advantage of improved plug-in architecture for workflow inspection and analysis.
Examples Reorganization: Organize examples by functionality, making it easier to find and use the examples.
Refer to the changelog for a complete list of changes.
Release 1.1.0#
Summary#
Full Model Context Protocol (MCP) support. Workflows/tools can now be exposed as MCP servers.
Deep integration with Weights and Biases’ Weave for logging and tracing support.
Addition of the Agno LLM framework.
A new ReWOO agent that improves on ReAct by removing the tool output from the LLM context, reducing token counts.
A new Alert Triage Agent example that demonstrates how to build a full application with NeMo Agent toolkit to automatically analyze system monitoring alerts, performs diagnostic checks using various tools, and generates structured triage reports with root cause categorization.
Support for Python 3.11.
Various other improvements.
Refer to the changelog for a complete list of changes.
Release 1.0.0#
Summary#
This is the first general release of NeMo Agent toolkit.
LLM APIs#
NIM
OpenAI
Supported LLM Frameworks#
LangChain
LlamaIndex
Known Issues#
Faiss is currently broken on Arm64. This is a known issue #72 caused by an upstream bug in the Faiss library facebookresearch/faiss#3936.
Refer to NVIDIA/NeMo-Agent-Toolkit#issues for an up to date list of current issues.