Agents#

The Warehouse Blueprint incorporates an agentic AI system that provides natural language interaction capabilities for querying warehouse safety incidents, generating reports, and retrieving visual information from warehouse cameras.

The agent uses NVIDIA Nemotron Nano 9B v2 (nvidia/nvidia-nemotron-nano-9b-v2) for natural language understanding and query routing, and NVIDIA Cosmos Reason2 8B (nvidia/cosmos-reason2-8b) for video understanding and analyzing warehouse safety incidents.

For detailed information on all components, APIs, and customization options, refer to the VSS Agents.

Agent Architecture#

VSS Agent Architecture

The agent system uses a two-tier hierarchy:

  1. Top Agent: LLM-powered routing agent that interprets queries, decides which tools or sub-agents to invoke, maintains conversation context, and streams reasoning traces.

  2. Sub-Agents: Specialized agents for different types of reporting tasks:

    • Report Agent: Handles detailed, comprehensive reports for single incidents. Retrieves incident data, performs video analysis using VLMs, and generates structured markdown reports with incident details, location information, people and vehicles involved.

    • Multi-Report Agent: Handles listing and summarizing multiple incidents. Supports filtering by time range, sensor, and incident count. Provides incident summaries with optional chart generation for visualizing incident trends.

For detailed agent architecture information, see Agent Overview

Tools#

Video Analytics Tools

Tool

Description

video_analytics_mcp.get_incidents

Retrieves multiple incidents from Video Analytics service

video_analytics_mcp.get_incident

Retrieves a specific incident by ID

video_analytics_mcp.get_fov_histogram

Gets field-of-view occupancy histogram data

video_analytics_mcp.get_sensor_ids

Lists all available sensors/cameras

Video Storage Tools

Tool

Description

vst_video_url

Retrieves video URLs for incident playback

vst_mcp.sensor_list

Gets list of sensors from Video Storage service

vst_mcp.get_video_storage_url

Gets video storage URLs

vst_mcp.get_replay_picture_url

Gets replay picture URLs

vst_mcp.get_live_picture_url

Gets live picture URLs

Report Generation Tools

Tool

Description

video_understanding

Uses VLM to analyze video frames and extract incident information

template_report_gen

Generates structured incident reports using templates and VLM

chart_generator

Creates visualization charts for reports

multi_incident_formatter

Formats multiple incidents for summary reports

get_fov_counts_with_chart

Provides occupancy statistics with histogram visualizations

Supported Queries#

Query Type

Examples

Sensor Discovery

List all available sensors

Snapshots

Take a snapshot of Camera_01

Incident Listing

List last 5 incidents for Camera_01, Show incidents in the last 24 hours

Reports

Generate a report for incident 12345, Give a report for Camera_01 in last hour

Occupancy Metrics

How many people were in Camera_01 20 minutes ago?

Multi-Step

List last 5 incidents; generate report for the second one

The agent understands temporal expressions such as “last 10 minutes”, “yesterday”, and “past hour”.

Note

Camera_01 is an example. Ask List all available sensors to discover your sensor names.

Example: Sensor Discovery

Listing available sensors

Example: Incident Report with Chart

Generating incident reports

Example: Camera Snapshot

Camera snapshot

Warehouse Agent Configuration#

Configuration File Location

The warehouse agent configuration file is located at deployments/warehouse/vss-agent/configs/config.yml. For detailed configuration options and YAML structure, see Agent Configuration.

Models Used

Model

Type

Purpose

nvidia/nvidia-nemotron-nano-9b-v2

LLM

Query routing, reasoning, report generation

nvidia/cosmos-reason2-8b

VLM

Video understanding and analysis

Key Warehouse-Specific Settings

The warehouse profile configures the following key components:

  • Routing prompt: Optimized for warehouse video surveillance incident reporting

  • VLM prompts: Tuned to analyze incidents, location conditions (lighting, floor, blockages), people (worker/driver/pedestrian), and vehicles (forklift/transporter)

  • Report template: Warehouse incident report format

For the complete configuration reference including all tools, agents, environment variables, and YAML examples, see Agent Configuration.

For information about other available profiles (Smart Cities, Public Safety, Developer), see Agent Profiles.

Customization#

The agent configuration is designed to be flexible and extensible. Key customization areas for the warehouse deployment include:

  • Prompt Customization: Modify workflow.prompt to adjust routing logic for warehouse-specific query patterns

  • VLM Prompt Engineering: Tune vlm_prompts in template_report_gen for different warehouse scenarios

  • Report Templates: Create custom markdown templates in the templates directory for warehouse-specific report formats

For comprehensive customization guidance including adding custom functions and MCP servers, see Agent Customization.

Agent Evaluation#

The VSS Agent includes a comprehensive evaluation framework for assessing agent performance across different dimensions including report quality, question-answering accuracy, and trajectory quality.

For detailed information on configuring evaluators, creating evaluation datasets, and interpreting results, see Agent Evaluation.

Observability#

Warehouse agents support distributed tracing via Phoenix through NeMo Agent Toolkit (NAT) telemetry export.

To enable Phoenix telemetry, configure telemetry in the agent configuration file (see Agent Configuration) and set PHOENIX_ENDPOINT in your environment.

Once telemetry is enabled and your agent is running, access the Phoenix UI at http://<HOST_IP>:6006 (projects view is typically under /projects).

For a walkthrough of the Phoenix UI (projects view, traces list, and sample trace anatomy), see Observability.

Additional references:

Known Limitations#

For information on warehouse blueprint known limitations, please refer to the Known Limitations section.

For information on general VSS Agent known issues, please refer to the VSS Agent Known Issues section.