Real-Time Alert Workflow#

The Real-Time Alert Workflow monitors live video streams and generates alerts when the VLM detects anomalies or specified events.

Capabilities

Use Cases

  • Traffic collision detection

  • Unusual behavior detection

  • Equipment malfunction identification

  • Safety hazard detection

Estimated Deployment Time: 15-20 minutes

The following diagram illustrates the real-time alert workflow architecture:

Vision Agent with Real-Time Alerting Architecture

Key Features of the Real-Time Alert Agent:

  • Continuous frame sampling from video streams

  • Natural language queries for detected alerts

  • VLM-based anomaly detection using the RTVI microservice

  • Configurable alert prompts and invocation settings for custom detection scenarios

  • Report generation

What’s being deployed#

  • VSS Agent: Agent service that orchestrates tool calls and model inference to answer questions and generate outputs

  • VSS Agent UI: Web UI with chat, video upload, and different views

  • RTVI VLM: Real-Time VLM microservice for alert verification

  • Video IO & Storage (VIOS): Video ingestion, recording, and playback services used by the agent for video access and management

  • NVStreamer: Video streaming service for video playback

  • Nemotron LLM (NIM): LLM inference service used for reasoning, tool selection, and response generation

  • Cosmos Reason (NIM): Vision-language model with physical reasoning capabilities

  • ELK: Elasticsearch, Logstash, and Kibana stack for log storage and analysis

  • Phoenix: Observability and telemetry service for agent workflow monitoring

Prerequisites#

Before you begin, ensure that all prerequisites are met. See Prerequisites for more details.

Deploy#

Note

For instructions on downloading sample data and the deployment package, see Download Sample Data and Deployment Package in the Quickstart guide.

Skip to Step 1: Deploy the Agent if you have already downloaded and deployed another agent workflow.

Step 1: Deploy the Agent#

Set your NGC API key and review the available options:

# Set NGC CLI API key
export NGC_CLI_API_KEY='your_ngc_api_key'

# View all available options
scripts/dev-profile.sh --help

Then run the deployment command for your hardware platform (selected with the -H flag).

H100:

scripts/dev-profile.sh up -p alerts -m real-time -H H100

# Place the LLM and VLM on specific GPUs
scripts/dev-profile.sh up -p alerts -m real-time -H H100 \
    --llm-device-id 1 --vlm-device-id 2

# Or use a remote LLM endpoint
export LLM_ENDPOINT_URL=https://your-llm-endpoint.com
scripts/dev-profile.sh up -p alerts -m real-time -H H100 \
    --use-remote-llm

RTXPRO6000BW:

scripts/dev-profile.sh up -p alerts -m real-time -H RTXPRO6000BW

# Place the LLM and VLM on specific GPUs
scripts/dev-profile.sh up -p alerts -m real-time -H RTXPRO6000BW \
    --llm-device-id 1 --vlm-device-id 2

# Or use a remote LLM endpoint
export LLM_ENDPOINT_URL=https://your-llm-endpoint.com
scripts/dev-profile.sh up -p alerts -m real-time -H RTXPRO6000BW \
    --use-remote-llm

L40S:

# Place the LLM and VLM on specific GPUs
scripts/dev-profile.sh up -p alerts -m real-time -H L40S \
    --llm-device-id 1 --vlm-device-id 2

# Or use a remote LLM endpoint
export LLM_ENDPOINT_URL=https://your-llm-endpoint.com
scripts/dev-profile.sh up -p alerts -m real-time -H L40S \
    --use-remote-llm

DGX-SPARK (remote LLM):

export LLM_ENDPOINT_URL=https://your-llm-endpoint.com
scripts/dev-profile.sh up -p alerts -m real-time -H DGX-SPARK \
    --use-remote-llm

IGX-THOR (remote LLM):

export LLM_ENDPOINT_URL=https://your-llm-endpoint.com
scripts/dev-profile.sh up -p alerts -m real-time -H IGX-THOR \
    --use-remote-llm

AGX-THOR (remote LLM):

export LLM_ENDPOINT_URL=https://your-llm-endpoint.com
scripts/dev-profile.sh up -p alerts -m real-time -H AGX-THOR \
    --use-remote-llm

See VSS-Agent-Customization-configure-llm for remote LLM endpoint options.

OTHER hardware (see Local LLM and VLM deployments on OTHER hardware for known limitations and constraints):

scripts/dev-profile.sh up -p alerts -m real-time -H OTHER \
    --llm-env-file /path/to/llm.env --vlm-env-file /path/to/vlm.env

# Place the LLM and VLM on specific GPUs
scripts/dev-profile.sh up -p alerts -m real-time -H OTHER \
    --llm-device-id 1 --vlm-device-id 2 \
    --llm-env-file /path/to/llm.env --vlm-env-file /path/to/vlm.env

# Or use a remote LLM endpoint
export LLM_ENDPOINT_URL=https://your-llm-endpoint.com
scripts/dev-profile.sh up -p alerts -m real-time -H OTHER \
    --use-remote-llm --vlm-env-file /path/to/vlm.env

The deployment command downloads the necessary containers from the NGC Docker registry and starts the agent. Depending on your network speed, this may take a few minutes.

This deployment uses the following defaults:

  • Host IP: the source (src) IP reported by ip route get 1.1.1.1

  • LLM model: nvidia/nvidia-nemotron-nano-9b-v2

  • VLM model: nvidia/cosmos-reason2-8b

To use an IP address other than the derived default:

  • -i: Manually specify the host IP address.

  • -e: Optionally specify an externally accessible IP address for services that need to be reached from outside the host.
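The default host IP derivation can be sketched as follows. Here route_line is a sample output with documentation IPs, and the exact parsing inside dev-profile.sh may differ; the real script parses the live output of ip route get 1.1.1.1:

```shell
# Sketch: extract the value following "src" from `ip route get 1.1.1.1` output.
# route_line is a stand-in sample; the script parses the live command output.
route_line="1.1.1.1 via 192.0.2.1 dev eth0 src 192.0.2.10 uid 1000"
host_ip=$(printf '%s\n' "$route_line" |
  awk '{ for (i = 1; i < NF; i++) if ($i == "src") { print $(i + 1); exit } }')
echo "$host_ip"
```

If the derived address is not the one you want, bypass the derivation entirely by passing -i (and optionally -e) to dev-profile.sh as described above.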

Note

When using a remote VLM of model-type nim (not openai), see How does a remote nim VLM access videos? for access requirements.

Once the deployment is complete, check that all the containers are running and healthy:

docker ps

Once all the containers are running, you can access the agent UI at http://<HOST_IP>:3000/.
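As a sketch, the status column of docker ps can be filtered for any container that is not reporting (healthy). The sample_status lines below (including the container names) are hypothetical stand-ins for real docker ps --format '{{.Names}}\t{{.Status}}' output:

```shell
# Flag any container whose status line does not include "(healthy)".
# sample_status stands in for: docker ps --format '{{.Names}}\t{{.Status}}'
sample_status="vss-agent\tUp 5 minutes (healthy)\nrtvi-vlm\tUp 4 minutes (healthy)"
result=$(printf "$sample_status\n" |
  awk -F'\t' '!/\(healthy\)/ { print $1 " needs attention"; bad = 1 }
              END { if (!bad) print "all containers healthy" }')
echo "$result"
```

On a live host, pipe the real docker ps output through the same awk filter to spot containers that failed their health checks.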

Step 2: Add a video stream#

Add an RTSP stream by clicking the “+ Add RTSP” button under the “Video Management” tab in the agent UI. If you do not have an RTSP stream, you can use NVStreamer at http://<HOST_IP>:31000 to upload a video file and create an RTSP stream.

For this profile, use the warehouse_sample.mp4 stream. When prompting, ensure the sensor name matches exactly what you configured for the stream (e.g., warehouse_sample).

Upload RTSP Stream

Note

By default, this profile supports processing only one stream at a time.

Step 3: Start a real-time alert#

Launch the Agent UI at http://<HOST_IP>:3000/.

Use the Chat Tab to interact with the system:

  • Start an RTVI real-time alert for a stream by specifying the alert type.
    • Sample prompt: Start real-time alert for boxes dropped on sensor warehouse_sample

    Start Real-Time Alert

    To view the reasoning trace for alert detection, click on the “Trace” icon in the alert details. This shows the VLM’s analysis process and decision-making steps.

    Reasoning Trace View
  • Stop an RTVI real-time alert for a stream when monitoring is no longer needed.
    • Sample prompt: Stop real-time alert on sensor warehouse_sample

    Stop Real-Time Alert
  • List alert incidents for a stream to review detected anomalies.
    • Sample prompt: Show me the 5 most recent incidents from warehouse_sample as a table

    List Alert Incidents

Click on the “Alerts” tab on the left hand side and enable “VLM Verified” to view the verified alerts for all active sensors.

Alerts Tab in the Agent UI

Step 4: Generate a Report for the Alert#

You can use the chat interface to request a report for the generated alerts. The report is currently generated in Markdown format and displayed in the VSS UI.

First, identify the alert ID for which the report should be generated. You can retrieve the ID through the chat interface as described earlier; alternatively, expand any alert in the Alerts tab to display its Id along with the other metadata associated with the alert.

Then use the ID to request report generation, also specifying the associated sensor, as shown in the sample image below.

Report Generation

Step 5: Teardown the Agent#

To teardown the agent, run the following command:

scripts/dev-profile.sh down

This command will stop and remove the agent containers.

Service Endpoints#

Once deployed, the following services are available:

Service         URL
VSS UI          http://<HOST_IP>:3000
NVStreamer UI   http://<HOST_IP>:31000/#/dashboard
VST UI          http://<HOST_IP>:30888/vst/#/dashboard
Phoenix UI      http://<HOST_IP>:6006/projects
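The endpoint URLs above can be generated (or probed) from a shell. A minimal sketch, assuming HOST_IP is exported in your environment (it falls back to localhost here); swap echo for a curl probe to smoke-test a live deployment:

```shell
# Build the service endpoint URLs from name:port pairs.
HOST_IP="${HOST_IP:-localhost}"
endpoints=$(
  for entry in "VSS UI:3000" "NVStreamer UI:31000" "VST UI:30888" "Phoenix UI:6006"; do
    name="${entry%:*}"     # text before the last colon
    port="${entry##*:}"    # text after the last colon
    echo "${name} -> http://${HOST_IP}:${port}"
  done
)
echo "$endpoints"
```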

Known Issues#

  • After deployment, clear the chat and refresh the VSS UI page to remove any data left over from a previous deployment.