Features
VSS supports video and image file upload, live streams, summarization, Q&A, and alerts on files and live streams, with various configuration options.
Faster long video processing
Image and Multi-Image processing
Live Stream (RTSP) support
Supported file formats: mp4, mkv, jpg, png
Supported codecs: h264/h265 video and Opus/Vorbis audio
Summarization for videos, images, and live streams
Q&A for videos, images, and live streams
Alerts for videos and live streams
TRT-LLM acceleration for VILA-1.5
TRT-LLM acceleration for NVILA
vLLM acceleration for Cosmos-Reason1 and Qwen2.5-VL-based models
Multi-node multi-GPU support
Context-Aware RAG support for enhanced accuracy and detail, including Vector-RAG and Graph-RAG
Support for GPT-4o as the VLM and LLM
Support for OpenAI-compatible hosted VLMs
Drop-in support for custom VLMs
Guardrails support
OpenAI API-based REST API
Multi-stream support
Riva ASR-based audio transcription in summarization, Q&A, and alerts
CV pipeline to generate CV metadata and Set of Marks (SOM) Prompting for videos and live streams
Support for fine-tuned NVILA: recipe to fuse a LoRA checkpoint with the base NVILA model
Support for reviewing video snippets of externally generated alerts using a VLM
Note
The CV pipeline feature is currently at the Alpha stage.
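
The supported file formats listed under Features can be checked on the client before upload. The following is a minimal sketch and is not part of VSS: the helper name `classify_media` and the case-insensitive extension matching are assumptions made for illustration. It looks only at the file extension and does not verify the h264/h265 or Opus/Vorbis codec requirements.

```python
from pathlib import Path

# Extensions taken from the "Supported file formats" entry in the feature list.
VIDEO_EXTENSIONS = {".mp4", ".mkv"}
IMAGE_EXTENSIONS = {".jpg", ".png"}


def classify_media(path: str) -> str:
    """Return "video", "image", or "unsupported" based on the extension only.

    Hypothetical helper: matching is case-insensitive by assumption, and
    no codec or container inspection is performed.
    """
    suffix = Path(path).suffix.lower()
    if suffix in VIDEO_EXTENSIONS:
        return "video"
    if suffix in IMAGE_EXTENSIONS:
        return "image"
    return "unsupported"
```

A caller could use the result to skip files that would be rejected, for example by filtering a directory listing down to the entries where `classify_media(name) != "unsupported"`.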