Features#

VSS supports video and image file upload, live stream support, summarization, Q&A, and alerts on files and live streams with various configuration options.

Faster Long video processing
Image and Multi-Image processing
Live Stream (RTSP) support
Supported file formats: mp4, mkv, jpg, png
Supported codecs: h264/h265 video and Opus/Vorbis audio
Summarization for videos, images, and live streams
Q&A for videos, images, and live streams
Alerts for videos and live streams
TRT-LMM acceleration for VILA-1.5
TRT-LMM acceleration for NVILA
Multi-node multi-GPU support
Context Aware RAG support for enhanced accuracy and details, includes Vector-RAG and Graph-RAG
Support for GPT-4o as the VLM and LLM
Use OpenAI Compatible hosted VLM models
Drop-in support for custom VLMs
Guardrails support
OpenAI API based REST API
Multi-stream support
Use of Riva ASR based audio transcription in summarization, QnA, and alerts
CV pipeline to generate CV metadata and Set of Marks (SOM) Prompting for videos and live streams
Support for finetuned NVILA : Recipe to fuse LoRA checkpoint with Base NVILA model

Note

CV pipeline feature is currently at Alpha stage.