Features#

VSS supports video and image file upload, live stream support, summarization, Q&A, and alerts on files and live streams with various configuration options.

  • Faster Long video processing

  • Image and Multi-Image processing

  • Live Stream (RTSP) support

  • Supported file formats: mp4, mkv, jpg, png

  • Supported codecs: h264/h265 video and Opus/Vorbis audio

  • Summarization for videos, images, and live streams

  • Q&A for videos, images, and live streams

  • Alerts for videos and live streams

  • TRT-LMM acceleration for VILA-1.5

  • TRT-LMM acceleration for NVILA

  • Multi-node multi-GPU support

  • Context Aware RAG support for enhanced accuracy and details, includes Vector-RAG and Graph-RAG

  • Support for GPT-4o as the VLM and LLM

  • Use OpenAI Compatible hosted VLM models

  • Drop-in support for custom VLMs

  • Guardrails support

  • OpenAI API based REST API

  • Multi-stream support

  • Use of Riva ASR based audio transcription in summarization, QnA, and alerts

  • CV pipeline to generate CV metadata and Set of Marks (SOM) Prompting for videos and live streams

  • Support for finetuned NVILA : Recipe to fuse LoRA checkpoint with Base NVILA model

Note

CV pipeline feature is currently at Alpha stage.