Quickstart Guide#

This page will walk you through running inference with the pre-trained Cosmos-Reason1-7B model on a sample video.

Basic Example#

Run the inference_sample.py script to instruct the Cosmos-Reason1-7B model to describe what happens in the sample video.

./scripts/inference_sample.py

The inference.py example script allows you to perform the following inference tasks:

Caption the sample video:

./scripts/inference.py --prompt prompts/caption.yaml --videos assets/sample.mp4 -v

Ask a question about the sample video with reasoning:

./scripts/inference.py --prompt prompts/question.yaml --question 'What are the potential safety hazards?' --reasoning --videos assets/sample.mp4 -v

Temporally caption the sample video and save the input frames to outputs/temporal_caption_text for debugging:

./scripts/inference.py --prompt prompts/temporal_caption_text.yaml --videos assets/sample.mp4 --timestamp -v -o outputs/temporal_caption_text

You can configure inference by editing the following files:

The Using Cosmos-Reason1 for Rejection Sampling page outlines how to use Cosmos-Reason1 to perform rejection sampling on videos produced by Cosmos-Predict.

Refer to the Reason1 Post-Training Guide to learn more about post-training the Reason1 model.