Cosmos-Predict2
Cosmos-Predict2 is a branch of the Cosmos World Foundation Models ecosystem for Physical AI that specializes in predicting future states through world modeling. It offers two capabilities: text-to-image generation, which creates high-quality images from text descriptions, and video-to-world generation, which produces visual simulations from video inputs.
Cosmos-Predict2 includes the following:
Diffusion-based world foundation models for Text2Image and Video2World generation, which let a user generate a visual simulation from text or video prompts.