Wan 2.1 T2V
Wan 2.1 is a text-to-video diffusion model from Wan AI, trained with flow matching on a large-scale video dataset. It generates high-quality short video clips from text prompts.
Available Models
- Wan2.1-T2V-1.3B: 1.3B parameters
Task
- Text-to-Video (T2V)
Example HF Models
Example Recipes
Try with NeMo AutoModel
1. Install (full instructions):
2. Clone the repo to get the example recipes:
3. Run the recipe from inside the repo:
Run with Docker
1. Pull the container and mount a checkpoint directory:
2. Navigate to the AutoModel directory (where the recipes are):
3. Run the recipe:
See the Installation Guide and Diffusion Fine-Tuning Guide.
Training
See the Diffusion Training and Fine-Tuning Guide and Dataset Preparation.