Qwen-Image
Qwen-Image
Qwen-Image is Alibaba Cloud’s text-to-image diffusion transformer. NeMo AutoModel supports Qwen-Image training via its flow-matching pipeline with a dedicated qwen_image adapter, enabling FSDP2 parallelization, multiresolution bucketed dataloading and LoRA-style fine-tuning.
Available Models
- Qwen-Image
Task
- Text-to-Image (T2I)
Example HF Models
Example Recipes
Try with NeMo AutoModel
1. Install (full instructions):
2. Clone the repo to get the example recipes:
3. Run the recipe from inside the repo:
Run with Docker
1. Pull the container and mount a checkpoint directory:
2. Navigate to the AutoModel directory (where the recipes are):
3. Run the recipe:
See the Installation Guide and Diffusion Training and Fine-Tuning Guide.
Fine-Tuning
See the Diffusion Training and Fine-Tuning Guide.