Important

NeMo 2.0 is an experimental feature and is currently released only in the dev container: nvcr.io/nvidia/nemo:dev. Please refer to the Migration Guide for information on getting started.

Text to Image Models

NeMo multimodal provides implementations of multiple text-to-image models, including Stable Diffusion, Imagen, DreamBooth, ControlNet, and InstructPix2Pix. Please refer to the NeMo Framework User Guide for Multimodal Models for detailed support information.