Important
You are viewing the NeMo 2.0 documentation. This release introduces significant changes to the API and a new library, NeMo Run. We are currently porting all features from NeMo 1.0 to 2.0. For documentation on previous versions or features not yet available in 2.0, please refer to the NeMo 24.07 documentation.
Framework Inference#
For DreamBooth, the inference script generates images from text prompts defined in the config file, similar to section
5.7.3. Note that, dreambooth is a fine-tuning model based on diffusion models to link a special token with certain
subject, so make sure the special token you trained on is included in the text prompt. For
example, a photo of sks dog sleeping.
To enable the inference stage with DreamBooth, configure the configuration files:
In the
defaultssection ofconf/config.yaml, update thefw_inferencefield to point to the desired DreamBooth inference configuration file. For example, if you want to use thedreambooth/text2img.yamlconfiguration, change thefw_inferencefield todreambooth/text2img.defaults: - fw_inference: dreambooth/text2img ...
In the
stagesfield ofconf/config.yaml, make sure thefw_inferencestage is included. For example,stages: - fw_inference ...
Configure
promptsandnum_images_per_promptfields ofconf/fw_inference/dreambooth/text2img.yaml. Setmodel.restore_from_pathto the ckpt generated from dreambooth training.
Remarks:
Please refer to DreamBooth Training , the inference stage of DreamBooth should be conducted subsequent to the DreamBooth conversion process. This conversion transforms the DreamBooth ckpt into a ‘.nemo’ format and meanwhile remapping the parameter keys into Stable Diffusion style, allowing for a consistent inference pipeline.