`nemo_automodel.components.flow_matching.adapters.flux`

Flux model adapter for FlowMatching Pipeline.

This adapter supports FLUX.1 style models with:

Module Contents

FluxAdapter: Model adapter for FLUX.1 image generation models.

class nemo_automodel.components.flow_matching.adapters.flux.FluxAdapter(guidance_scale: float = 3.5, use_guidance_embeds: bool = True)

Model adapter for FLUX.1 image generation models.

Supports batch format from multiresolution dataloader:

FLUX model forward interface:

Initialization

Initialize FluxAdapter.

Parameters:

_pack_latents(latents: torch.Tensor) → torch.Tensor

Pack latents from [B, C, H, W] to Flux format [B, (H//2)*(W//2), C*4].

Flux uses a 2x2 patch embedding, so latents are reshaped accordingly.
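The reshaping described above can be sketched in a few lines of PyTorch. This is a minimal illustration, not the adapter's actual code: the standalone function name `pack_latents` and the (channel, row offset, column offset) ordering inside each token are assumptions.

```python
import torch


def pack_latents(latents: torch.Tensor) -> torch.Tensor:
    """Pack [B, C, H, W] latents into [B, (H//2)*(W//2), C*4] tokens."""
    b, c, h, w = latents.shape
    if h % 2 or w % 2:
        raise ValueError("H and W must both be even for 2x2 packing")
    # Split each spatial axis into (patch index, offset within the patch).
    x = latents.reshape(b, c, h // 2, 2, w // 2, 2)
    # Bring the patch grid forward; group channel with the two offsets.
    # Assumed feature order inside a token: (channel, dy, dx).
    x = x.permute(0, 2, 4, 1, 3, 5)
    # Flatten the grid into a sequence and each patch into one vector.
    return x.reshape(b, (h // 2) * (w // 2), c * 4)


latents = torch.arange(2 * 16 * 8 * 12, dtype=torch.float32).view(2, 16, 8, 12)
packed = pack_latents(latents)
print(packed.shape)  # torch.Size([2, 24, 64])
```

Each of the 24 tokens corresponds to one 2x2 patch of the 8x12 latent grid, and its 64 features are the 16 channels times the 4 positions in that patch.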

static _unpack_latents(latents: torch.Tensor, height: int, width: int, vae_scale_factor: int = 8) → torch.Tensor

Unpack latents from Flux format back to [B, C, H, W].
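A minimal sketch of the inverse operation is shown here. It assumes that `height` and `width` are pixel-space sizes that are divided by `vae_scale_factor` to recover the latent grid (a convention borrowed from other Flux pipelines, not confirmed for this adapter), and that each token stores its features in (channel, row offset, column offset) order. The function name `unpack_latents` is illustrative.

```python
import torch


def unpack_latents(latents: torch.Tensor, height: int, width: int,
                   vae_scale_factor: int = 8) -> torch.Tensor:
    """Unpack [B, N, C*4] tokens into [B, C, H, W] latents.

    Assumption: height and width are pixel-space sizes, so the latent
    grid is obtained by dividing by vae_scale_factor.
    """
    b, n, d = latents.shape
    # Latent grid size, rounded down to an even number for 2x2 patches.
    h = 2 * (height // (vae_scale_factor * 2))
    w = 2 * (width // (vae_scale_factor * 2))
    if n != (h // 2) * (w // 2):
        raise ValueError("token count does not match the requested size")
    # Restore the patch grid and the (channel, dy, dx) layout per token.
    x = latents.reshape(b, h // 2, w // 2, d // 4, 2, 2)
    # Interleave patch indices with in-patch offsets.
    x = x.permute(0, 3, 1, 4, 2, 5)
    return x.reshape(b, d // 4, h, w)


# Round trip: pack a [2, 16, 8, 12] latent by hand, then unpack it.
original = torch.randn(2, 16, 8, 12)
packed = (original.reshape(2, 16, 4, 2, 6, 2)
          .permute(0, 2, 4, 1, 3, 5)
          .reshape(2, 24, 64))
restored = unpack_latents(packed, height=64, width=96)
print(torch.equal(restored, original))  # True
```

Because packing and unpacking only move data, the round trip is exact rather than approximate.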

Parameters:

_prepare_latent_image_ids(batch_size: int, height: int, width: int, device: torch.device, dtype: torch.dtype) → torch.Tensor

Prepare positional IDs for image latents.

Returns tensor of shape [B, (H//2)*(W//2), 3] containing (batch_idx, y, x).
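One way to build a tensor with this shape and content is sketched here. It takes the description literally by writing the batch index into the first channel, and it assumes `height` and `width` are latent-space sizes; neither detail is confirmed by this page, and the standalone function name is illustrative.

```python
import torch


def prepare_latent_image_ids(batch_size: int, height: int, width: int,
                             device: torch.device,
                             dtype: torch.dtype) -> torch.Tensor:
    """Return [B, (H//2)*(W//2), 3] IDs holding (batch_idx, y, x) per token.

    Assumption: height and width are latent-space sizes (H and W).
    """
    gh, gw = height // 2, width // 2
    ids = torch.zeros(batch_size, gh, gw, 3, device=device, dtype=dtype)
    # Channel 0: batch index (assumed from the description above).
    ids[..., 0] = torch.arange(batch_size, device=device, dtype=dtype)[:, None, None]
    # Channel 1: row of the 2x2 patch in the packed grid.
    ids[..., 1] = torch.arange(gh, device=device, dtype=dtype)[None, :, None]
    # Channel 2: column of the 2x2 patch in the packed grid.
    ids[..., 2] = torch.arange(gw, device=device, dtype=dtype)[None, None, :]
    # Flatten the grid in row-major order to match the packed token order.
    return ids.reshape(batch_size, gh * gw, 3)


ids = prepare_latent_image_ids(2, 8, 12, torch.device("cpu"), torch.float32)
print(ids.shape)           # torch.Size([2, 24, 3])
print(ids[1, 7].tolist())  # [1.0, 1.0, 1.0]
```

Token 7 of a 4x6 patch grid sits at row 1, column 1, which is why the second sample's entry reads (1, 1, 1).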

Prepare inputs for Flux model from FlowMatchingContext.

Expects 4D image latents: [B, C, H, W]

forward(model: torch.nn.Module, inputs: Dict[str, Any]) → torch.Tensor

Execute forward pass for Flux model.

Returns unpacked prediction in [B, C, H, W] format.