> For clean Markdown of any page, append .md to the page URL.
> For a complete documentation index, see https://docs.nvidia.com/nemo/curator/llms.txt.
> For full documentation content, see https://docs.nvidia.com/nemo/curator/llms-full.txt.
> For AI client integration (Claude Code, Cursor, etc.), connect to the MCP server at https://docs.nvidia.com/nemo/curator/_mcp/server.

> Technical reference for NeMo Curator's infrastructure components including distributed computing, memory management, and GPU acceleration

# Infrastructure References

This section provides technical reference documentation for NeMo Curator's infrastructure components that are used across all modalities (text, image, video).

***

## Infrastructure Components

Optimize memory usage when processing large datasets
partitioning
batching
monitoring

Leverage NVIDIA GPUs for faster data processing
cuda
rmm
performance

Continue interrupted operations across large datasets
checkpoints
recovery
batching

Available environments and configurations in NeMo Curator containers. Includes build arguments and video-specific environments.
docker
conda
environments