***

description: >-
Technical reference for NeMo Curator's infrastructure components including
distributed computing, memory management, and GPU acceleration
categories:

* reference
  tags:
* infrastructure
* distributed
* gpu-accelerated
* memory-management
* docker
* performance
  personas:
* admin-focused
* mle-focused
* devops-focused
  difficulty: reference
  content\_type: reference
  modality: universal

***

# Infrastructure References

This section provides technical reference documentation for NeMo Curator's infrastructure components that are used across all modalities (text, image, video).

***

## Infrastructure Components

<Cards>
  <Card title="Memory Management" href="/reference/infra/memory-management">
    Optimize memory usage when processing large datasets
    partitioning
    batching
    monitoring
  </Card>

  <Card title="GPU Acceleration" href="/reference/infra/gpu-processing">
    Leverage NVIDIA GPUs for faster data processing
    cuda
    rmm
    performance
  </Card>

  <Card title="Resumable Processing" href="/reference/infra/resumable-processing">
    Continue interrupted operations across large datasets
    checkpoints
    recovery
    batching
  </Card>

  <Card title="Container Environments" href="/reference/infra/container-environments">
    Available environments and configurations in NeMo Curator containers. Includes build arguments and video-specific environments.
    docker
    conda
    environments
  </Card>
</Cards>
