For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
DocumentationAPI Reference
DocumentationAPI Reference
  • API Reference
    • Overview
        • Nemo Curator
          • Backends
          • Config
          • Core
          • Metrics
          • Models
          • Package Info
          • Pipeline
          • Stages
          • Tasks
          • Utils
            • Client Utils
            • Column Utils
            • Decoder Utils
            • File Utils
            • Gpu Utils
            • Grouping
            • Hf Download Utils
            • Merge File Prefixes
            • Nvcodec Utils
            • Operation Utils
            • Performance Utils
            • Prompts
            • Ray Utils
            • Split Large Files
            • Storage Utils
            • Vllm Utils
            • Windowing Utils
            • Writer Utils
    • Pipeline
    • ProcessingStage
    • CompositeStage
    • Resources
NVIDIANVIDIA
Developer-friendly docs for your API
Privacy Policy | Your Privacy Choices | Terms of Service | Accessibility | Corporate Policies | Product Security | Contact

Copyright © 2026, NVIDIA Corporation.

LogoLogoNeMo Curator
On this page
  • Module Contents
  • Classes
  • Functions
  • API
API ReferenceFull Library ReferenceNemo CuratorNemo CuratorUtils

nemo_curator.utils.client_utils

||View as Markdown|
Previous

nemo_curator.utils

Next

nemo_curator.utils.column_utils

Module Contents

Classes

NameDescription
FSPathWrapper that combines filesystem and path for convenient file operations.

Functions

NameDescription
is_remote_url-

API

class nemo_curator.utils.client_utils.FSPath(
fs: fsspec.AbstractFileSystem,
path: str
)

Wrapper that combines filesystem and path for convenient file operations.

nemo_curator.utils.client_utils.FSPath.__repr__()
nemo_curator.utils.client_utils.FSPath.__str__()
nemo_curator.utils.client_utils.FSPath.as_posix() -> str
nemo_curator.utils.client_utils.FSPath.get_bytes_cat_ranges(
part_size: int = 10 * 1024 ** 2
) -> bytes

Read object into memory using fsspec’s cat_ranges. Modified from https://github.com/rapidsai/cudf/blob/ba64909422016ba389ab06ed01d7578336c19e8e/python/dask_cudf/dask_cudf/io/json.py#L26-L34

nemo_curator.utils.client_utils.FSPath.open(
mode: str = 'rb',
kwargs = {}
) -> fsspec.spec.AbstractBufferedFile
nemo_curator.utils.client_utils.is_remote_url(
url: str
) -> bool