nemo_curator.stages.interleaved.io.writers.base
nemo_curator.stages.interleaved.io.writers.base
Module Contents
Classes
API
DataclassAbstract
Bases: ProcessingStage[InterleavedBatch, FileGroupTask]
Base class for interleaved writers.
Handles filesystem setup, deterministic file naming, optional binary
materialization, and process() orchestration. Subclasses implement
_write_dataframe for format-specific output.
append_mode_implemented
file_extension
materialize_on_write
mode
name
path
write_kwargs
abstract
Format-specific DataFrame writer. Subclasses implement this.