---
layout: overview
slug: nemo-curator/nemo_curator/tasks/file_group
title: nemo_curator.tasks.file_group
---

## Module Contents

### Classes

| Name                                                            | Description                                    |
| --------------------------------------------------------------- | ---------------------------------------------- |
| [`FileGroupTask`](#nemo_curator-tasks-file_group-FileGroupTask) | Task representing a group of files to be read. |

### API

```python
class nemo_curator.tasks.file_group.FileGroupTask(
    task_id: str,
    dataset_name: str,
    data: list[str] = list(),
    _stage_perf: list[nemo_curator.utils.performance_utils.StagePerfStats] = list(),
    _metadata: dict[str, typing.Any] = dict(),
    reader_config: dict[str, typing.Any] = dict()
)
```

Dataclass

**Bases:** [Task\[list\[str\]\]](/nemo-curator/nemo_curator/tasks/tasks#nemo_curator-tasks-tasks-Task)

Task representing a group of files to be read.
This is created during the planning phase and passed to reader stages.

Number of files in this group.

```python
nemo_curator.tasks.file_group.FileGroupTask.validate() -> bool
```

Validate the task data.
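
To illustrate the shape of such a task, here is a minimal, self-contained sketch. It does not import `nemo_curator`: `FileGroupTaskSketch` is a hypothetical stand-in that mirrors only the documented fields (`_stage_perf` is omitted for brevity). The `file_count` property name and the `validate()` rule (at least one non-empty path) are assumptions made for illustration, since this page does not specify them. The file names and the `reader_config` contents are likewise made up.

```python
from dataclasses import dataclass, field
from typing import Any


@dataclass
class FileGroupTaskSketch:
    """Hypothetical stand-in mirroring the documented fields; not the library class."""

    task_id: str
    dataset_name: str
    # default_factory gives each instance its own list/dict.
    data: list[str] = field(default_factory=list)
    _metadata: dict[str, Any] = field(default_factory=dict)
    reader_config: dict[str, Any] = field(default_factory=dict)

    @property
    def file_count(self) -> int:
        # Assumed name: the page only says "Number of files in this group."
        return len(self.data)

    def validate(self) -> bool:
        # Assumed rule: valid when the group lists at least one non-empty path.
        return bool(self.data) and all(isinstance(p, str) and p for p in self.data)


# A planning step might emit one task per group of input files.
task = FileGroupTaskSketch(
    task_id="file_group_0",
    dataset_name="example_dataset",
    data=["part-000.jsonl", "part-001.jsonl"],
    reader_config={"fields": ["text"]},  # illustrative options only
)
```

With the real class, the analogous construction would pass the same keyword arguments to `FileGroupTask`; check the library source for the actual validation rule before relying on it.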