***

layout: overview
slug: nemo-curator/nemo\_curator/tasks/file\_group
title: nemo\_curator.tasks.file\_group
--------------------------------------

## Module Contents

### Classes

| Name                                                            | Description                                    |
| --------------------------------------------------------------- | ---------------------------------------------- |
| [`FileGroupTask`](#nemo_curator-tasks-file_group-FileGroupTask) | Task representing a group of files to be read. |

### API

<Anchor id="nemo_curator-tasks-file_group-FileGroupTask">
  <CodeBlock links={{"nemo_curator.utils.performance_utils.StagePerfStats":"/nemo-curator/nemo_curator/utils/performance_utils#nemo_curator-utils-performance_utils-StagePerfStats"}} showLineNumbers={false} wordWrap={true}>
    ```python
    class nemo_curator.tasks.file_group.FileGroupTask(
        task_id: str,
        dataset_name: str,
        data: list[str] = list(),
        _stage_perf: list[nemo_curator.utils.performance_utils.StagePerfStats] = list(),
        _metadata: dict[str, typing.Any] = dict(),
        reader_config: dict[str, typing.Any] = dict()
    )
    ```
  </CodeBlock>
</Anchor>

<Indent>
  <Badge>
    Dataclass
  </Badge>

  **Bases:** [Task\[list\[str\]\]](/nemo-curator/nemo_curator/tasks/tasks#nemo_curator-tasks-tasks-Task)

  Task representing a group of files to be read.
  This is created during the planning phase and passed to reader stages.

  <ParamField path="data" type="list[str] = field(default_factory=list)" />

  <ParamField path="num_items" type="int">
    Number of files in this group.
  </ParamField>

  <ParamField path="reader_config" type="dict[str, Any] = field(default_factory=dict)" />

  <Anchor id="nemo_curator-tasks-file_group-FileGroupTask-validate">
    <CodeBlock showLineNumbers={false} wordWrap={true}>
      ```python
      nemo_curator.tasks.file_group.FileGroupTask.validate() -> bool
      ```
    </CodeBlock>
  </Anchor>

  <Indent>
    Validate the task data.
  </Indent>
</Indent>
