> For clean Markdown of any page, append .md to the page URL.
> For a complete documentation index, see https://docs.nvidia.com/nemo/curator/llms.txt.
> For full documentation content, see https://docs.nvidia.com/nemo/curator/llms-full.txt.
> For AI client integration (Claude Code, Cursor, etc.), connect to the MCP server at https://docs.nvidia.com/nemo/curator/_mcp/server.

# nemo_curator.tasks.file_group

## Module Contents

### Classes

| Name                                                            | Description                                    |
| --------------------------------------------------------------- | ---------------------------------------------- |
| [`FileGroupTask`](#nemo_curator-tasks-file_group-FileGroupTask) | Task representing a group of files to be read. |

### API

```python
class nemo_curator.tasks.file_group.FileGroupTask(
    task_id: str,
    dataset_name: str,
    data: list[str] = list(),
    _stage_perf: list[nemo_curator.utils.performance_utils.StagePerfStats] = list(),
    _metadata: dict[str, typing.Any] = dict(),
    reader_config: dict[str, typing.Any] = dict()
)
```

Dataclass

**Bases:** [Task\[list\[str\]\]](/nemo-curator/nemo_curator/tasks/tasks#nemo_curator-tasks-tasks-Task)

Task representing a group of files to be read.
This is created during the planning phase and passed to reader stages.

Number of files in this group.

```python
nemo_curator.tasks.file_group.FileGroupTask.validate() -> bool
```

Validate the task data.