nemo_curator.stages.text.modifiers.doc_modifier

View as Markdown

Module Contents

Classes

NameDescription
DocumentModifierAbstract base class for text-based document modifiers.

API

class nemo_curator.stages.text.modifiers.doc_modifier.DocumentModifier()
Abstract

Abstract base class for text-based document modifiers.

Subclasses must implement modify_document to transform input value(s) and return the modified value. This supports both single-input and multi-input usage:

  • Single input: modify_document(value)
  • Multiple inputs: modify_document(**values) where each input field is expanded as a keyword argument (e.g., modify_document(column_1=..., column_2=...)).
_name
= self.__class__.__name__
name
str
nemo_curator.stages.text.modifiers.doc_modifier.DocumentModifier.modify_document(
args: object = (),
kwargs: object = {}
) -> object
abstract

Transform the provided value(s) and return the result.