modifiers.c4
#
Module Contents#
Classes#
If the sentence contains any of the boilerplate strings then discard. This includes things like “terms of use”, “privacy policy”, etc. Source: Adapted significantly from Google C4 processing. |
API#
- class modifiers.c4.BoilerPlateStringModifier(remove_if_at_top_or_bottom: bool = True)#
Bases:
nemo_curator.modifiers.doc_modifier.DocumentModifier
If the sentence contains any of the boilerplate strings then discard. This includes things like “terms of use”, “privacy policy”, etc. Source: Adapted significantly from Google C4 processing.
Initialization
- modify_document(text: str) str #