modifiers.markdown_remover#

Module Contents#

Classes#

MarkdownRemover

Removes Markdown formatting in a document including bold, italic, underline, and URL text.

Data#

API#

modifiers.markdown_remover.MARKDOWN_BOLD_REGEX#

‘\\(.?)\\*’

modifiers.markdown_remover.MARKDOWN_ITALIC_REGEX#

‘\(.?)\*’

‘\[.?\]\((.?)\)’

modifiers.markdown_remover.MARKDOWN_UNDERLINE_REGEX#

(.*?)

class modifiers.markdown_remover.MarkdownRemover#

Bases: nemo_curator.modifiers.DocumentModifier

Removes Markdown formatting in a document including bold, italic, underline, and URL text.

Initialization

modify_document(text: str) str#