core.tokenizers.text.parsers.base_parser#

Module Contents#

Classes#

BaseParser

Base class for text parsers.

API#

class core.tokenizers.text.parsers.base_parser.BaseParser#

Base class for text parsers.

static parse(text: str, **kwargs) tuple[str, dict[str, Any]]#

Parses the text into a tuple containing extracted content and a dictionary of additional information.

Parameters:

text (str) – The text to parse.

Returns:

A tuple containing the unprocessed text and a dictionary with the extracted information.

Return type:

tuple[str, dict[str, Any]]