API ReferenceFull Library ReferenceNemo CuratorNemo CuratorStagesTextDownloadHtml Extractorsnemo_curator.stages.text.download.html_extractors.resiliparseAsk a question|Copy page|View as Markdown|More actionsModule Contents Classes NameDescriptionResiliparseExtractor- API class nemo_curator.stages.text.download.html_extractors.resiliparse.ResiliparseExtractor( required_stopword_density: float = 0.32, main_content: bool = True, alt_texts: bool = False) Bases: HTMLExtractorAlgorithmnemo_curator.stages.text.download.html_extractors.resiliparse.ResiliparseExtractor.extract_text( html: str, stop_words: frozenset[str], language: str) -> list[str] | None