stages.text.download.arxiv.url_generation#

Module Contents#

Classes#

ArxivUrlGenerator

Generates URLs for Arxiv data.

API#

class stages.text.download.arxiv.url_generation.ArxivUrlGenerator#

Bases: nemo_curator.stages.text.download.URLGenerator

Generates URLs for Arxiv data.

generate_urls() list[str]#

Generate a list of URLs to download.