Important

NeMo 2.0 is an experimental feature and currently released in the dev container only: nvcr.io/nvidia/nemo:dev. Please refer to the Migration Guide for information on getting started.

Datasets

Any dataset available in NeMo for ASR (ASR datasets) can be used for SSL. To create your own NeMo compatible datasets, refer to Preparing Custom ASR Data section. Note that explicit labels (transcriptions) are not utilized in SSL and hence are optional when creating datasets for SSL.