Important
NeMo 2.0 is an experimental feature and currently released in the dev container only: nvcr.io/nvidia/nemo:dev. Please refer to NeMo 2.0 overview for information on getting started.
Resources and Documentation
Hands-on TTS tutorial notebooks can be found under the TTS tutorials folder. If you are a beginner to NeMo, consider trying out the tutorials of NeMo Primer and NeMo Model. If you are also a beginner to TTS, consider trying out the NeMo TTS Primer Tutorial. These tutorials can be run on Google Colab by specifying the link to the notebooks’ GitHub pages on Colab.
If you are looking for information about a particular TTS model, or would like to find out more about the model architectures available in the directory of nemo.collections.tts, refer to the Models section.
NeMo includes preprocessing scripts for several common TTS datasets. The Data Preprocessing section contains instructions on how to run those scripts. You can also creating your own NeMo-compatible dataset preprocessing script by following the guidance.
Information about how to load model checkpoints (either local files or pretrained ones from NGC), as well as a list of the checkpoints available on NGC are located on the Checkpoints section.
Documentation regarding the configuration files specific to the NeMo TTS models can be found on the Configuration Files section.