NeMo TTS Collection API#
Model Classes#
Mel-Spectrogram Generators#
Speech-to-Text Aligner Models#
Two-Stage Models#
Vocoders#
Base Classes#
The classes below are the base of the TTS pipeline. To read more about them, see the Base Classes section of the intro page.
Dataset Processing Classes#
- class nemo.collections.tts.data.dataset.MixerTTSXDataset(*args: Any, **kwargs: Any)[source]#
Bases:
TTSDataset