Speech AI Tools

User Guide (Latest Version)

NeMo provides a set of tools useful for developing Automatic Speech Recognitions (ASR) and Text-to-Speech (TTS) synthesis models: https://github.com/NVIDIA/NeMo/tree/stable/tools .

There are also additional NeMo-related tools hosted in separate github repositories:

Previous Grapheme-to-Phoneme Models
Next NeMo Forced Aligner (NFA)
© | | | | | | |. Last updated on Jun 24, 2024.