NVIDIA Neural Modules (NeMo) is a flexible, Python-based toolkit enabling data scientists and researchers to build state-of-the-art speech and language deep learning models composed of reusable building blocks that can be safely connected together for conversational AI applications.
All NVIDIA NeMo documentation for your reference
This NeMo User Guide focuses on how to get started with NeMo, train quickly, provides tutorials on speech and speaker recognition, as well as voice activity detection how-to's, all about NLP and speech synthesis. NVIDIA NeMo is a framework-agnostic toolkit for building AI applications powered by Neural Modules. Lastly, this User Guide includes the NeMo and NeMo Collections API documentation.