Speech Intent Classification and Slot Filling

Intent Classification and Slot Filling aims to not only recognize the user’s intention, but also detect entity slots and their corresponding lexical fillers. Below is an example:


Different from its counterpart in Natural Language Understanding (NLU) that takes text as input, here the model predicts the semantics directly from audio input.

The full documentation tree is as follows:

Example of Speech Intent Classification and Slot Filling can be found here.

Information about how to load model checkpoints (either local files or pretrained ones from NGC), as well as a list of the checkpoints available on NGC are located on the Checkpoints page.

Documentation regarding the configuration files specific to SLU can be found in the Configuration Files page.

Previous Resources and Documentation
Next Models
© Copyright 2023-2024, NVIDIA. Last updated on May 17, 2024.