Important

You are viewing the NeMo 2.0 documentation. This release introduces significant changes to the API and a new library, NeMo Run. We are currently porting all features from NeMo 1.0 to 2.0. For documentation on previous versions or features not yet available in 2.0, please refer to the NeMo 24.07 documentation.

Datasets

Input data should be provided as a line-delimited JSON manifest, with one JSON object per line, as in the example below:

{"audio_filepath": "/path/to/abcd.wav", "offset": 0, "duration": 10.1, "text": "{'scenario': 'Calendar', 'action': 'Create_entry', 'entities': [{'type': 'event_name', 'filler': 'brunch'}, {'type': 'date', 'filler': 'Saturday'}, {'type': 'timeofday', 'filler': 'morning'}, {'type': 'person', 'filler': 'Aronson'}]}"}

The semantics annotation is a Python dictionary flattened into a string and stored under the “text” key of the manifest entry. A semantics annotation has three mandatory keys: “scenario”, “action” and “entities”. The values for “scenario” and “action” are strings, whereas the value for “entities” is a Python list of dictionaries. Each item in “entities” is itself a Python dictionary with two keys: “type” (the entity slot) and “filler” (the slot filler).
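The structure described above can be sketched in plain Python. The helper names `make_manifest_line` and `parse_semantics` are hypothetical and are not part of the NeMo API; the sketch uses only the standard library. It assumes that the flattened annotation is the `str()` form of the dictionary, which matches the single-quoted style of the example entry, so that `ast.literal_eval` can recover it.

```python
import ast
import json

# Mandatory keys of a semantics annotation, as described above.
REQUIRED_KEYS = {"scenario", "action", "entities"}
ENTITY_KEYS = {"type", "filler"}


def make_manifest_line(audio_filepath, duration, semantics, offset=0):
    """Build one manifest line (hypothetical helper, not a NeMo function).

    The semantics dictionary is flattened with str(), which yields the
    Python-literal form used in the "text" field.
    """
    entry = {
        "audio_filepath": audio_filepath,
        "offset": offset,
        "duration": duration,
        "text": str(semantics),
    }
    return json.dumps(entry)


def parse_semantics(manifest_line):
    """Recover and validate the semantics dictionary from one manifest line."""
    entry = json.loads(manifest_line)
    # The "text" value is a Python literal, not JSON, so use literal_eval.
    semantics = ast.literal_eval(entry["text"])
    if not isinstance(semantics, dict):
        raise ValueError("semantics annotation must be a dictionary")
    missing = REQUIRED_KEYS - semantics.keys()
    if missing:
        raise ValueError(f"missing mandatory keys: {sorted(missing)}")
    if not isinstance(semantics["scenario"], str):
        raise ValueError("'scenario' must be a string")
    if not isinstance(semantics["action"], str):
        raise ValueError("'action' must be a string")
    if not isinstance(semantics["entities"], list):
        raise ValueError("'entities' must be a list")
    for entity in semantics["entities"]:
        if not isinstance(entity, dict) or not ENTITY_KEYS <= entity.keys():
            raise ValueError("each entity needs 'type' and 'filler' keys")
    return semantics


if __name__ == "__main__":
    line = make_manifest_line(
        "/path/to/abcd.wav",
        10.1,
        {
            "scenario": "Calendar",
            "action": "Create_entry",
            "entities": [{"type": "event_name", "filler": "brunch"}],
        },
    )
    print(line)
    print(parse_semantics(line))
```

Running a check like `parse_semantics` over every line of a manifest before training is one way to catch malformed annotations early; the strictness of the validation here is a design choice of this sketch rather than a documented NeMo requirement.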