Speech Data Explorer¶

Note

Dash-based tool for interactive exploration of ASR/TTS datasets.

Features:

dataset’s statistics (alphabet, vocabulary, duration-based histograms)
navigation across dataset (sorting, filtering)
inspection of individual utterances (waveform, spectrogram, audio player)
errors’ analysis (Word Error Rate, Character Error Rate, Word Match Rate, Mean Word Accuracy, diff)

Please make sure that requirements are installed. Then run:

python data_explorer.py path_to_manifest.json

JSON manifest file should contain the following fields:

Errors’ analysis requires “pred_text” (ASR transcript) for all utterances.

Any additional field will be parsed and displayed in ‘Samples’ tab.