NeMo Speaker Diarization API

Model Classes

class nemo.collections.asr.models.ClusteringDiarizer(cfg: omegaconf.DictConfig)

diarize(paths2audio_files: Optional[List[str]] = None, batch_size: int = 1)
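As a minimal sketch of how the constructor and diarize() fit together, the helper below builds a ClusteringDiarizer from a configuration and forwards the two documented arguments. The helper name run_diarization is hypothetical, the contents of cfg are not described in this section and must be supplied by the caller, and the return value is passed through without assuming its type. The NeMo import is deferred into the function body so that defining the helper does not require NeMo to be installed.

```python
from typing import List, Optional


def run_diarization(cfg, audio_paths: Optional[List[str]] = None, batch_size: int = 1):
    """Hypothetical helper: construct a ClusteringDiarizer and call diarize().

    `cfg` is expected to be an omegaconf.DictConfig prepared by the caller;
    its required fields are not covered here.
    """
    # Deferred import: NeMo is only needed when the helper is actually called.
    from nemo.collections.asr.models import ClusteringDiarizer

    diarizer = ClusteringDiarizer(cfg=cfg)
    # Keyword names follow the documented diarize() signature.
    return diarizer.diarize(paths2audio_files=audio_paths, batch_size=batch_size)
```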

classmethod list_available_models()

Lists all pre-trained models available via the NVIDIA NGC cloud.

classmethod restore_from(restore_path: str, override_config_path: Optional[str] = None, map_location: Optional[torch.device] = None, strict: bool = False)

Restores a module/model with its weights.

save_to(save_path: str)

Saves the model instance (weights and configuration) into an EFF archive or a .nemo file. You can use the restore_from method to fully restore the instance from a .nemo file.

A .nemo file is an archive (tar.gz) containing the following:

model_config.yaml - the model configuration in .yaml format. You can deserialize it into the cfg argument of the model's constructor.
model_weights.ckpt - the model checkpoint.

Parameters: save_path – Path to .nemo file where model instance should be saved
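Because a .nemo file is a tar.gz archive, its contents can be inspected with Python's standard tarfile module. The sketch below is an illustration under stated assumptions, not NeMo code: it writes a mock archive with placeholder contents and then lists its members. The helper names, the placeholder payloads, and the exact member names (modeled on the two entries described above) are assumptions; archives written by save_to may name or prefix their members differently.

```python
import io
import tarfile
import tempfile
from pathlib import Path


def make_mock_nemo(nemo_path: str) -> None:
    """Write a stand-in tar.gz archive with placeholder files (not a real model)."""
    files = {
        "model_config.yaml": b"placeholder: true\n",  # assumed file name
        "model_weights.ckpt": b"placeholder",         # assumed file name
    }
    with tarfile.open(nemo_path, "w:gz") as archive:
        for name, payload in files.items():
            info = tarfile.TarInfo(name=name)
            info.size = len(payload)
            archive.addfile(info, io.BytesIO(payload))


def list_nemo_members(nemo_path: str) -> list:
    """Return the sorted member names stored in a tar.gz archive."""
    with tarfile.open(nemo_path, "r:gz") as archive:
        return sorted(archive.getnames())


with tempfile.TemporaryDirectory() as tmp:
    mock_path = str(Path(tmp) / "mock_diarizer.nemo")
    make_mock_nemo(mock_path)
    members = list_nemo_members(mock_path)
    print(members)
```

The same list_nemo_members call can be pointed at a file produced by save_to to check which configuration and checkpoint files it actually contains.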

class nemo.collections.asr.parts.mixins.DiarizationMixin

Bases: abc.ABC

abstract diarize(paths2audio_files: List[str], batch_size: int = 1) → List[str]

Takes paths to audio files and returns speaker labels.

Parameters: paths2audio_files – paths to the audio files to be diarized
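To illustrate what implementing this abstract interface involves, the sketch below defines a local stand-in class with the same diarize() signature and a toy subclass. Both class names are hypothetical, the stand-in is not the NeMo class itself, and the constant "speaker_0" label is a placeholder rather than real diarization output.

```python
from abc import ABC, abstractmethod
from typing import List


class DiarizationMixinSketch(ABC):
    """Local stand-in mirroring the documented abstract interface."""

    @abstractmethod
    def diarize(self, paths2audio_files: List[str], batch_size: int = 1) -> List[str]:
        """Take paths to audio files and return speaker labels."""


class ConstantLabelDiarizer(DiarizationMixinSketch):
    """Hypothetical toy implementation: one placeholder label per input file."""

    def diarize(self, paths2audio_files: List[str], batch_size: int = 1) -> List[str]:
        labels: List[str] = []
        # Walk the inputs in chunks of batch_size, as a real model might.
        for start in range(0, len(paths2audio_files), batch_size):
            batch = paths2audio_files[start:start + batch_size]
            labels.extend("speaker_0" for _ in batch)
        return labels


toy = ConstantLabelDiarizer()
labels = toy.diarize(["a.wav", "b.wav", "c.wav"], batch_size=2)
print(labels)
```

Because diarize() is abstract, the base class cannot be instantiated directly; only subclasses that override the method can.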