nv_ingest_api.internal.extract.audio package
Submodules
nv_ingest_api.internal.extract.audio.audio_extraction module
- nv_ingest_api.internal.extract.audio.audio_extraction.extract_text_from_audio_internal(
    df_extraction_ledger: DataFrame,
    task_config: Dict[str, Any],
    extraction_config: AudioExtractorSchema,
    execution_trace_log: Dict | None = None,
  )
Extracts text from the audio content held in a DataFrame.
- Parameters:
df_extraction_ledger (pd.DataFrame) – DataFrame containing the content from which audio data is to be extracted.
task_config (Dict[str, Any]) – Dictionary containing task properties and configurations.
extraction_config (AudioExtractorSchema) – The validated configuration object for audio extraction.
execution_trace_log (Optional[Dict], optional) – Trace information for debugging or logging. Defaults to None.
- Returns:
A tuple containing the updated DataFrame and the trace information.
- Return type:
Tuple[pd.DataFrame, Dict]
- Raises:
Exception – If any error occurs during the audio data extraction process.
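The calling convention documented above (four positional arguments in, a two-element tuple out, a generic Exception on failure) can be sketched as a small wrapper. This is only an illustration under stated assumptions: `run_extraction` and `fake_extractor` are hypothetical names that are not part of nv_ingest_api, the stand-in uses a plain list where the real function expects a pandas DataFrame, and the stand-in's handling of a missing trace log (starting from an empty dict) is a guess rather than documented behavior.

```python
from typing import Any, Callable, Dict, Optional, Tuple


def run_extraction(
    extract_fn: Callable[..., Tuple[Any, Dict]],
    ledger: Any,
    task_config: Dict[str, Any],
    extraction_config: Any,
    trace_log: Optional[Dict] = None,
) -> Tuple[Any, Dict]:
    """Hypothetical helper: call an extractor using the documented argument order."""
    try:
        # The documented return value is a (DataFrame, trace) tuple, so unpack it.
        updated_ledger, trace = extract_fn(
            ledger, task_config, extraction_config, trace_log
        )
    except Exception as exc:
        # The reference only promises a generic Exception, so catch broadly
        # and re-raise with context while preserving the original cause.
        raise RuntimeError(f"audio extraction failed: {exc}") from exc
    return updated_ledger, trace


def fake_extractor(
    df_extraction_ledger: Any,
    task_config: Dict[str, Any],
    extraction_config: Any,
    execution_trace_log: Optional[Dict] = None,
) -> Tuple[Any, Dict]:
    """Hypothetical stand-in that mirrors the documented signature only."""
    # Assumption: start from an empty trace when none is supplied.
    trace = {} if execution_trace_log is None else execution_trace_log
    trace["rows_seen"] = len(df_extraction_ledger)
    return df_extraction_ledger, trace


if __name__ == "__main__":
    ledger = [{"content": "clip-1"}, {"content": "clip-2"}]
    updated, trace = run_extraction(fake_extractor, ledger, {}, None)
    print(len(updated), trace)
```

With nv_ingest_api installed, the same wrapper could presumably be given `extract_text_from_audio_internal` in place of `fake_extractor`, together with a real DataFrame and a validated AudioExtractorSchema instance; that substitution is not exercised here.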