nv_ingest_api.internal.extract.audio package

Submodules

nv_ingest_api.internal.extract.audio.audio_extraction module

nv_ingest_api.internal.extract.audio.audio_extraction.extract_text_from_audio_internal(
df_extraction_ledger: DataFrame,
task_config: Dict[str, Any],
extraction_config: AudioExtractorSchema,
execution_trace_log: Dict | None = None,
) → Tuple[DataFrame, Dict]

Extracts text from the audio content in a DataFrame.

Parameters:
  • df_extraction_ledger (pd.DataFrame) – DataFrame containing the content from which audio data is to be extracted.

  • task_config (Dict[str, Any]) – Dictionary containing task properties and configurations.

  • extraction_config (AudioExtractorSchema) – The validated configuration object for audio extraction.

  • execution_trace_log (Optional[Dict], optional) – Trace information for debugging or logging. Defaults to None.

Returns:

A tuple containing the updated DataFrame and the trace information.

Return type:

Tuple[pd.DataFrame, Dict]

Raises:

Exception – If any error occurs during the audio data extraction process.
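
The calling convention described above (four arguments, a two-element tuple back, a generic Exception on failure) can be sketched as a small wrapper. This is only an illustration under stated assumptions: run_audio_extraction and _stand_in_extractor are hypothetical names that do not exist in nv_ingest_api, and the stand-in replaces the real function so the sketch runs without the library or real audio data. The keys expected in task_config and the fields of AudioExtractorSchema are not documented here, so they are left as empty placeholders.

```python
from typing import Any, Callable, Dict, Optional, Tuple

# In real use, the extractor would be imported from the documented module path:
# from nv_ingest_api.internal.extract.audio.audio_extraction import (
#     extract_text_from_audio_internal,
# )


def run_audio_extraction(
    extract_fn: Callable[..., Tuple[Any, Dict]],
    df_extraction_ledger: Any,
    task_config: Dict[str, Any],
    extraction_config: Any,
    execution_trace_log: Optional[Dict] = None,
) -> Tuple[Any, Dict, Optional[str]]:
    """Hypothetical helper: call an extractor that follows the documented signature.

    Returns (dataframe, trace, error). error is None on success; on failure the
    original inputs are handed back together with the error message.
    """
    try:
        # Keyword arguments use the parameter names from the documented signature.
        df_out, trace = extract_fn(
            df_extraction_ledger=df_extraction_ledger,
            task_config=task_config,
            extraction_config=extraction_config,
            execution_trace_log=execution_trace_log,
        )
    except Exception as exc:  # the reference only promises a generic Exception
        return df_extraction_ledger, execution_trace_log or {}, str(exc)
    return df_out, trace, None


def _stand_in_extractor(
    df_extraction_ledger, task_config, extraction_config, execution_trace_log=None
):
    """Stand-in for the real function (assumption): echoes its input unchanged."""
    trace = dict(execution_trace_log or {})
    trace["called"] = True
    return df_extraction_ledger, trace


# A plain list stands in for the pandas DataFrame to keep the sketch dependency-free.
result, trace, error = run_audio_extraction(
    _stand_in_extractor, ["placeholder row"], {}, None
)
```

Passing the real extract_text_from_audio_internal as extract_fn, with an actual DataFrame and a validated AudioExtractorSchema instance, would follow the same pattern. Returning the error as a value rather than re-raising is a design choice of this sketch, not behavior of the library.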

Module contents