nv_ingest_api.internal.extract.audio package

Submodules

nv_ingest_api.internal.extract.audio.audio_extraction module

nv_ingest_api.internal.extract.audio.audio_extraction.extract_text_from_audio_internal(df_extraction_ledger: DataFrame, task_config: Dict[str, Any], extraction_config: AudioExtractorSchema, execution_trace_log: Dict | None = None) → Tuple[DataFrame, Dict]

Extracts text from the audio content held in a DataFrame.

Parameters:

df_extraction_ledger (pd.DataFrame) – DataFrame containing the content from which audio data is to be extracted.
task_config (Dict[str, Any]) – Dictionary containing task properties and configurations.
extraction_config (AudioExtractorSchema) – The validated configuration object for audio extraction.
execution_trace_log (Optional[Dict], optional) – Optional trace information for debugging or logging. Defaults to None.

Returns:

A tuple containing the updated DataFrame and the trace information.

Return type:

Tuple[pd.DataFrame, Dict]

Raises:

Exception – If any error occurs during the audio data extraction process.
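The calling convention described above (an optional trace dictionary that defaults to None, a two-element tuple as the return value, and an exception on failure) can be sketched with a stand-in function. This is only an illustration under stated assumptions: `extract_stub`, the `Rows` alias, the `processed` and `rows_processed` keys, and the sample row are all hypothetical. A list of dictionaries stands in for the DataFrame so that the sketch runs without pandas or nv_ingest_api installed, and no real transcription is performed.

```python
from typing import Any, Dict, List, Optional, Tuple

# Hypothetical stand-in for pd.DataFrame: one dict per ledger row.
Rows = List[Dict[str, Any]]


def extract_stub(
    rows: Rows,
    task_config: Dict[str, Any],
    extraction_config: Any,
    execution_trace_log: Optional[Dict] = None,
) -> Tuple[Rows, Dict]:
    """Hypothetical stub that mirrors the documented signature shape only."""
    # Start a fresh trace when the caller does not supply one.
    trace = {} if execution_trace_log is None else execution_trace_log
    # Placeholder for the real work: mark every row as processed.
    updated = [dict(row, processed=True) for row in rows]
    trace["rows_processed"] = len(updated)
    return updated, trace


def run_example() -> Tuple[Rows, Dict]:
    # Hypothetical ledger row; the real column layout is not shown in this page.
    ledger = [{"content": "<audio payload>"}]
    try:
        # Unpack the (updated data, trace) tuple described under "Returns".
        updated, trace = extract_stub(ledger, task_config={}, extraction_config=None)
    except Exception as exc:
        # The documented function raises Exception on any failure, so a
        # caller would typically catch broadly and add context.
        raise RuntimeError("audio extraction failed") from exc
    return updated, trace
```

With the real function, the first argument would be a pandas DataFrame and `extraction_config` an AudioExtractorSchema instance; the unpacking and error-handling pattern would otherwise stay the same.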

Module contents