nv_ingest.framework.orchestration.morpheus.stages.transforms package#

Submodules#

nv_ingest.framework.orchestration.morpheus.stages.transforms.embed_text_stage module#

nv_ingest.framework.orchestration.morpheus.stages.transforms.embed_text_stage.generate_text_embed_extractor_stage(
c: Any,
transform_config: Dict[str, Any],
task: str = 'embed',
task_desc: str = 'text_embed_extraction',
pe_count: int = 1,
)[source]#

Generates a multiprocessing stage to perform text embedding extraction from a pandas DataFrame.

Parameters:
  • c (Any) – Global configuration object.

  • transform_config (Dict[str, Any]) – Configuration parameters for the text embedding extractor, validated against EmbedExtractionsSchema.

  • task (str, optional) – The task name for the stage worker function (default is “embed”).

  • task_desc (str, optional) – A descriptor used for latency tracing and logging (default is “text_embed_extraction”).

  • pe_count (int, optional) – Number of process engines to use concurrently (default is 1).

Returns:

A configured stage that processes a pandas DataFrame and returns a tuple of (DataFrame, trace_info dict).

Return type:

MultiProcessingBaseStage

nv_ingest.framework.orchestration.morpheus.stages.transforms.image_caption_extraction module#

nv_ingest.framework.orchestration.morpheus.stages.transforms.image_caption_extraction.generate_caption_extraction_stage(
c: Config,
transform_config: Dict[str, Any],
task: str = 'caption',
task_desc: str = 'caption_extraction',
pe_count: int = 8,
)[source]#

Generates a caption extraction stage with the specified configuration.

Parameters:
  • c (Config) – Morpheus global configuration object.

  • transform_config (dict) – Configuration parameters for caption extraction.

  • task (str, optional) – The task name to match for the stage worker function, by default “caption”.

  • task_desc (str, optional) – A descriptor to be used in latency tracing, by default “caption_extraction”.

  • pe_count (int, optional) – Number of processing elements to use, by default 8.

Returns:

The generated caption extraction stage.

Return type:

MultiProcessingBaseStage

Raises:

ValueError – If an error occurs during stage generation.

Module contents#

nv_ingest.framework.orchestration.morpheus.stages.transforms.generate_caption_extraction_stage(
c: Config,
transform_config: Dict[str, Any],
task: str = 'caption',
task_desc: str = 'caption_extraction',
pe_count: int = 8,
)[source]#

Generates a caption extraction stage with the specified configuration.

Parameters:
  • c (Config) – Morpheus global configuration object.

  • transform_config (dict) – Configuration parameters for caption extraction.

  • task (str, optional) – The task name to match for the stage worker function, by default “caption”.

  • task_desc (str, optional) – A descriptor to be used in latency tracing, by default “caption_extraction”.

  • pe_count (int, optional) – Number of processing elements to use, by default 8.

Returns:

The generated caption extraction stage.

Return type:

MultiProcessingBaseStage

Raises:

ValueError – If an error occurs during stage generation.