nv_ingest.extraction_workflows.pptx package#
Submodules#
nv_ingest.extraction_workflows.pptx.pptx_helper module#
- nv_ingest.extraction_workflows.pptx.pptx_helper.format_text(text: str, bold: bool = False, italic: bool = False, underline: bool = False)
- nv_ingest.extraction_workflows.pptx.pptx_helper.get_bbox(presentation_object: Presentation | None = None, shape_object: Slide | None = None, text_depth: TextTypeEnum | None = None)
- nv_ingest.extraction_workflows.pptx.pptx_helper.python_pptx(pptx_stream, extract_text: bool, extract_images: bool, extract_tables: bool, extract_charts: bool, **kwargs)
Helper function that uses python-pptx to extract text from a PPTX byte stream, deferring the classification of images into tables or charts when requested.
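The call shape implied by this signature can be illustrated with a small wrapper. This is only a sketch: `extract_pptx_bytes` is a hypothetical helper, not part of nv_ingest, and the default flag values are assumptions chosen for the example. The extractor is passed in as a parameter so the snippet runs without nv_ingest installed.

```python
import io
from typing import Any, Callable


def extract_pptx_bytes(
    data: bytes,
    extractor: Callable[..., Any],
    *,
    extract_text: bool = True,      # assumed default, for illustration only
    extract_images: bool = False,   # assumed default
    extract_tables: bool = False,   # assumed default
    extract_charts: bool = False,   # assumed default
    **kwargs: Any,
) -> Any:
    """Wrap raw PPTX bytes in a seekable stream and pass them to `extractor`.

    `extractor` is expected to accept the same arguments as the documented
    `python_pptx` signature: a stream, four boolean flags, and **kwargs.
    """
    stream = io.BytesIO(data)  # a bytestream, as the description calls for
    return extractor(
        stream,
        extract_text=extract_text,
        extract_images=extract_images,
        extract_tables=extract_tables,
        extract_charts=extract_charts,
        **kwargs,
    )
```

With nv_ingest installed, `python_pptx` imported from `nv_ingest.extraction_workflows.pptx` could presumably be passed as `extractor`. Any additional keyword arguments it expects through `**kwargs` are not documented on this page and would have to be supplied by the caller.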
Module contents#
- nv_ingest.extraction_workflows.pptx.python_pptx(pptx_stream, extract_text: bool, extract_images: bool, extract_tables: bool, extract_charts: bool, **kwargs)
Helper function that uses python-pptx to extract text from a PPTX byte stream, deferring the classification of images into tables or charts when requested.