NVIDIA Blueprints are comprehensive reference workflows built with NVIDIA AI libraries, SDKs, and microservices for speeding up the deployment of AI solutions. This toolkit uses the PDF to podcast blueprint that leverages large language models (LLMs), text-to-speech, and NVIDIA NIM microservices to build a generative AI application that transforms PDF data into audio content.

NVIDIA NIM

NVIDIA NIM provides containers to self-host GPU-accelerated inferencing microservices for pretrained and customized AI models across clouds and data centers. NIM microservices expose industry-standard APIs for simple integration into AI applications, development frameworks, and workflows. Built on pre-optimized inference engines from NVIDIA and the community, including NVIDIA® TensorRT™ and TensorRT-LLM, NIM microservices optimize response latency and throughput for each combination of foundation model and GPU. NVIDIA NIM for Developer is the edition used in this toolkit.

The NIM microservices used in this toolkit: