Nemotron-Parse
Nemotron-Parse
Nemotron-Parse-v1.1 is NVIDIA’s document parsing VLM, specializing in extracting structured information from complex documents including tables, forms, and mixed-content PDFs.
Available Models
- Nemotron-Parse-v1.1
Architecture
NemotronParseForConditionalGeneration
Example HF Models
Example Recipes
Try with NeMo AutoModel
1. Install (full instructions):
2. Clone the repo to get the example recipes:
3. Run the recipe from inside the repo:
Run with Docker
1. Pull the container and mount a checkpoint directory:
2. Navigate to the AutoModel directory (where the recipes are):
3. Run the recipe:
See the Installation Guide and VLM Fine-Tuning Guide.
Fine-Tuning Tutorial on Brev
Launch the end-to-end Nemotron Parse fine-tuning tutorial on Brev with a single click:
See also the tutorial notebook and the VLM Fine-Tuning Guide.