NVIDIA NeMo Framework User Guide

Vision-Language Foundation Models

CLIP
Vision Transformer
NSFW Content Filter

Copyright © 2023-2024, NVIDIA Corporation.

Last updated on Jul 24, 2024.