Mistral-Small-4
Mistral-Small-4
Mistral-Small-4-119B is Mistral AI’s multimodal MoE model supporting both text and image inputs at scale.
Available Models
- Mistral-Small-4-119B-2603
Architecture
MistralForConditionalGeneration
Example HF Models
Example Recipes
Try with NeMo AutoModel
1. Install (full instructions):
2. Clone the repo to get the example recipes:
This recipe was validated on 4 nodes × 8 GPUs (32 H100s). See the Launcher Guide for multi-node setup.
3. Run the recipe from inside the repo:
Run with Docker
1. Pull the container and mount a checkpoint directory:
2. Navigate to the AutoModel directory (where the recipes are):
3. Run the recipe:
See the Installation Guide and VLM Fine-Tuning Guide.
Fine-Tuning
See the VLM Fine-Tuning Guide.