Ministral3 / Devstral
Ministral3 / Devstral
Ministral is Mistral AI’s efficient small model series optimized for on-device and edge use cases. Devstral is a code-focused model built on the same architecture, designed for software engineering agents.
Both use the Mistral3ForConditionalGeneration architecture.
Available Models
Ministral3:
- Ministral-3-3B-Instruct-2512
- Ministral-3-8B-Instruct-2512
- Ministral-3-14B-Instruct-2512
Devstral:
- Devstral-Small-2-24B-Instruct-2512
Architecture
Mistral3ForConditionalGeneration
Example HF Models
Try with NeMo AutoModel
1. Install (full instructions):
2. Clone the repo to get the example recipes:
3. Run the recipe from inside the repo:
Run with Docker
1. Pull the container and mount a checkpoint directory:
2. Navigate to the AutoModel directory (where the recipes are):
3. Run the recipe:
See the Installation Guide and LLM Fine-Tuning Guide.
Fine-Tuning
See the LLM Fine-Tuning Guide.