NeMo Platform#

Introduction#

NeMo Platform gives you the infrastructure to build and deploy specialized AI agents with open source models. It provides synthetic data generation, model fine-tuning and evaluation, security testing, real-time protection with guardrails, and inference. Production-grade features include RBAC and observability. Deploy locally with Docker or on Kubernetes, integrate with your existing tools, and customize models for your specific use cases while maintaining control over your AI stack.

Common use cases#

  • Customize and evaluate models — Generate synthetic training data, fine-tune models, and measure quality. See Example Applications for workflows like creating text-to-code datasets and fine-tuning with synthetic data.

  • Deploy and serve models — Run inference through the unified gateway and integrate with your existing infrastructure. See About Models and Inference and the Quickstart Installation for deployment examples.

  • Test and protect AI agents — Scan for vulnerabilities with Auditor, then block attacks in real-time with Guardrails.

  • Build RAG and search applications — Fine-tune embedding models for domain-specific retrieval and evaluate with RAG metrics.

Getting up and running#

Prerequisites:

Install the CLI and SDK:

pip install nemo-platform

Start the Quickstart Installation (local platform):

nmp quickstart up

The quickstart prompts for your NGC API key, then downloads and starts the platform. Once complete:

List available models and other key commands:

nmp models list              # See available models
nmp chat --help              # Chat options
nmp workspaces list          # View your workspaces
nmp --help                   # All commands

For full installation steps, GPU config, and SDK usage, see Quickstart Installation; for all commands, see NeMo Platform CLI.

Before you start#

Workspaces — All platform resources (models, datasets, jobs, evaluation results) belong to a workspace. Workspaces provide organizational and authorization boundaries—create separate workspaces to isolate teams, users, environments, or clients. The platform includes two built-in workspaces: default (general-purpose, editable by all) and system (read-only platform resources). When authentication is enabled, users are granted roles (Viewer, Editor, or Admin) within specific workspaces. See Workspaces for creating and managing workspaces.

Projects — Group related resources with projects. Projects are organizational tags within a workspace, useful for fine-tuning experiments, evaluation campaigns, or other work within a team. Access control applies at the workspace level, not per project.

Entities — Models, datasets, jobs, and configurations are entities—the shared data objects that power platform services. See the entities page for how they’re stored, scoped, and used.

Where to go next#

Start building:

Learn the platform:

Deploy to production:

Architecture#

NeMo Platform architecture diagram showing the platform layers: client interfaces (CLI, SDK, Studio UI) at the top, the platform core with API gateway, microservices (Customizer, Evaluator, Guardrails, Data Designer, Safe Synthesizer, Auditor, and more), entity storage, and NIM inference endpoints. Kubernetes and Docker deployment targets are shown on the left, with external integrations on the right.

Figure 1 Click to view full size.#