NeMo Guardrails Microservice for Production#

The NeMo Guardrails microservice in the NeMo microservices platform is the production-grade, high availability, scalable, end-to-end data flywheel deployment option, available for the NVIDIA AI Enterprise plan.

The NeMo microservices platform provides a fully-featured guardrails microservice that provides the same capabilities as the NeMo Guardrails library through its own API, as the guardrails microservice is built on top of the library.

Capability

NeMo Guardrails Library API Server

NeMo Guardrails Microservice

REST API for guardrailed chat completions

Guardrails configuration management

By managing the config folder structure

Through the platform’s config API endpoints

NeMo microservices platform integration

No need

Required

Production-grade reliability

Limited (self-managed and no high availability support out of the box)

Supported by the platform

Support

Open source community

NVIDIA AI Enterprise plan

When to Use Each Option#

Use the following guidelines to decide when to use each option.

NeMo Guardrails Library API Server#

The Guardrails API server included in the NeMo Guardrails library is suitable for:

  • Direct integration of guardrails into your application.

  • Proof-of-concept deployments.

  • Development and testing environments.

  • Evaluating guardrails configurations before production.

For this option, refer to the following topics:

NeMo Guardrails Microservice#

The NeMo Guardrails microservice is recommended for:

  • Production deployments requiring enterprise-grade reliability.

  • Organizations already using or planning to adopt the NeMo microservices platform.

Important

The NeMo Guardrails microservice operates within the NeMo microservices platform ecosystem and requires other microservices to function. It is not a standalone solution.

For deployment instructions and API documentation for the NeMo Guardrails microservice, refer to NVIDIA NeMo Microservices Documentation.