Use the NeMo Guardrails Python APIs | NVIDIA NeMo Guardrails Library Developer Guide

This section covers how to use the NeMo Guardrails library Python API to run guardrailed inference and integrate the guardrails into your application.

Overview

RailsConfig and LLMRails core classes for generating guarded responses.

Concept

Core Classes

RailsConfig and LLMRails class reference for loading and running guardrails.

Reference

Generation Options

Configure logging, LLM parameters, and rail selection for generation.

Reference

Streaming

Stream LLM responses in real-time with the stream_async method.

Tutorial

Check Messages

Validate messages against input and output rails using check_async and check methods.

Reference

Event-Based API

Use generate_events for low-level control over guardrails execution.

Reference