Guardrail Catalog

View as Markdown

The NeMo Guardrails library ships with a catalog of pre-built guardrails that you can activate out of the box. These guardrails span the most common safety and security concerns in LLM-powered applications from blocking harmful content and detecting jailbreak attempts to masking personally identifiable information and grounding responses in evidence.

Each guardrail is implemented as a configurable rail flow that you add to the input, output, or retrieval section of your config.yml. You can use NVIDIA-trained safety models, open-source community models, LLM self-check prompts, or third-party managed APIs, and combine multiple approaches for defense in depth.

Browse the catalog below to find the guardrail that fits your use case.