Integrating with NeMo Guardrails

NeMo Guardrails uses the LangChain ChatNVIDIA connector to communicate with a locally running NIM microservice, such as Llama 3.1 NemoGuard 8B TopicControl NIM. The topic control microservice exposes the standard OpenAI interface on the `v1/completions` and `v1/chat/completions` endpoints.
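As a point of reference, the OpenAI-compatible endpoint can also be exercised directly. The following sketch builds a standard chat completions request for the microservice; the base URL, model name, and prompts are illustrative assumptions to adjust for your deployment.

```python
# Sketch: calling the topic control NIM through its OpenAI-compatible
# chat completions endpoint. BASE_URL, MODEL, and the prompts below are
# illustrative assumptions, not values mandated by the microservice.
import json
import urllib.request

BASE_URL = "http://localhost:8000/v1"  # assumed local NIM address
MODEL = "llama-3.1-nemoguard-8b-topic-control"  # assumed served model name


def build_request(user_message: str, system_prompt: str) -> urllib.request.Request:
    """Build an OpenAI-style chat completions request for the NIM."""
    payload = {
        "model": MODEL,
        "messages": [
            {"role": "system", "content": system_prompt},
            {"role": "user", "content": user_message},
        ],
    }
    return urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


req = build_request("Tell me about your return policy.", "You are a topical assistant.")
# Sending the request requires a running NIM:
# with urllib.request.urlopen(req) as resp:
#     print(json.load(resp)["choices"][0]["message"]["content"])
```

NeMo Guardrails issues equivalent requests on your behalf, so you normally do not call the endpoint yourself.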

NeMo Guardrails handles the complexity of building the prompt template and parsing the topic control model responses, and provides a programmable method to build a chatbot with content safety rails.

To integrate NeMo Guardrails with the topic control microservice, create a config.yml file that is similar to the following example:

```yaml
models:
  - type: main
    engine: openai
    model: gpt-3.5-turbo-instruct

  - type: "topic_control"
    engine: nim
    parameters:
      base_url: "http://localhost:8000/v1"
      model_name: "llama-3.1-nemoguard-8b-topic-control"

rails:
  input:
    flows:
      - topic safety check input $model=topic_control
```
- The `engine` field specifies `nim`.

- The `parameters.base_url` field specifies the IP address and port of the Llama 3.1 NemoGuard 8B TopicControl NIM host.

- The `parameters.model_name` field in the Guardrails configuration must match the model name served by the Llama 3.1 NemoGuard 8B TopicControl NIM.

- The rails definition specifies `topic_control` as the model.
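To show how the configuration is consumed, the sketch below writes the example `config.yml` into a configuration directory; the commented lines show the NeMo Guardrails calls that would load it. The temporary directory is an assumption for illustration, and the commented calls require the `nemoguardrails` package plus running model endpoints.

```python
# Sketch: place the example config.yml into a Guardrails config directory.
# The temporary directory is an assumption; in practice you would use a
# dedicated config folder in your project.
import os
import tempfile

CONFIG_YML = """\
models:
  - type: main
    engine: openai
    model: gpt-3.5-turbo-instruct

  - type: "topic_control"
    engine: nim
    parameters:
      base_url: "http://localhost:8000/v1"
      model_name: "llama-3.1-nemoguard-8b-topic-control"

rails:
  input:
    flows:
      - topic safety check input $model=topic_control
"""

config_dir = tempfile.mkdtemp()
with open(os.path.join(config_dir, "config.yml"), "w") as f:
    f.write(CONFIG_YML)

# Loading the configuration requires the nemoguardrails package and
# reachable model endpoints:
# from nemoguardrails import LLMRails, RailsConfig
# config = RailsConfig.from_path(config_dir)
# rails = LLMRails(config)
# response = rails.generate(messages=[{"role": "user", "content": "Hello!"}])
```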

Refer to the NVIDIA NeMo Guardrails documentation for more information about the configuration file.