nemoguardrails.library.jailbreak_detection.actions
nemoguardrails.library.jailbreak_detection.actions
Module Contents
Functions
Data
API
async
Checks the user’s prompt to determine if it is attempt to jailbreak the model.
async
Uses a trained classifier to determine if a user input is a jailbreak attempt