Deploy NeMo Auditor with Docker
Run the microservice on your local machine using Docker Compose for experimentation.
Prerequisites
Before following this deployment guide, ensure that you have:
Sufficient disk space for generated artifacts (recommended: 10GB).
At least 4GB of available RAM.
Docker and Docker Compose installed on your system.
An NGC API Key with access to the NGC Catalog and Model endpoints on build.nvidia.com. Create an NGC API key following the instructions at Generating NGC API Keys. Specify the NGC Catalog and Public API Endpoints permissions when you generate the key.
The NGC CLI installed. Refer to Getting Started with the NGC CLI for details on setup.
Access to LLM endpoints, such as build.nvidia.com, local NIM, or other compatible endpoints.
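The prerequisites above can be partly checked from a shell before you start. The following is a minimal sketch, not an official tool: the function names `have_cmd`, `free_disk_gb`, and `preflight` are illustrative, and `curl` is included in the check only because the verification step later in this guide uses it. The sketch does not check available RAM, because the way to read it differs between operating systems.

```shell
# Succeeds if the named command is on PATH.
have_cmd() {
  command -v "$1" >/dev/null 2>&1
}

# Prints the free space, in whole GB, of the filesystem that holds the given directory.
free_disk_gb() {
  df -Pk "${1:-.}" | awk 'NR == 2 { printf "%d\n", $4 / 1024 / 1024 }'
}

# Prints one status line per prerequisite and returns non-zero if a tool is missing.
preflight() {
  local missing=0
  local tool
  local free
  for tool in docker ngc curl; do
    if have_cmd "$tool"; then
      echo "ok: $tool"
    else
      echo "missing: $tool"
      missing=1
    fi
  done
  # Docker Compose ships as a Docker CLI plugin, so probe it through docker itself.
  if have_cmd docker && ! docker compose version >/dev/null 2>&1; then
    echo "missing: docker compose plugin"
    missing=1
  fi
  free=$(free_disk_gb .)
  if [ -n "$free" ] && [ "$free" -lt 10 ]; then
    echo "warning: only ${free} GB free; 10 GB is recommended"
  fi
  return "$missing"
}
```

After defining the functions, run `preflight` from the directory where you plan to download the quickstart files.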
Procedure
Set up the NGC CLI by following the instructions in Getting Started with the NGC CLI.
Set up environment variables:
$ export NGC_CLI_API_KEY="<your-ngc-api-key>"
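Because the later steps read this variable, it can help to confirm that it is actually exported before continuing. The helper below is a small sketch; `require_env` is a hypothetical name, and it relies on the `printenv` utility, which is available on most Linux and macOS systems.

```shell
# Fails with a message if the named variable is not exported or is empty.
# Only the variable name is printed, never its value.
require_env() {
  if [ -z "$(printenv "$1")" ]; then
    echo "error: $1 is not set or not exported" >&2
    return 1
  fi
}
```

For example, `require_env NGC_CLI_API_KEY` returns a non-zero status when the key is missing.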
Log in to the NVIDIA NGC container registry:
$ docker login nvcr.io -u '$oauthtoken' -p "$NGC_CLI_API_KEY"
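Docker prints a warning when a password is passed with `-p`, because the value can end up in the process list. As an alternative sketch, the same login can read the key from standard input with the `--password-stdin` option of `docker login`. The wrapper name `ngc_docker_login` is illustrative and is not an NGC or Docker command.

```shell
# Logs in to nvcr.io, passing the key on stdin instead of on the command line.
ngc_docker_login() {
  if [ -z "${NGC_CLI_API_KEY:-}" ]; then
    echo "error: NGC_CLI_API_KEY is not set" >&2
    return 1
  fi
  # The user name is the literal string $oauthtoken, so it stays in single quotes.
  printf '%s' "$NGC_CLI_API_KEY" | docker login nvcr.io --username '$oauthtoken' --password-stdin
}
```

Call `ngc_docker_login` after exporting `NGC_CLI_API_KEY`.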
Download the Docker Compose configuration from NGC:
$ ngc registry resource download-version "nvidia/nemo-microservices/nemo-microservices-quickstart:25.12"
$ cd nemo-microservices-quickstart_v25.12
Start NeMo Auditor:
$ export NEMO_MICROSERVICES_IMAGE_REGISTRY=nvcr.io/nvidia/nemo-microservices
$ export NEMO_MICROSERVICES_IMAGE_TAG=25.12
# Specify NIM_API_KEY for accessing models from build.nvidia.com and using NIM audit targets.
$ export NIM_API_KEY="<your-nvidia-api-key>"
$ docker compose --profile auditor up --detach --quiet-pull --wait
You must specify the NIM_API_KEY environment variable. You can specify a dummy value if you do not plan to access models from build.nvidia.com.
You can also export the OPENAI_API_KEY and REST_API_KEY environment variables if you plan to access models from api.openai.com or use the REST generator in an Auditor target. Refer to Managing Audit Targets for more information about targets.
NeMo Auditor is available at http://localhost:8080/v1beta1/audit.
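The key handling described above can be sketched as a short shell fragment to run before `docker compose up`. It keeps any NIM_API_KEY you have already exported and otherwise falls back to a placeholder. The literal value `dummy` is an arbitrary choice made for this example, and the status report for the optional keys is an illustration, not something the service requires.

```shell
# Keep an existing NIM_API_KEY; otherwise use a placeholder so the variable is defined.
# "dummy" is an arbitrary placeholder, suitable only if you do not call build.nvidia.com.
export NIM_API_KEY="${NIM_API_KEY:-dummy}"

# Report which optional keys are exported (names only; values are never printed).
for name in OPENAI_API_KEY REST_API_KEY; do
  if [ -n "$(printenv "$name")" ]; then
    echo "$name: set"
  else
    echo "$name: not set"
  fi
done
```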
Verify the Deployment
After starting the services, verify everything is working:
Check service status:
$ docker compose ps
Set AUDITOR_BASE_URL:
$ export AUDITOR_BASE_URL=http://localhost:8080
Check that the Auditor service is running:
$ curl "${AUDITOR_BASE_URL}/v1beta1/audit/configs"
Refer to Auditor API for the REST API reference.
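If you script this verification, for example in a test setup, a polling loop around the same `curl` request can be convenient. The following is a sketch under stated assumptions: `wait_for_url` is a hypothetical helper, the default of 30 attempts with a 2-second delay is arbitrary, and the loop treats any HTTP success status as "ready". The `--wait` option of `docker compose up` already waits for the services to report as running or healthy, so this loop is an extra check, not a replacement.

```shell
# Polls a URL until it returns an HTTP success status or the attempts run out.
# Usage: wait_for_url URL [ATTEMPTS] [DELAY_SECONDS]
wait_for_url() {
  local url="$1"
  local attempts="${2:-30}"
  local delay="${3:-2}"
  local i=1
  while [ "$i" -le "$attempts" ]; do
    # --fail makes curl return non-zero on HTTP error statuses such as 404 or 500.
    if curl --silent --fail --output /dev/null --max-time 5 "$url"; then
      echo "ready: $url"
      return 0
    fi
    sleep "$delay"
    i=$((i + 1))
  done
  echo "timed out: $url" >&2
  return 1
}
```

For example, `wait_for_url "${AUDITOR_BASE_URL}/v1beta1/audit/configs"` returns once the endpoint responds successfully, or fails after the attempts are exhausted.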
Stop the Service
To stop the microservice:
$ docker compose --profile auditor down
Next Steps
For more tutorials, refer to NeMo Auditor Tutorials.
To deploy the microservice on a Kubernetes cluster for production use at scale, refer to Deploy NeMo Auditor Using Helm Chart.