Troubleshooting
This page lists common troubleshooting steps for NVIDIA Config Manager deployments.
Linux inotify Limit Errors
On Ubuntu and other Linux hosts, low default inotify limits can cause Kubernetes workloads to fail during install.
Check the current values:
Set the recommended minimums:
Chart Upload Fails
Confirm Helm can authenticate to the target OCI chart namespace. If you are using a local HTTP registry for testing, pass --plain-http to the airgapped bundle upload helper.
DNS not available
If DNS is not already configured for the site, add the Config Manager hostnames to your local /etc/hosts file, replacing <GATEWAY_IP> with the IP address of the gateway ingress. For example:
For local SSH forwarding, replace <GATEWAY_IP> with 127.0.0.1 and forward the gateway port from the deployment host.
Browser Certificate Warnings
When self-signed TLS is enabled, browsers and API clients must trust the generated certificate authority or explicitly accept the browser warning for every service hostname. Certificate trust is hostname-specific; accepting config-manager.example.com does not automatically trust nautobot.config-manager.example.com.
ESO Secrets Not Syncing
GatewayClass Already Exists
Use the --skip-gateway-class flag with the deploy script to skip the creation of the GatewayClass.
Helm Timeout
Increase --helm-timeout, inspect pod events, and verify storage class availability.
The installer is designed to be re-run after you fix the underlying issue. Re-run the same config with a longer timeout or corrected values. If a failed install left an unusable partial deployment and you do not need to preserve data, uninstall the Helm release and delete the namespace before retrying.
Images Not Loading
LoadBalancer Pending
Operator install fails offline
Confirm manifests/, charts/, and operator-versions.env are present in the bundle.
Pods Stuck in ImagePullBackOff
Confirm image names and tags match image-map.tsv, registry credentials, or node containerd stores.
For Kind deployments that build images locally, confirm the images were loaded into the same Kind cluster used by kubectl:
If the cluster name is not nv-config-manager, deploy with --kind-cluster <name>.