Operations
Day-2 operational reference for the DSX Event Bus: monitoring, configuration tuning, service internals, and the chart structure.
Monitoring
Surveyor Configuration
NATS Surveyor exports Prometheus metrics from the NATS cluster. Configure via the surveyor section in Helm values:
Prometheus Operator must be installed for the ServiceMonitor CRD.
Monitoring Architecture
The mTLS cluster’s SYS account is federated to the main cluster via leaf node, enabling centralized monitoring of both NATS instances from a single Surveyor.
NATS Metrics
Surveyor exposes metrics on port 7777 at /metrics:
nats_core_*— core server metrics (connections, messages, bytes)nats_account_*— per-account metricsnats_jetstream_*— JetStream stream and consumer metrics
Auth-Callout Metrics
Auth-callout exposes Prometheus metrics at :9090/metrics:
Health Endpoints
Internal Services
Kubernetes service names and ports within the dsx namespace:
Configuration Reference
mTLS Endpoint
The mTLS NATS cluster is enabled by default. It deploys a separate NATS instance that accepts MQTT connections authenticated with client certificates. This instance has no local JetStream; it connects to the main NATS cluster via leaf nodes.
When disabled (global.eventBus.mtls.enabled: false):
- No
nats-mtlspods, services, or config deployed - No
mqttMtlsgateway route created - mTLS-specific environment variables omitted
- mTLS leaf NKey entries omitted from auth-callout permissions
- mTLS secrets not required
MQTT Stream Configuration
JetStream streams for MQTT persistence are managed declaratively by the NACK controller:
Extra Accounts
Add cluster-wide NATS accounts beyond the defaults:
Properties are passed through to the NATS account configuration on each cluster. CPC leaf nodes bridge enabled extra accounts to CSC, while each account keeps its own permissions and JetStream API surface.
Subchart Configuration
Configure subcharts by prefixing values with the chart alias:
Teardown
helm uninstall removes chart-managed resources but leaves operator-provisioned secrets and service accounts in place. This is intentional — the chart does not own those resources.
Chart uninstall
Run on each cluster (CSC and every CPC):
Removing operator-provisioned resources
If you want a full teardown, delete the secrets and service accounts that were created during pre-deployment. These survive helm uninstall because they were created outside the chart:
Full namespace reset
To remove everything including the namespace:
If using a Vault-backed secrets pipeline, also remove the Vault PKI role, KV paths, and per-cluster Kubernetes auth mounts for a true clean slate.
Chart Dependencies
The nats-event-bus umbrella chart bundles these subcharts: