DDCS: Configure#

This guide provides detailed information on the Derived Data Cache Service (DDCS) component and its use in a self-hosted NVCF cluster.

DDCS reduces scene load time and improves performance when properly configured and sized for the workload. Derived data generation is computationally expensive and time-consuming. DDCS trades network bandwidth for compute time by caching derived data, allowing multiple GPUs to share pre-generated content.

Derived data is often many times the size of the source content. During initial scene loads, GPUs generate the most derived content as they encounter assets for the first time. While DDCS may add some time to cold scene loads (as content must be generated and written to DDCS synchronously), the generated data becomes immediately available to other GPUs. During “warm” scene loads, render workers read all derived content from DDCS instead of regenerating it, significantly reducing load times.

Base Configuration#

DDCS requires some configuration before it can be installed. Create a file on your local machine called values.yaml. A base configuration is provided in the following dropdown.

values.yaml#
image:
  pullSecrets:
    - name: ngc-container-pull

cluster:
  replicas: 1
  selfAntiAffinity: false
  affinity:
    nodeAffinity:
      preferredDuringSchedulingIgnoredDuringExecution:
        - weight: 100
          preference:
            matchExpressions:
              - key: node-type
                operator: In
                values:
                  - compute
    podAntiAffinity:
      preferredDuringSchedulingIgnoredDuringExecution:
        - weight: 5
          podAffinityTerm:
            labelSelector:
              matchExpressions:
                - key: ddcs
                  operator: In
                  values:
                    - "kvnode"
            topologyKey: "kubernetes.io/hostname"

  container:
    resources:
      #limits:
      #  memory: 32Gi
      requests:
        memory: 32Gi

    storage:
      volume:
        size: 330Gi
        storageClassName: "gp3"

    settings:
      storageLimit: 300G
      engine:
        sys.cache_size: "10G"
        sys.block_cache_size: "18G"
        cf.max_write_buffer_number: 128
        sys.increase_parallelism: 8
        db.max_background_jobs: 8

monitoring:
  enabled: false
Complete Configuration Reference

The base configuration above covers the essential settings for most deployments. For advanced configuration options or to explore all available settings, refer to the complete values file below. This reference includes all configuration options available in the DDCS Helm chart, including advanced settings for TLS, OpenTelemetry, and resource management.

Complete DDCS values.yaml reference#
# Default values for ddcs.

name: ddcs

global:
  additionalLabels:

image:
  registry: nvcr.io
  repository: nvidia/omniverse/ddcs-dist-kv
  pullPolicy: IfNotPresent
  pullSecrets:
    - name: regcred
  tag: "latest"
  overrideTag: false

imagePullSecrets: []
nameOverride: ""
fullnameOverride: ""

# Whole cluster configuration
cluster:
  # The number of pods to deploy
  #
  # By default the pods have a scheduler preference to be placed on different nodes. If an OSSKV pod
  # is already deployed to a node, another will still be scheduled if possible to do so.
  # This allows the load to spread between nodes without making it impossible to scale up on a single node.
  #
  # When using networked storage classes like 'managed-csi-premium' on AKS or NVMesh classes, it may
  # benefit the deployment to have several replicas scheduled on the same node.
  # For example, in AKS *EACH* volume can likely burst to ~150MB/s.
  replicas: 1

  nodeSelector: {}

  tolerations: []

  # When true, the scheduler will prefer to place KV pods on nodes that do not already
  # have a KV store.
  selfAntiAffinity: true

  affinity: {}

  podAnnotations: {}

  podSecurityContext: {}
    # fsGroup: 2000

  # Controls an OpenTelemetry collector/operator deployment that can be used to
  # deploy sidecars in each pod. The sidecar collects the metrics from the running service
  # and sends them to another OTEL collector.
  #
  # If enabled, this option will disable the ServiceMonitor.
  otelCollector:
    # When enabled, `otelHost` and `otelPort` should be `localhost` and `4317`.
    enabled: false
    # The sidecar itself produces metrics; when true, collect those metrics as well.
    includeCollectorMetrics: true
    batch:
    export:
      otlp:
        # Set this to the correct location of the collector in your cluster.
        endpoint: otel-collector.svc.svc.cluster.local:4317
        tls:
          insecure: true

  monitoring:
    labels:
    spec:

  # Configuration applied to each node.
  container:

    rustLog: "info"
    rustBacktrace: "full"

    securityContext: {}
      # capabilities:
      #   drop:
      #   - ALL
      # readOnlyRootFilesystem: true
      # runAsNonRoot: true
      # runAsUser: 1000

    resources:
      #limits:
      #  memory: 64G

    storage:
      volume:
        # When enabled the DB environment will reside on a k8s mounted volume.
        enabled: true
        # This number should be greater than what is provided in settings.storageLimit
        size: 300Gi
        # This is Azure specific and should be changed to match a class available in your cluster.
        storageClassName: managed-csi

    settings:
      # Format to use when logging.
      # Allowed values are: human, human_no_color, json
      logFormat: "json"
      # The maximum amount of storage space the DB will target for use on the persistent volume.
      #
      # This includes all space necessary to maintain a WAL, SST files etc.
      storageLimit: 275G
      # Configuration for garbage collection.
      garbageCollection:
        # The minimum free capacity target for the target map.
        #
        # If the percentage of free map space falls below the given amount, GC
        # will begin.
        minFreeCapacity: 40
        # Once GC begins, the system will attempt to remove the given quantile of the entire
        # keyspace.
        #
        # If this number is 60, then 60% of keys will be deleted.
        deleteKeyspaceQuantile: 60
        # The interval on which to check the database capacity.
        checkDbCapacityMs: 1000
      # Settings for telemetry services.
      telemetry:
        # Enables or disables Prometheus metric export.
        #
        # An HTTP service is started on the specified port.
        prometheusMetricsExposition: true
        # The port to serve metrics on.
        prometheusMetricsPort: 3051
        # Enables or disables OTEL exposition.
        otelExposition: false
        # The OTEL service collection port.
        otelPort: 4317
        # The OTEL service host address.
        #
        # If otelCollector is enabled, this option should remain as `localhost`.
        otelHost: localhost
        # When true the IP env var must be specified.
        #
        # Useful for situations in which the host or pod runs an OTEL collector.
        otelPodIpAsHost: false
      # Engine specific configuration.
      engine:
        # Think of the row cache like a hashmap. When a key/value pair is read it is placed in the map. The next lookup
        # of that key will be served by the row cache. The engine does not do a row scan for the data and no disk
        # IO is necessary.
        # This is primarily where HOT data is served from.
        sys.cache_size: "32G"
        # Key/value pairs can also be served from the block cache. When the engine searches rows on disk it can keep
        # some amount of them cached in memory. This allows the next lookup to be fast if the data resides in the file
        # blocks that have been cached.
        # There is more that goes in the block cache; in general a large block cache can help reduce scan time.
        sys.block_cache_size: "8G"
        sys.block_cache_num_shard_bits: 8
        sys.increase_parallelism: 8
        sys.use_write_buffer_manager: 1

        # Should be equal to the number of threads.
        # RocksDB uses background jobs to flush data to disk.
        # These two values should remain equal.
        db.max_background_jobs: 8

        db.table_cache_numshardbits: 6

        db.allow_concurrent_memtable_write: true

        # This number limits the files that RocksDB will keep open. This is required because K8s
        # will consider the page files associated with open files as memory used by the pod, i.e.
        # the pod can be OOM killed for having too many open files.
        db.max_open_files: 128

        # Write buffers have a default size of 64 MB.
        # 128 * 64 MB = ~8GB of write capacity before writes are slowed to disk speed.
        # When 1 buffer is full the engine switches to the next. When all buffers are full, writes are stalled until another
        # becomes available.
        cf.min_write_buffer_number_to_merge: 2
        cf.max_write_buffer_number: 128

        # Enables the usage of blob DB
        cf.enable_blob_files: true
        cf.enable_blob_garbage_collection: true

        # Values of at least the given size are placed in new blob files instead of going through compaction.
        cf.min_blob_size: 1MB

        # Ensure that stale files are removed more often so that GC is not triggered when it does not need to be.
        # This is 3 minutes.
        #cf.delete_obsolete_files_period_micros: 180000000

        # Ensure that SST files are run through the compaction filter every 4 hours.
        cf.periodic_compaction_seconds: 14400
      # Configure the gRPC HTTP service.
      grpc:
        # Set the initial HTTP2 stream window size.
        # Must not be 0.
        initialStreamWindowSize: 512K
        # Set the initial HTTP2 connection window size.
        # Must not be 0.
        initialConnectionWindowSize: 32M
        # Max number of requests that can be operated on concurrently per connection.
        # Must not be 0.
        connectionConcurrencyLimit: 32
        # Max number of HTTP2 streams per connection.
        # 0 for unlimited.
        maxConcurrentStreams: 0
        # Max frame size for each HTTP2 data frame.
        # 0 for HTTP2 default.
        maxFrameSize: 0
        # Max time that a request can take.
        # Must not be 0.
        timeoutSeconds: 30
        # The amount of time to wait before timing out a protobuf write.
        writeTimeoutSeconds: 30
        # The amount of time in seconds before keepalive probes are sent.
        tcpKeepaliveAfterIdleSeconds: 5
        # The amount of time between each keepalive probe.
        tcpKeepaliveIntervalSeconds: 3
        # The number of probes to send before the socket is considered reset.
        tcpKeepaliveRetries: 15
        # The amount of time to wait between HTTP2 keepalive probes.
        http2KeepaliveIntervalSeconds: 5
        # The amount of time to wait before the connection is considered reset.
        http2KeepaliveTimeoutSeconds: 45
        # The max message size that the gRPC service will allow.
        maxDecodingMessageSize: 5M
        # If true, a tenant ID can be set through a string in the metadata.
        tenantFromMetadata: false
        # TLS configuration options.
        tls:
          # If true, requires TLS/https encrypted transport.
          enabled: false
          # Use an existing kubernetes.io/tls secret.
          secretName: ddcs-tls
          # The path to the certificate to use. Must be in the same directory as the key.
          cert: "/tls/tls.crt"
          # The key for the certificate. Must be in the same directory as the cert.
          key: "/tls/tls.key"
          # If true, the given root CA is used.
          includeCaRoot: false
          # The path to the CA root.
          caRoot: "/cert/path/ca.pem"
        # Controls verification of JWTs.
        jwt:
          # When enabled, any JWT provided in the authorization header will be validated.
          enabled: false
          # When true, a valid JWT must be provided.
          require: false
          # The URLs to get the public JWK set from.
          jwkPublicKeysetUrl:
            - "https://example.com/jwk.json"
            - "test"
          # The interval on which to get the JWK set.
          jwkUpdateIntervalSecs: 1500
          # The size of the cross-connection cache for JWTs.
          cacheSizeMb: 128
          # When true, the aud claim of the token is verified.
          verifyAud: false
          # Tokens will be required to have this value as their audience claim.
          audClaim: ""
          # When true, verifies the expiration time of the provided JWT.
          verifyExp: false
      # Settings that control how DDCS stores items in the storage engine.
      store:
        # The size after which values are treated as blobs.
        blobCutoff: 4M
        # The size of chunks to create when storing blobs.
        blobChunkSize: 4M
        mode: "legacy"

service:
  # Use loadbalancer for the service - allows external IP
  annotations: {}
  loadBalancer: false
  loadBalancerSourceRanges: []
  grpcPort: 3010
  metricsPort: 3051

monitoring:
  prometheusAlerts: true
  enabled: true
  interval: 5s
  path: /metrics
  port: http-metrics
  scheme: http
  scrapeTimeout: 5s

1. Provision and Scale#

Proper provisioning and scaling are critical for DDCS performance. When undersized or misconfigured for the workload, simulations will slow down or even fail. Ensure adequate network bandwidth and storage capacity based on your GPU count and scene complexity.

This guide assumes a “CPU/GPU” node split. That is, workloads that do not require a GPU are scheduled on Kubernetes nodes without GPUs (CPU nodes). Workloads that do require a GPU are scheduled on Kubernetes nodes with GPUs (GPU nodes).

As a baseline, plan for ~3.3 Gbps of network bandwidth for each GPU in the cluster.

AWS EKS Example

Assuming the SKUs for CPU/GPU nodes are m5.8xlarge and g6e.4xlarge respectively.

| SKU                  | vCPU | RAM   | NIC     | GPU    |
|----------------------|------|-------|---------|--------|
| m5.8xlarge (compute) | 32   | 128 G | 10 Gbps | none   |
| g6e.4xlarge (gpu)    | 16   | 128 G | 20 Gbps | 1 L40S |

If a cluster is provisioned with 25 GPUs, roughly 83 Gbps (3.3 Gbps x 25 GPUs) of aggregate bandwidth is necessary to facilitate caching.

Provision 9 compute nodes with 10 Gbps NICs (90 Gbps aggregate) and create 9 DDCS replicas.

A single DDCS pod should be placed on each compute node. The goal is to expand network capability with each pod, distributing cache load across multiple nodes. If 9 compute nodes were provisioned for caching, schedule one DDCS pod on each node.
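This rule of thumb reduces to simple arithmetic. The helper below is an illustrative sketch only; the function name is hypothetical, and the 3.3 Gbps constant comes from the baseline guidance above:

```python
import math

# Baseline cache bandwidth needed per GPU, per the guidance above.
GBPS_PER_GPU = 3.3

def ddcs_replicas(gpu_count: int, node_nic_gbps: float) -> int:
    """Number of compute nodes (one DDCS pod each) needed to serve the cache."""
    required_gbps = gpu_count * GBPS_PER_GPU
    return math.ceil(required_gbps / node_nic_gbps)

# 25 GPUs on 10 Gbps compute nodes: 82.5 Gbps required -> 9 nodes/replicas.
print(ddcs_replicas(25, 10.0))  # 9
```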

values.yaml#
cluster:
  replicas: 9

2. Memory Configuration#

DDCS memory configuration includes Kubernetes resource limits and requests, along with engine-level cache settings. The base configuration allocates memory for the Point Cache, Block Cache, and Write Buffers.

The following configuration profiles cover common CPU and RAM allocations.

values.yaml#
container:
  resources:
    #limits:
    #  memory: 16Gi
    requests:
      memory: 16Gi

  settings:
    engine:
      sys.cache_size: "4G"
      sys.block_cache_size: "8G"
      cf.max_write_buffer_number: 64
      sys.increase_parallelism: 4
      db.max_background_jobs: 4
values.yaml#
container:
  resources:
    #limits:
    #  memory: 32Gi
    requests:
      memory: 32Gi

  settings:
    engine:
      sys.cache_size: "10G"
      sys.block_cache_size: "18G"
      cf.max_write_buffer_number: 128
      sys.increase_parallelism: 8
      db.max_background_jobs: 8
values.yaml#
container:
  resources:
    #limits:
    #  memory: 64Gi
    requests:
      memory: 64Gi

  settings:
    engine:
      sys.cache_size: "20G"
      sys.block_cache_size: "36G"
      cf.max_write_buffer_number: 128
      sys.increase_parallelism: 12
      db.max_background_jobs: 12

Advanced Memory Configuration

Follow the provided sizing guidance where possible. If the configuration needs more advanced tuning, or the environment restricts memory, this section explains what the options mean.

Skip this if using the provided configuration.


Memory Components:

  • Point Cache (sys.cache_size): The first cache level for serving cached content. Set to 10G in the base configuration.

  • Block Cache (sys.block_cache_size): The second cache level used by RocksDB for metadata, filters, and file blocks. Set to 18G in the base configuration.

  • Write Buffers (cf.max_write_buffer_number): In-memory buffers that absorb write bursts before flushing to disk. Each buffer consumes 64MB. The base configuration sets this to 128 buffers (approximately 8GB total capacity).
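The components above can be summed to sanity-check a memory budget. This is illustrative arithmetic for the base configuration, assuming RocksDB's 64 MB default write buffer size; it is not an exact accounting of process memory:

```python
# Base configuration values from the memory components above.
point_cache_gb = 10          # sys.cache_size
block_cache_gb = 18          # sys.block_cache_size
write_buffers = 128          # cf.max_write_buffer_number
buffer_size_gb = 64 / 1024   # 64 MB default RocksDB write buffer

total_gb = point_cache_gb + block_cache_gb + write_buffers * buffer_size_gb
print(total_gb)  # 36.0
```

The sum slightly exceeds the 32 Gi request; in practice not all write buffers are full at once, but this arithmetic is a useful starting point when setting hard memory limits.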

3. Volume Configuration#

Reading from persistent storage, even a network-attached volume, is often faster than regenerating derived data. Therefore, DDCS persists content to a Kubernetes volume.

values.yaml#
    storage:
      volume:
        size: 330Gi
        storageClassName: "gp3"

    settings:
      storageLimit: 300G
size determines the volume size that will be created and attached to each pod.

Depending on the cloud environment, volume size may also control the IOPS and throughput of the volume. Refer to your cloud provider’s documentation for specific performance characteristics.

storageClassName determines the performance characteristics of persistent volumes.

Select a storage class that provides high IOPS and throughput suitable for database workloads. Ideal conditions are 4000 IOPS and greater than 800 MB/s sustained throughput.

storageLimit controls the maximum disk content before garbage collection is triggered.

This value should be 30-60GB smaller than the volume size to provide headroom for filesystem operations and prevent disk exhaustion. Filling a volume will result in IO errors and instability. When disk usage reaches 60% of this limit, garbage collection begins automatically. Smaller volumes will trigger garbage collection more frequently, which can reduce performance.
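The garbage-collection trigger point described above can be checked with a quick calculation, here using the base configuration's storageLimit of 300G and the chart default minFreeCapacity of 40 (percent):

```python
# GC begins once free capacity falls below minFreeCapacity, i.e. once
# usage reaches (100 - minFreeCapacity)% of the storage limit.
storage_limit_gb = 300
min_free_capacity_pct = 40

gc_trigger_gb = storage_limit_gb * (100 - min_free_capacity_pct) / 100
print(gc_trigger_gb)  # 180.0
```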

Example Kubernetes StorageClass for AWS

If a StorageClass named gp3 does not already exist in your cluster, one can be created using the following configuration:

storageclass-gp3.yaml#
apiVersion: storage.k8s.io/v1
kind: StorageClass
metadata:
  name: gp3
provisioner: ebs.csi.aws.com
volumeBindingMode: WaitForFirstConsumer
parameters:
  type: gp3
  iops: "5000"
  throughput: "1000"

Apply this StorageClass to your cluster:

kubectl apply -f storageclass-gp3.yaml

4. Telemetry#

DDCS exports Prometheus metrics for monitoring cache performance, hit rates, and storage utilization. Collection of these metrics is important for diagnosing potential problems with DDCS performance and optimizing cache configuration.

values.yaml#
monitoring:
  enabled: false

Metrics Configuration:

  • monitoring.enabled: When true, enables Prometheus metrics collection. Metrics are exposed via a Kubernetes service in Prometheus format.

Important

The ServiceMonitor CRD must be installed in the cluster for ServiceMonitor resources to work.

Configuration Recommendations#

The following configurations provide complete values files for different cluster sizes. Each configuration includes all base settings optimized for the specified GPU count and bandwidth requirements.

values.yaml#
image:
  pullSecrets:
    - name: ngc-container-pull

cluster:
  replicas: 2  # 2 nodes x 12.5 Gbps = 25 Gbps capacity
  selfAntiAffinity: false
  affinity:
    nodeAffinity:
      preferredDuringSchedulingIgnoredDuringExecution:
        - weight: 100
          preference:
            matchExpressions:
              - key: node-type
                operator: In
                values:
                  - compute
    podAntiAffinity:
      preferredDuringSchedulingIgnoredDuringExecution:
        - weight: 5
          podAffinityTerm:
            labelSelector:
              matchExpressions:
                - key: ddcs
                  operator: In
                  values:
                    - "kvnode"
            topologyKey: "kubernetes.io/hostname"

  container:
    resources:
      #limits:
      #  memory: 32Gi
      requests:
        memory: 32Gi

    storage:
      volume:
        size: 330Gi
        storageClassName: "gp3"

    settings:
      storageLimit: 300G
      engine:
        sys.cache_size: "10G"
        sys.block_cache_size: "18G"
        cf.max_write_buffer_number: 128
        sys.increase_parallelism: 8
        db.max_background_jobs: 8

monitoring:
  enabled: false
values.yaml#
image:
  pullSecrets:
    - name: ngc-container-pull

cluster:
  replicas: 3  # 3 nodes x 12.5 Gbps = 37.5 Gbps capacity
  selfAntiAffinity: false
  affinity:
    nodeAffinity:
      preferredDuringSchedulingIgnoredDuringExecution:
        - weight: 100
          preference:
            matchExpressions:
              - key: node-type
                operator: In
                values:
                  - compute
    podAntiAffinity:
      preferredDuringSchedulingIgnoredDuringExecution:
        - weight: 5
          podAffinityTerm:
            labelSelector:
              matchExpressions:
                - key: ddcs
                  operator: In
                  values:
                    - "kvnode"
            topologyKey: "kubernetes.io/hostname"

  container:
    resources:
      #limits:
      #  memory: 32Gi
      requests:
        memory: 32Gi

    storage:
      volume:
        size: 330Gi
        storageClassName: "gp3"

    settings:
      storageLimit: 300G
      engine:
        sys.cache_size: "10G"
        sys.block_cache_size: "18G"
        cf.max_write_buffer_number: 128
        sys.increase_parallelism: 8
        db.max_background_jobs: 8

monitoring:
  enabled: false
values.yaml#
image:
  pullSecrets:
    - name: ngc-container-pull

cluster:
  replicas: 6  # 6 nodes x 12.5 Gbps = 75 Gbps capacity
  selfAntiAffinity: false
  affinity:
    nodeAffinity:
      preferredDuringSchedulingIgnoredDuringExecution:
        - weight: 100
          preference:
            matchExpressions:
              - key: node-type
                operator: In
                values:
                  - compute
    podAntiAffinity:
      preferredDuringSchedulingIgnoredDuringExecution:
        - weight: 5
          podAffinityTerm:
            labelSelector:
              matchExpressions:
                - key: ddcs
                  operator: In
                  values:
                    - "kvnode"
            topologyKey: "kubernetes.io/hostname"

  container:
    resources:
      #limits:
      #  memory: 32Gi
      requests:
        memory: 32Gi

    storage:
      volume:
        size: 330Gi
        storageClassName: "gp3"

    settings:
      storageLimit: 300G
      engine:
        sys.cache_size: "10G"
        sys.block_cache_size: "18G"
        cf.max_write_buffer_number: 128
        sys.increase_parallelism: 8
        db.max_background_jobs: 8

monitoring:
  enabled: false

Summary#

This guide covered the configuration options for DDCS, including scaling considerations, memory allocation, storage sizing, and monitoring setup. Proper configuration of these settings is essential for optimal DDCS performance in your self-hosted NVCF cluster.

Once you have prepared your values.yaml file with the appropriate configuration, proceed to the DDCS: Deployment guide to deploy DDCS using Helm.