Preprocessing Guide

Use these guides for cuVS preprocessing APIs that transform, compress, or project vectors before downstream search or analysis.

  • Binary Quantizer: compress vectors into binary representations for compact storage and fast comparisons.
  • PCA: reduce dimensionality with a linear projection while preserving as much variance as possible.
  • Product Quantization: split vectors into subvectors and encode each part with compact codebooks.
  • Scalar Quantizer: compress each vector dimension independently by mapping its values to a small set of discrete levels.
  • Spectral Embedding: create lower-dimensional embeddings from graph structure.
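
To make the two simplest entries in the list above concrete, here is a minimal sketch of binary and scalar quantization in plain Python. It does not use the cuVS API: the function names, the zero threshold, and the fixed `[lo, hi]` range are all assumptions chosen for illustration, and the cuVS implementations may pick thresholds and ranges differently (for example, from the data itself).

```python
# Illustrative only: plain-Python stand-ins, not cuVS functions.

def binary_quantize(vec, threshold=0.0):
    """Keep one bit per dimension: 1 if the value exceeds the threshold, else 0."""
    return [1 if x > threshold else 0 for x in vec]


def hamming_distance(bits_a, bits_b):
    """Count the positions where two bit vectors differ."""
    return sum(x != y for x, y in zip(bits_a, bits_b))


def scalar_quantize(vec, lo, hi, levels=256):
    """Map each dimension independently to an integer code in [0, levels - 1]."""
    scale = (levels - 1) / (hi - lo)
    codes = []
    for x in vec:
        clipped = min(max(x, lo), hi)  # values outside [lo, hi] saturate
        codes.append(round((clipped - lo) * scale))
    return codes


def scalar_dequantize(codes, lo, hi, levels=256):
    """Recover an approximation of the original values from the codes."""
    step = (hi - lo) / (levels - 1)
    return [lo + c * step for c in codes]


a = [0.9, -0.2, 0.1, -0.7]
b = [0.8, 0.3, -0.4, -0.6]

bits_a = binary_quantize(a)
bits_b = binary_quantize(b)
dist = hamming_distance(bits_a, bits_b)

codes = scalar_quantize(a, lo=-1.0, hi=1.0)
approx = scalar_dequantize(codes, lo=-1.0, hi=1.0)
```

Binary quantization keeps only one bit per dimension, so comparisons reduce to counting differing bits. Scalar quantization keeps more precision: with 256 levels each dimension fits in one byte, and the reconstruction error per dimension is at most half a quantization step for values inside the chosen range.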