For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
GitHubCUDA-X
    • Home
    • Installation
  • Getting Started
    • Introduction
      • Vector Search
      • Clustering
      • Compression
      • Vector Database
      • Tuning Indexes
    • Integrations
    • Use-cases
  • User Guide
    • API Guide
    • Benchmarking Guide
    • Field Guide
    • References
  • Developer Guide
    • Coding Guidelines
    • Contributing
  • API Reference
    • C API Documentation
    • Cpp API Documentation
    • Python API Documentation
    • Java API Documentation
    • Rust API Documentation
    • Go API Documentation
NVIDIANVIDIA
Developer-friendly docs for your API
Privacy Policy | Your Privacy Choices | Terms of Service | Accessibility | Corporate Policies | Product Security | Contact

Copyright © 2026, NVIDIA Corporation.

LogoLogocuVS
GitHubCUDA-X
Getting Started

Introduction

||View as Markdown|

Start here to learn the main concepts behind NVIDIA cuVS, how they fit together, and which guide to read next.

  • Vector Search: learn the main index families, how they trade recall for performance, and where to start when choosing an index.
  • Clustering: compare clustering methods, including why K-Means is often the preferred partitioning method for large vector search systems.
  • Vector Compression: compare quantization and dimensionality reduction methods for reducing vector storage, bandwidth, and search cost.
  • Vector Database: understand the difference between a vector index and a vector database, including local, global, and hybrid partitioning.
  • Tuning Indexes: tune standalone indexes and vector databases for recall, latency, throughput, build time, and memory.
Previous

Getting Started

Next

Vector Search