> For clean Markdown of any page, append .md to the page URL. > For a complete documentation index, see https://docs.nvidia.com/cuvs/llms.txt. > For full documentation content, see https://docs.nvidia.com/cuvs/llms-full.txt. > For AI client integration (Claude Code, Cursor, etc.), connect to the MCP server at https://docs.nvidia.com/cuvs/_mcp/server. # PQ _Source header: `cuvs/preprocessing/quantize/pq.hpp`_ ## Product Quantizer utilities ### kmeans_params_variant Alias for the variant holding either balanced or regular k-means parameters. ```cpp using kmeans_params_variant = std::variant; ``` ### preprocessing::quantize::pq::params Product Quantizer parameters. ```cpp struct params { ... }; ``` **Fields** | Name | Type | Description | | --- | --- | --- | | `pq_bits` | `uint32_t` | The bit length of the vector element after compression by PQ. Possible value range: [4-16]. Hint: the smaller the 'pq_bits', the smaller the index size and the faster the fit/transform time, but the lower the recall. | | `pq_dim` | `uint32_t` | The dimensionality of the vector after compression by PQ. When zero, dim / 4 is used as default. TODO: at the moment `dim` must be a multiple `pq_dim`. | | `use_subspaces` | `bool` | Whether to use subspaces for product quantization (PQ). When true, one PQ codebook is used for each subspace. Otherwise, a single PQ codebook is used. | | `use_vq` | `bool` | Whether to use Vector Quantization (KMeans) before product quantization (PQ). When true, VQ is used and PQ is trained on the residuals. | | `vq_n_centers` | `uint32_t` | Vector Quantization (VQ) codebook size - number of "coarse cluster centers". When zero, an optimal value is selected using a heuristic. (sqrt(n_rows)) | | `kmeans_params` | [`kmeans_params_variant`](/api-reference/cpp-api-preprocessing-quantize-pq#kmeans-params-variant) | K-means parameters for PQ codebook training. Set to cuvs::cluster::kmeans::balanced_params for balanced k-means (default), or cuvs::cluster::kmeans::params for regular k-means. The active variant type selects the algorithm; balanced k-means tends to be faster for PQ training where cluster sizes are approximately equal. Only L2Expanded metric is supported. The number of clusters is always set to 1 << pq_bits. | | `max_train_points_per_pq_code` | `uint32_t` | The max number of data points to use per PQ code during PQ codebook training. Using more data points per PQ code may increase the quality of PQ codebook but may also increase the build time. We will use `pq_n_centers * max_train_points_per_pq_code` training points to train each PQ codebook. | | `max_train_points_per_vq_cluster` | `uint32_t` | The max number of data points to use per VQ cluster during training. | ### preprocessing::quantize::pq::params::params Simplified constructor that will build an appropriate kmeans params object. ```cpp params(uint32_t pq_bits, uint32_t pq_dim, bool use_subspaces, bool use_vq, uint32_t vq_n_centers, uint32_t kmeans_n_iters, cuvs::cluster::kmeans::kmeans_type pq_kmeans_type = cuvs::cluster::kmeans::kmeans_type::KMeansBalanced, uint32_t max_train_points_per_pq_code = 256, uint32_t max_train_points_per_vq_cluster = 1024) : pq_bits(pq_bits), ``` **Parameters** | Name | Direction | Type | Description | | --- | --- | --- | --- | | `pq_bits` | | `uint32_t` | | | `pq_dim` | | `uint32_t` | | | `use_subspaces` | | `bool` | | | `use_vq` | | `bool` | | | `vq_n_centers` | | `uint32_t` | | | `kmeans_n_iters` | | `uint32_t` | | | `pq_kmeans_type` | | [`cuvs::cluster::kmeans::kmeans_type`](/api-reference/cpp-api-cluster-kmeans#cluster-kmeans-kmeans-type) | Default: `cuvs::cluster::kmeans::kmeans_type::KMeansBalanced`. | | `max_train_points_per_pq_code` | | `uint32_t` | Default: `256`. | | `max_train_points_per_vq_cluster` | | `uint32_t` | Default: `1024`. | **Returns** `void` ### preprocessing::quantize::pq::quantizer Defines and stores VPQ codebooks upon training ```cpp template struct quantizer { ... }; ``` **Fields** | Name | Type | Description | | --- | --- | --- | | `params_quantizer` | [`params`](/api-reference/cpp-api-preprocessing-quantize-pq#preprocessing-quantize-pq-params) | Parameters used to build this quantizer. | | `vpq_codebooks` | [`cuvs::neighbors::vpq_dataset`](/api-reference/cpp-api-neighbors-common#neighbors-vpq-dataset) | VPQ codebooks produced during training. | ### preprocessing::quantize::pq::build Initializes a product quantizer to be used later for quantizing the dataset. ```cpp quantizer build(raft::resources const& res, const params params, raft::device_matrix_view dataset); ``` The use of a pool memory resource is recommended for more consistent training performance. Usage example: **Parameters** | Name | Direction | Type | Description | | --- | --- | --- | --- | | `res` | in | `raft::resources const&` | raft resource | | `params` | in | [`const params`](/api-reference/cpp-api-preprocessing-quantize-pq#preprocessing-quantize-pq-params) | configure product quantizer, e.g. quantile | | `dataset` | in | `raft::device_matrix_view` | a row-major matrix view on device or host | **Returns** [`quantizer`](/api-reference/cpp-api-preprocessing-quantize-pq#preprocessing-quantize-pq-quantizer) **Additional overload:** `preprocessing::quantize::pq::build` ```cpp quantizer build(raft::resources const& res, const params params, raft::host_matrix_view dataset); ``` **Parameters** | Name | Direction | Type | Description | | --- | --- | --- | --- | | `res` | | `raft::resources const&` | | | `params` | | [`const params`](/api-reference/cpp-api-preprocessing-quantize-pq#preprocessing-quantize-pq-params) | | | `dataset` | | `raft::host_matrix_view` | | **Returns** [`quantizer`](/api-reference/cpp-api-preprocessing-quantize-pq#preprocessing-quantize-pq-quantizer) ### preprocessing::quantize::pq::transform Applies quantization transform to given dataset ```cpp void transform(raft::resources const& res, const quantizer& quant, raft::device_matrix_view dataset, raft::device_matrix_view codes_out, std::optional> vq_labels = std::nullopt); ``` Usage example: used, optional **Parameters** | Name | Direction | Type | Description | | --- | --- | --- | --- | | `res` | in | `raft::resources const&` | raft resource | | `quant` | in | [`const quantizer&`](/api-reference/cpp-api-preprocessing-quantize-pq#preprocessing-quantize-pq-quantizer) | a product quantizer | | `dataset` | in | `raft::device_matrix_view` | a row-major matrix view on device or host | | `codes_out` | out | `raft::device_matrix_view` | a row-major matrix view on device containing the PQ codes | | `vq_labels` | out | `std::optional>` | a vector view on device containing the VQ labels when VQ is Default: `std::nullopt`. | **Returns** `void` **Additional overload:** `preprocessing::quantize::pq::transform` ```cpp void transform(raft::resources const& res, const quantizer& quant, raft::host_matrix_view dataset, raft::device_matrix_view codes_out, std::optional> vq_labels = std::nullopt); ``` **Parameters** | Name | Direction | Type | Description | | --- | --- | --- | --- | | `res` | | `raft::resources const&` | | | `quant` | | [`const quantizer&`](/api-reference/cpp-api-preprocessing-quantize-pq#preprocessing-quantize-pq-quantizer) | | | `dataset` | | `raft::host_matrix_view` | | | `codes_out` | | `raft::device_matrix_view` | | | `vq_labels` | | `std::optional>` | Default: `std::nullopt`. | **Returns** `void` ### preprocessing::quantize::pq::get_quantized_dim Get the dimension of the quantized dataset (in bytes) ```cpp inline int64_t get_quantized_dim(const params& config); ``` **Parameters** | Name | Direction | Type | Description | | --- | --- | --- | --- | | `config` | in | [`const params&`](/api-reference/cpp-api-preprocessing-quantize-pq#preprocessing-quantize-pq-params) | product quantizer parameters | **Returns** `inline int64_t` the dimension of the quantized dataset ### preprocessing::quantize::pq::inverse_transform Applies inverse quantization transform to given dataset ```cpp void inverse_transform( raft::resources const& res, const quantizer& quant, raft::device_matrix_view pq_codes, raft::device_matrix_view out, std::optional> vq_labels = std::nullopt); ``` **Parameters** | Name | Direction | Type | Description | | --- | --- | --- | --- | | `res` | in | `raft::resources const&` | raft resource | | `quant` | in | [`const quantizer&`](/api-reference/cpp-api-preprocessing-quantize-pq#preprocessing-quantize-pq-quantizer) | a product quantizer | | `pq_codes` | in | `raft::device_matrix_view` | a row-major matrix view on device containing the PQ codes | | `out` | out | `raft::device_matrix_view` | a row-major matrix view on device | | `vq_labels` | in | `std::optional>` | a vector view on device containing the VQ labels when VQ is used, optional Default: `std::nullopt`. | **Returns** `void`