Kmeans
Python module: cuvs.cluster.kmeans
KMeansParams
Hyper-parameters for the kmeans algorithm
Parameters
Constructor
Members
metric
n_clusters
init_method
max_iter
tol
n_init
oversampling_factor
batch_samples
batch_centroids
init_size
streaming_batch_size
hierarchical
hierarchical_n_iters
cluster_cost
@auto_sync_resources
@auto_convert_output
Compute cluster cost given an input matrix and existing centroids
Parameters
Returns
Examples
fit
@auto_sync_resources
@auto_convert_output
Find clusters with the k-means algorithm
When X is a device array (CUDA array interface), standard on-device
k-means is used. When X is a host array (numpy ndarray or
__array_interface__), data is streamed to the GPU in batches
controlled by params.streaming_batch_size. For large host datasets, consider
reducing streaming_batch_size to reduce GPU memory usage.
Parameters
Returns
Examples
Host-data (batched) example:
predict
@auto_sync_resources
@auto_convert_output
Predict clusters with the k-means algorithm
Parameters
Returns
Examples