Flash-KMeans: Fast and Memory-Efficient Exact K-Means

(arxiv.org)

45 points | by matt_d 3 days ago

2 comments

wood_spirit 39 minutes ago
Does this have corresponding speed ups or memory gains for normal CPUs too? Just thinking about all the cups of coffee that have been made and drunk while scikit-learn kmeans chugs through a notebook :)
[-]
- snovv_crash 10 minutes ago
  For CPU with bigger K you would put the centroids in a search tree, so take advantage of the sparsity, while a GPU would calculate the full NxK distance matrix. So from my understanding the bottleneck they are fixing doesn't show up on CPU.
matrix2596 42 minutes ago
looks like flash attention concepts applied to kmeans, nice speedup results