Search papers, labs, and topics across Lattice.
CavMerge, a novel K-means merging algorithm, addresses the suboptimal performance of standard K-means on non-linearly separable data by merging results from an over-clustered K-means solution. The algorithm leverages local log-concavity assumptions to guide the merging process, eliminating the need for hyperparameter tuning. Experiments on synthetic and real-world datasets demonstrate CavMerge's superior performance and reliability compared to existing K-means merging techniques.
Ditch the hyperparameter tuning headache: CavMerge offers a provably consistent and computationally efficient way to merge K-means clusters, outperforming existing methods on non-linear data.
K-means clustering, a classic and widely-used clustering technique, is known to exhibit suboptimal performance when applied to non-linearly separable data. Numerous adjustments and modifications have been proposed to address this issue, including methods that merge K-means results from a relatively large K to obtain a final cluster assignment. However, existing methods of this nature often encounter computational inefficiencies and suffer from hyperparameter tuning. Here we present \emph{CavMerge}, a novel K-means merging algorithm that is intuitive, free of parameter tuning, and computationally efficient. Operating under minimal local distributional assumptions, our algorithm demonstrates strong consistency and rapid convergence guarantees. Empirical studies on various simulated and real datasets demonstrate that our method yields more reliable clusters in comparison to current state-of-the-art algorithms.