Apr 5, 2026arXiv:2604.04302

CavMerge: Merging K-means Based on Local Log-Concavity

AI Summary

CavMerge, a novel K-means merging algorithm, addresses the suboptimal performance of standard K-means on non-linearly separable data by merging results from an over-clustered K-means solution. The algorithm leverages local log-concavity assumptions to guide the merging process, eliminating the need for hyperparameter tuning. Experiments on synthetic and real-world datasets demonstrate CavMerge's superior performance and reliability compared to existing K-means merging techniques.

Key Contribution

Ditch the hyperparameter tuning headache: CavMerge offers a provably consistent and computationally efficient way to merge K-means clusters, outperforming existing methods on non-linear data.

Abstract

K-means clustering, a classic and widely-used clustering technique, is known to exhibit suboptimal performance when applied to non-linearly separable data. Numerous adjustments and modifications have been proposed to address this issue, including methods that merge K-means results from a relatively large K to obtain a final cluster assignment. However, existing methods of this nature often encounter computational inefficiencies and suffer from hyperparameter tuning. Here we present \emph{CavMerge}, a novel K-means merging algorithm that is intuitive, free of parameter tuning, and computationally efficient. Operating under minimal local distributional assumptions, our algorithm demonstrates strong consistency and rapid convergence guarantees. Empirical studies on various simulated and real datasets demonstrate that our method yields more reliable clusters in comparison to current state-of-the-art algorithms.

Natural Language Processing Training Efficiency & Optimization

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

CavMerge: Merging K-means Based on Local Log-Concavity

Related Papers