A graph-based estimator of the number of clusters
Assessing the number of clusters of a statistical population is one of the essential issues of unsupervised learning. Given n independent observations X1,...,Xn drawn from an unknown multivariate probability density f, we propose a new approach to estimate the number of connected components, or clusters, of the t-level set . The basic idea is to form a rough skeleton of the set using any preliminary estimator of f, and to count the number of connected components of the resulting graph. Under...