@msn

Automatic Extraction of Clusters from Hierarchical Clustering Representations.

, , , , and . Proc. of 7th Pacific-Asia Conf. of Advances in Knowledge Discovery and Data Mining, PAKDD 2003, Proceedings, page 75-87. Springer, (2003)

Abstract

Hierarchical clustering algorithms are typically more effective in detecting the true clustering structure of a data set than partitioning algorithms. However, hierarchical clustering algorithms do not actually create clusters, but compute only a hierarchical representation of the data set. This makes them unsuitable as an automatic pre-processing step for other algorithms that operate on detected clusters. This is true for both dendrograms and reachability plots, which have been proposed as hierarchical clustering representations, and which have different advantages and disadvantages. In this paper we first investigate the relation between dendrograms and reachability plots and introduce methods to convert them into each other showing that they essentially contain the same information. Based on reachability plots, we then introduce a technique that automatically determines the significant clusters in a hierarchical cluster representation. This makes it for the first time possible to use hierarchical clustering as an automatic pre-processing step that requires no user interaction to select clusters from a hierarchical cluster representation.

Links and resources

Tags

community

  • @msn
  • @dblp
@msn's tags highlighted