Supervised and Unsupervised Ensemble Methods and their Applications
A. Fred, and A. Lourenco. volume 126 of Studies in Computational Intelligence, chapter Cluster Ensemble Methods: from Single Clusterings to Combined Solutions, page 3-30. Springer Berlin / Heidelberg, (2008)
Abstract
Cluster ensemble methods attempt to find better and more robust clustering solutions by fusing information from several data partitionings. In this chapter, we address
the different phases of this recent approach: from the generation of the partitions,
the clustering ensemble, to the combination and validation of the combined result. While giving an overall revision of the state-of-the-art in the area, we focus on our own work on the subject. In particular, the Evidence Accumulation Clustering (EAC) paradigm is detailed and analyzed. For
the validation/selection of the final partition, we focus on metrics that can quantitatively measure the consistency between partitions and combined results, and thus enabling the choice of best results
without the use of additional information. Information-theoretic measures in conjunction with a variance analysis using bootstrapping are detailed and empirically evaluated. Experimental results throughout the paper illustrate the various concepts and methods addressed, using synthetic and real
data and involving both vectorial and string-based data representations. We show that the clustering ensemble approach can be used in very distinct contexts with the state of the art quality results
%0 Book Section
%1 Chapter2008
%A Fred, Ana
%A Lourenco, Andre
%B Studies in Computational Intelligence
%D 2008
%E Okun, Oleg
%E Ventini, Giorgio
%I Springer Berlin / Heidelberg
%K cluster_ensemble consistency_index evidence_accumulation_clustering hierarchical_clustering k-means normalized_mutual_information string_representation
%P 3-30
%T Supervised and Unsupervised Ensemble Methods and their Applications
%V 126
%X Cluster ensemble methods attempt to find better and more robust clustering solutions by fusing information from several data partitionings. In this chapter, we address
the different phases of this recent approach: from the generation of the partitions,
the clustering ensemble, to the combination and validation of the combined result. While giving an overall revision of the state-of-the-art in the area, we focus on our own work on the subject. In particular, the Evidence Accumulation Clustering (EAC) paradigm is detailed and analyzed. For
the validation/selection of the final partition, we focus on metrics that can quantitatively measure the consistency between partitions and combined results, and thus enabling the choice of best results
without the use of additional information. Information-theoretic measures in conjunction with a variance analysis using bootstrapping are detailed and empirically evaluated. Experimental results throughout the paper illustrate the various concepts and methods addressed, using synthetic and real
data and involving both vectorial and string-based data representations. We show that the clustering ensemble approach can be used in very distinct contexts with the state of the art quality results
%& Cluster Ensemble Methods: from Single Clusterings to Combined Solutions
@inbook{Chapter2008,
abstract = {Cluster ensemble methods attempt to find better and more robust clustering solutions by fusing information from several data partitionings. In this chapter, we address
the different phases of this recent approach: from the generation of the partitions,
the clustering ensemble, to the combination and validation of the combined result. While giving an overall revision of the state-of-the-art in the area, we focus on our own work on the subject. In particular, the Evidence Accumulation Clustering (EAC) paradigm is detailed and analyzed. For
the validation/selection of the final partition, we focus on metrics that can quantitatively measure the consistency between partitions and combined results, and thus enabling the choice of best results
without the use of additional information. Information-theoretic measures in conjunction with a variance analysis using bootstrapping are detailed and empirically evaluated. Experimental results throughout the paper illustrate the various concepts and methods addressed, using synthetic and real
data and involving both vectorial and string-based data representations. We show that the clustering ensemble approach can be used in very distinct contexts with the state of the art quality results},
added-at = {2009-10-25T21:32:29.000+0100},
author = {Fred, Ana and Lourenco, Andre},
biburl = {https://www.bibsonomy.org/bibtex/2d680a1c715e47360068b67c735cbb0a5/alourenco},
chapter = {Cluster Ensemble Methods: from Single Clusterings to Combined Solutions},
editor = {Okun, Oleg and Ventini, Giorgio},
interhash = {885c44583dfb0c28d203fddda6806ce2},
intrahash = {d680a1c715e47360068b67c735cbb0a5},
keywords = {cluster_ensemble consistency_index evidence_accumulation_clustering hierarchical_clustering k-means normalized_mutual_information string_representation},
pages = {3-30},
publisher = {Springer Berlin / Heidelberg},
series = {Studies in Computational Intelligence},
timestamp = {2009-10-25T21:32:29.000+0100},
title = {Supervised and Unsupervised Ensemble Methods and their Applications},
volume = 126,
year = 2008
}