Book,

Knowledge Mining Using Robust Clustering

S. Äyrämö.
Jyväskylä Studies in Computing University of Jyväsylä, (2006)

Abstract

This work is devoted to the development of scalable and robust algorithms for data mining and knowledge discovery problems. The main interest lies in so-called prototype-based clustering methods that are implemented using iterative relocation algorithms. Different elements of prototype-based data clustering are discussed and basic algorithms are described. In order to support the usability of the new methods and algorithms, a modified knowledge mining process model is also proposed. The refined model is based on the well-known knowledge discovery process, but it emphasizes more domain analysis and ''black box'' nature of data mining. Significance and importance of knowledge mining are clarified by outlining the current body of the existing knowledge with real applications.As the main outcome of this thesis, a highly automated robust clustering method is presented. The method consists of a number of separately developed and tested elements such as initialization, prototype estimation, and missing data strategy. Non-smooth nature of the robust statistics is rigorously considered from the point of view of non-smooth optimization. Numerical and statistical properties, such as robustness, scalability, computational and statistical efficiency, of the presented methods are tested and illustrated through a number of numerical experiments. The results are completed with some analytic results and illustrative real-world examples. Furthermore, in order to estimate the correct number of clusters, a new proposal of a cluster validity index is given.

BibTeX key: äyrämö2006knowledge
entry type: book
year: 2006
publisher: University of Jyväsylä
series: Jyväskylä Studies in Computing
volume: 63

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

@book{äyrämö2006knowledge, abstract = {This work is devoted to the development of scalable and robust algorithms for data mining and knowledge discovery problems. The main interest lies in so-called prototype-based clustering methods that are implemented using iterative relocation algorithms. Different elements of prototype-based data clustering are discussed and basic algorithms are described. In order to support the usability of the new methods and algorithms, a modified knowledge mining process model is also proposed. The refined model is based on the well-known knowledge discovery process, but it emphasizes more domain analysis and ''black box'' nature of data mining. Significance and importance of knowledge mining are clarified by outlining the current body of the existing knowledge with real applications.As the main outcome of this thesis, a highly automated robust clustering method is presented. The method consists of a number of separately developed and tested elements such as initialization, prototype estimation, and missing data strategy. Non-smooth nature of the robust statistics is rigorously considered from the point of view of non-smooth optimization. Numerical and statistical properties, such as robustness, scalability, computational and statistical efficiency, of the presented methods are tested and illustrated through a number of numerical experiments. The results are completed with some analytic results and illustrative real-world examples. Furthermore, in order to estimate the correct number of clusters, a new proposal of a cluster validity index is given.}, added-at = {2011-04-05T09:29:33.000+0200}, author = {Äyrämö, Sami}, biburl = {https://www.bibsonomy.org/bibtex/25d1934d97fd0336a44429e55632be98c/vipirtti}, interhash = {d5167e0ebe7b2e0a3100dcc60a2551c4}, intrahash = {5d1934d97fd0336a44429e55632be98c}, keywords = {clustering data mining robust toread}, publisher = {University of Jyväsylä}, series = {Jyväskylä Studies in Computing}, timestamp = {2011-04-05T09:29:33.000+0200}, title = {Knowledge Mining Using Robust Clustering}, volume = 63, year = 2006 }

BibSonomy

Knowledge Mining Using Robust Clustering

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on