copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Dealing with Missing Data: Algorithms Based on Fuzzy Set and Rough Set Theories

D. Li, J. Deogun, W. Spaulding, and B. Shuart. Transactions on Rough Sets IV, volume 3700 of Lecture Notes in Computer Science, Springer Berlin Heidelberg, (2005)
DOI: 10.1007/11574798_3

Abstract

Missing data, commonly encountered in many fields of study, introduce inaccuracy in the analysis and evaluation. Previous methods used for handling missing data (e.g., deleting cases with incomplete information, or substituting the missing values with estimated mean scores), though simple to implement, are problematic because these methods may result in biased data models. Fortunately, recent advances in theoretical and computational statistics have led to more flexible techniques to deal with the missing data problem. In this paper, we present missing data imputation methods based on clustering, one of the most popular techniques in Knowledge Discovery in Databases (KDD). We combine clustering with soft computing, which tends to be more tolerant of imprecision and uncertainty, and apply fuzzy and rough clustering algorithms to deal with incomplete data. The experiments show that a hybridization of fuzzy set and rough set theories in missing data imputation algorithms leads to the best performance among our four algorithms, i.e., crisp K-means, fuzzy K-means, rough K-means, and rough-fuzzy K-means imputation algorithms.

Description

Dealing with Missing Data: Algorithms Based on Fuzzy Set and Rough Set Theories - Springer

@vivion's tags highlighted

Cite this publication

@incollection{li2005dealing, abstract = {Missing data, commonly encountered in many fields of study, introduce inaccuracy in the analysis and evaluation. Previous methods used for handling missing data (e.g., deleting cases with incomplete information, or substituting the missing values with estimated mean scores), though simple to implement, are problematic because these methods may result in biased data models. Fortunately, recent advances in theoretical and computational statistics have led to more flexible techniques to deal with the missing data problem. In this paper, we present missing data imputation methods based on clustering, one of the most popular techniques in Knowledge Discovery in Databases (KDD). We combine clustering with soft computing, which tends to be more tolerant of imprecision and uncertainty, and apply fuzzy and rough clustering algorithms to deal with incomplete data. The experiments show that a hybridization of fuzzy set and rough set theories in missing data imputation algorithms leads to the best performance among our four algorithms, i.e., crisp K-means, fuzzy K-means, rough K-means, and rough-fuzzy K-means imputation algorithms.}, added-at = {2013-01-02T15:37:11.000+0100}, author = {Li, Dan and Deogun, Jitender and Spaulding, William and Shuart, Bill}, biburl = {https://www.bibsonomy.org/bibtex/2d437cdac233b3cd04bc452cf830e70de/vivion}, booktitle = {Transactions on Rough Sets IV}, description = {Dealing with Missing Data: Algorithms Based on Fuzzy Set and Rough Set Theories - Springer}, doi = {10.1007/11574798_3}, editor = {Peters, JamesF. and Skowron, Andrzej}, interhash = {ec408dc0cb566735b791b5f54c63797f}, intrahash = {d437cdac233b3cd04bc452cf830e70de}, isbn = {978-3-540-29830-4}, keywords = {algorithms data fuzzy missing soft-computing}, pages = {37-57}, publisher = {Springer Berlin Heidelberg}, series = {Lecture Notes in Computer Science}, timestamp = {2013-01-02T15:37:11.000+0100}, title = {Dealing with Missing Data: Algorithms Based on Fuzzy Set and Rough Set Theories}, url = {http://dx.doi.org/10.1007/11574798_3}, volume = 3700, year = 2005 }

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Dealing with Missing Data: Algorithms Based on Fuzzy Set and Rough Set Theories

Abstract

Description

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Dealing with Missing Data: Algorithms Based on Fuzzy Set and Rough Set Theories

Abstract

Description

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Dealing with Missing Data: Algorithms Based on Fuzzy Set and Rough Set Theories

Comments and Reviews
(0)