copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Subspace Clustering for High Dimensional Data: A Review

L. Parsons, E. Haque, and H. Liu. SIGKDD Exploration, 6 (1): 90-105 (2004)

Abstract

Subspace clustering is an extension of traditional cluster- ing that seeks to ¯nd clusters in di®erent subspaces within a dataset. Often in high dimensional data, many dimen- sions are irrelevant and can mask existing clusters in noisy data. Feature selection removes irrelevant and redundant dimensions by analyzing the entire dataset. Subspace clus- tering algorithms localize the search for relevant dimensions allowing them to ¯nd clusters that exist in multiple, possi- bly overlapping subspaces. There are two major branches of subspace clustering based on their search strategy. Top- down algorithms ¯nd an initial clustering in the full set of dimensions and evaluate the subspaces of each cluster, it- eratively improving the results. Bottom-up approaches ¯nd dense regions in low dimensional spaces and combine them to form clusters. This paper presents a survey of the various subspace clustering algorithms along with a hierarchy orga- nizing the algorithms by their de¯ning characteristics. We then compare the two main approaches to subspace cluster- ing using empirical scalability and accuracy tests and discuss some potential applications where subspace clustering could be particularly useful.

Links and resources

BibTeX key: text.clustering.review.2004
entry type: article
year: 2004
journal: SIGKDD Exploration
number: 1
pages: 90-105
volume: 6
Document: http://www.sigkdd.org/explorations/issues/6-1-2004-06/parsons.pdf

@huiyangsfsu's tags highlighted

Cite this publication

search on

Meta data

Last update 15 years ago
Created 15 years ago

Comments and Reviews
(0)

There is no review or comment yet. You can write one!

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Subspace Clustering for High Dimensional Data: A Review

Abstract

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Subspace Clustering for High Dimensional Data: A Review

Abstract

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Subspace Clustering for High Dimensional Data: A Review

Comments and Reviews
(0)