copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Efficient discovery of error-tolerant frequent itemsets in high dimensions

C. Yang, U. Fayyad, and P. Bradley. KDD '01: Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining, page 194--203. New York, NY, USA, ACM, (2001)
DOI: http://doi.acm.org/10.1145/502512.502539

Abstract

We present a generalization of frequent itemsets allowing for the notion of errors in the itemset definition. We motivate the problem and present an efficient algorithm that identifies error-tolerant frequent clusters of items in transactional data (customer-purchase data, web browsing data, text, etc.). The algorithm exploits sparseness of the underlying data to find large groups of items that are correlated over database records (rows). The notion of transaction coverage allows us to extend the algorithm and view it as a fast clustering algorithm for discovering segments of similar transactions in binary sparse data. We evaluate the new algorithm on three real-world applications: clustering high-dimensional data, query selectivity estimation and collaborative filtering. Results show that the algorithm consistently uncovers structure in large sparse databases that other traditional clustering algorithms fail to find.

Description

Efficient discovery of error-tolerant frequent itemsets in high dimensions

Links and resources

BibTeX key: Yang01
entry type: inproceedings
address: New York, NY, USA
booktitle: KDD '01: Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
year: 2001
pages: 194--203
publisher: ACM
location: San Francisco, California
isbn: 1-58113-391-X
DOI: http://doi.acm.org/10.1145/502512.502539
url: http://portal.acm.org/citation.cfm?id=502512.502539

@mboley's tags highlighted

Cite this publication

search on

Meta data

Last update 16 years ago
Created 16 years ago

Comments and Reviews
(0)

There is no review or comment yet. You can write one!

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Efficient discovery of error-tolerant frequent itemsets in high dimensions

Abstract

Description

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Efficient discovery of error-tolerant frequent itemsets in high dimensions

Abstract

Description

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Efficient discovery of error-tolerant frequent itemsets in high dimensions

Comments and Reviews
(0)