copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Selecting the right interestingness measure for association patterns

P. Tan, V. Kumar, and J. Srivastava. KDD '02: Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, page 32--41. New York, NY, USA, ACM, (2002)
DOI: http://doi.acm.org/10.1145/775047.775053

Abstract

Many techniques for association rule mining and feature selection require a suitable metric to capture the dependencies among variables in a data set. For example, metrics such as support, confidence, lift, correlation, and collective strength are often used to determine the interestingness of association patterns. However, many such measures provide conflicting information about the interestingness of a pattern, and the best metric to use for a given application domain is rarely known. In this paper, we present an overview of various measures proposed in the statistics, machine learning and data mining literature. We describe several key properties one should examine in order to select the right measure for a given application domain. A comparative study of these properties is made using twenty one of the existing measures. We show that each measure has different properties which make them useful for some application domains, but not for others. We also present two scenarios in which most of the existing measures agree with each other, namely, support-based pruning and table standardization. Finally, we present an algorithm to select a small set of tables such that an expert can select a desirable measure by looking at just this small set of tables.

Description

Selecting the right interestingness measure for association patterns

Links and resources

BibTeX key: tan/kdd/2002
entry type: inproceedings
address: New York, NY, USA
booktitle: KDD '02: Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
year: 2002
pages: 32--41
publisher: ACM
location: Edmonton, Alberta, Canada
isbn: 1-58113-567-X
DOI: http://doi.acm.org/10.1145/775047.775053
url: http://portal.acm.org/citation.cfm?id=775053

@mboley's tags highlighted

Cite this publication

@inproceedings{tan/kdd/2002, abstract = {Many techniques for association rule mining and feature selection require a suitable metric to capture the dependencies among variables in a data set. For example, metrics such as support, confidence, lift, correlation, and collective strength are often used to determine the interestingness of association patterns. However, many such measures provide conflicting information about the interestingness of a pattern, and the best metric to use for a given application domain is rarely known. In this paper, we present an overview of various measures proposed in the statistics, machine learning and data mining literature. We describe several key properties one should examine in order to select the right measure for a given application domain. A comparative study of these properties is made using twenty one of the existing measures. We show that each measure has different properties which make them useful for some application domains, but not for others. We also present two scenarios in which most of the existing measures agree with each other, namely, support-based pruning and table standardization. Finally, we present an algorithm to select a small set of tables such that an expert can select a desirable measure by looking at just this small set of tables.}, added-at = {2010-02-17T10:21:16.000+0100}, address = {New York, NY, USA}, author = {Tan, Pang-Ning and Kumar, Vipin and Srivastava, Jaideep}, biburl = {https://www.bibsonomy.org/bibtex/25b4ecdb28cdcba4e70e676f8b77b3372/mboley}, booktitle = {KDD '02: Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining}, description = {Selecting the right interestingness measure for association patterns}, doi = {http://doi.acm.org/10.1145/775047.775053}, interhash = {ebeff860f22400afd4188d0742842899}, intrahash = {5b4ecdb28cdcba4e70e676f8b77b3372}, isbn = {1-58113-567-X}, keywords = {associationRules interestingness patternMining}, location = {Edmonton, Alberta, Canada}, pages = {32--41}, publisher = {ACM}, timestamp = {2010-02-17T10:21:16.000+0100}, title = {Selecting the right interestingness measure for association patterns}, url = {http://portal.acm.org/citation.cfm?id=775053}, year = 2002 }

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Selecting the right interestingness measure for association patterns

Abstract

Description

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Selecting the right interestingness measure for association patterns

Abstract

Description

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Selecting the right interestingness measure for association patterns

Comments and Reviews
(0)