Article

Efficient Discovery of the Most Interesting Associations

G. Webb and J. Vreeken.
Transactions on Knowledge Discovery from Data, 8 (3): 15:1-15:31 (2014)

Abstract

Self-sufficient itemsets have been proposed as an effective approach to summarizing the key associations in data. However, their computation appears highly demanding, as assessing whether an itemset is self-sufficient requires consideration of all pairwise partitions of the itemset into pairs of subsets as well as consideration of all supersets. This paper presents the first published algorithm for efficiently discovering self-sufficient itemsets. This branch-and-bound algorithm deploys two powerful pruning mechanisms based on upper bounds on itemset value and statistical significance level. It demonstrates that finding top-k productive and non-redundant itemsets, with post-processing to identify those that are not independently productive, can efficiently identify small sets of key associations. We present an extensive evaluation of the strengths and limitations of the technique, including comparisons with alternative approaches to finding the most interesting associations.

BibTeX key: WebbVreeken13
entry type: article
year: 2014
journal: Transactions on Knowledge Discovery from Data
number: 3
pages: 15:1-15:31
publisher: ACM
volume: 8
url: http://dx.doi.org/10.1145/2601433
