@mboley

Ambiguous Frequent Itemset Mining and Polynomial Delay Enumeration

, and . Advances in Knowledge Discovery and Data Mining, (2008)

Abstract

Mining frequently appearing patterns in a database is a basic problem in recent informatics, especially in data mining. Particularly, when the input database is a collection of subsets of an itemset, called transaction, the problem is called the frequent itemsetmining problem, and it has been extensively studied. The items in a frequent itemset appear in many records simultaneously,thus they can be considered to be a cluster with respect to these records. However, in this sense, the condition that everyitem appears in each record is quite strong. We should allow for several missing items in these records. In this paper, weapproach this problem from the algorithm theory, and consider the model that can be solved efficiently and possibly valuablein practice. We introduce ambiguous frequent itemsets which allow missing items in their occurrence records. More precisely,for given thresholds θ and σ, an ambiguous frequent itemset P has a transaction set

Description

fault-tolerant frequent set mining with polynomial delay.

Links and resources

Tags

community

  • @dblp
  • @mboley
@mboley's tags highlighted