Article

Free-sets: a condensed representation of boolean data for the approximation of frequency queries

J. Boulicaut, A. Bykowski, and C. Rigotti.
Data Mining and Knowledge Discovery, 7 (1): 5--22 (2003)

Abstract

Given a large collection of transactions containing items, a basic common data mining problem is to extract the so-called frequent itemsets (i.e., sets of items appearing in at least a given number of transactions). In this paper, we propose a structure called free-sets, from which we can approximate any itemset support (i.e., the number of transactions containing the itemset) and we formalize this notion in the framework of ε-adequate representations (H. Mannila and H. Toivonen, 1996. In Proc. of the Second International Conference on Knowledge Discovery and Data Mining (KDD'96), pp. 189-194). We show that frequent free-sets can be efficiently extracted using pruning strategies developed for frequent itemset discovery, and that they can be used to approximate the support of any frequent itemset. Experiments on real dense data sets show a significant reduction of the size of the output when compared with standard frequent itemset extraction. [abridged]
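
The two definitions the abstract relies on, itemset support and frequent itemsets, can be illustrated with a minimal Python sketch. It is not the free-set extraction method of the paper: it is a brute-force enumeration without any pruning, and the function names (`support`, `frequent_itemsets`) and the toy transactions are hypothetical, chosen only for illustration.

```python
from itertools import combinations


def support(itemset, transactions):
    """Number of transactions that contain every item of `itemset`."""
    items = frozenset(itemset)
    return sum(1 for t in transactions if items <= t)


def frequent_itemsets(transactions, min_support):
    """Brute-force listing of all non-empty itemsets whose support is
    at least `min_support` (exponential; for illustration only)."""
    universe = sorted(set().union(*transactions))
    result = {}
    for size in range(1, len(universe) + 1):
        for candidate in combinations(universe, size):
            count = support(candidate, transactions)
            if count >= min_support:
                result[frozenset(candidate)] = count
    return result


# Hypothetical toy data: four transactions over the items a, b, c.
transactions = [
    frozenset({"a", "b", "c"}),
    frozenset({"a", "b"}),
    frozenset({"a", "c"}),
    frozenset({"b", "c"}),
]

frequent = frequent_itemsets(transactions, min_support=2)
for itemset, count in sorted(frequent.items(), key=lambda kv: sorted(kv[0])):
    print(sorted(itemset), count)
```

On this toy data every single item and every pair is frequent at threshold 2, while the full set {a, b, c} is not, since only one transaction contains all three items.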

BibTeX key: boulicaut2003free
entry type: article
year: 2003
journal: Data Mining and Knowledge Discovery
number: 1
pages: 5--22
publisher: Springer
volume: 7
md5sum: 8ef936d15ab37cf1e342903e0a4ed033
citations: 209
file: file://Free-Sets A Condensed Representation of Boolean Data for the Approximation of Frequency Queries.pdf:pdf
pdfmeat: timestamp: 2013-08-04 16:36:50; queries: 3; inode: 1703980
citedbyid: 17814465378246632528
mailhosts: insa-lyon.fr
url: http://link.springer.com/article/10.1023/A:1021571501451

Cite this publication

@article{boulicaut2003free,
  abstract = {Given a large collection of transactions containing items, a basic common data mining problem is to extract the so-called frequent itemsets (i.e., sets of items appearing in at least a given number of transactions). In this paper, we propose a structure called free-sets, from which we can approximate any itemset support (i.e., the number of transactions containing the itemset) and we formalize this notion in the framework of $\epsilon$-adequate representations (H. Mannila and H. Toivonen, 1996. In Proc. of the Second International Conference on Knowledge Discovery and Data Mining (KDD'96), pp. 189-194). We show that frequent free-sets can be efficiently extracted using pruning strategies developed for frequent itemset discovery, and that they can be used to approximate the support of any frequent itemset. Experiments on real dense data sets show a significant reduction of the size of the output when compared with standard frequent itemset extraction. [abridged]},
  added-at = {2013-08-04T16:39:02.000+0200},
  author = {Boulicaut, Jean-Fran{\c{c}}ois and Bykowski, Artur and Rigotti, Christophe},
  biburl = {https://www.bibsonomy.org/bibtex/2c383f2597ac24fc4c3be8a8409eac821/francesco.k},
  citations = {209},
  citedbyid = {17814465378246632528},
  file = {file://Free-Sets A Condensed Representation of Boolean Data for the Approximation of Frequency Queries.pdf:pdf},
  interhash = {106f95be16f08e44f94a05b71b4d1369},
  intrahash = {c383f2597ac24fc4c3be8a8409eac821},
  journal = {Data Mining and Knowledge Discovery},
  keywords = {imported},
  mailhosts = {insa-lyon.fr},
  md5sum = {8ef936d15ab37cf1e342903e0a4ed033},
  number = 1,
  pages = {5--22},
  pdfmeat = {timestamp: 2013-08-04 16:36:50; queries: 3; inode: 1703980},
  publisher = {Springer},
  timestamp = {2013-08-04T16:39:02.000+0200},
  title = {Free-sets: a condensed representation of boolean data for the approximation of frequency queries},
  url = {http://link.springer.com/article/10.1023/A:1021571501451},
  volume = 7,
  year = 2003
}
