Levelwise search and borders of theories in knowledge discovery

H. Mannila, и H. Toivonen.
Data mining and knowledge discovery, 1 (3): 241--258 (1997)

Аннотация

One of the basic problems in knowledge discovery in databases (KDD) is the following: given a data set r, a class L of sentences for defining subgroups of r, and a selection predicate, find all sentences of L deemed interesting by the selection predicate. We analyze the simple levelwise algorithm for finding all such descriptions. We give bounds for the number of database accesses that the algorithm makes. For this, we introduce the concept of the border of a theory, a notion that turns out to be surprisingly powerful in analyzing the algorithm. We also consider the verification problem of a KDD process: given r and a set of sentences S ⊆ L, determine whether S is exactly the set of interesting statements about r. We show strong connections between the verification problem and the hypergraph transversal problem. The verification problem arises in a natural way when using sampling to speed up the pattern discovery step in KDD. Keywords: theory of knowledge discovery, association rules, episodes, integrity constraints, hypergraph transversals

ключ BibTeX: mannila1997levelwise
тип записи: article
год: 1997
журнал: Data mining and knowledge discovery
номер: 3
страницы: 241--258
издательство: Springer
том: 1
md5sum: 5cb1d6fa19d0b81f8b1cac15647cb87f
citations: 635
file: file://Levelwise search and borders of theories in knowledge discovery.pdf:pdf
pdfmeat: timestamp: 2013-08-04 16:49:12; queries: 2; inode: 1704014
citedbyid: 11021353798018632937
mailhosts: cs.helsinki.fi
url: http://link.springer.com/article/10.1023/A:1009796218281

тэги

Пользователи данного ресурса

Комментарии и рецензиипоказать / перейти в невидимый режим

Пожалуйста, войдите в систему, чтобы принять участие в дискуссии (добавить собственные рецензию, или комментарий)

Цитировать эту публикацию

@article{mannila1997levelwise, abstract = {One of the basic problems in knowledge discovery in databases (KDD) is the following: given a data set r, a class L of sentences for defining subgroups of r, and a selection predicate, find all sentences of L deemed interesting by the selection predicate. We analyze the simple levelwise algorithm for finding all such descriptions. We give bounds for the number of database accesses that the algorithm makes. For this, we introduce the concept of the border of a theory, a notion that turns out to be surprisingly powerful in analyzing the algorithm. We also consider the verification problem of a KDD process: given r and a set of sentences S ⊆ L, determine whether S is exactly the set of interesting statements about r. We show strong connections between the verification problem and the hypergraph transversal problem. The verification problem arises in a natural way when using sampling to speed up the pattern discovery step in KDD. Keywords: theory of knowledge discovery, association rules, episodes, integrity constraints, hypergraph transversals}, added-at = {2013-08-04T16:50:47.000+0200}, author = {Mannila, Heikki and Toivonen, Hannu}, biburl = {https://www.bibsonomy.org/bibtex/229ddfde366cb9c45dffbe77a77c92ce7/francesco.k}, citations = {635}, citedbyid = {11021353798018632937}, file = {file://Levelwise search and borders of theories in knowledge discovery.pdf:pdf}, interhash = {d50c17b81b91904aed719482ff653692}, intrahash = {29ddfde366cb9c45dffbe77a77c92ce7}, journal = {Data mining and knowledge discovery}, keywords = {imported}, mailhosts = {cs.helsinki.fi}, md5sum = {5cb1d6fa19d0b81f8b1cac15647cb87f}, number = 3, pages = {241--258}, pdfmeat = {timestamp: 2013-08-04 16:49:12; queries: 2; inode: 1704014}, publisher = {Springer}, timestamp = {2013-08-04T16:50:47.000+0200}, title = {Levelwise search and borders of theories in knowledge discovery}, url = {http://link.springer.com/article/10.1023/A:1009796218281}, volume = 1, year = 1997 }

BibSonomy