Book,

Probabilistic Databases

D. Suciu, D. Olteanu, C. Ré, and C. Koch.
Synthesis Lectures on Data Management Morgan & Claypool, San Rafael, CA, (2011)
DOI: 10.2200/S00362ED1V01Y201105DTM016

Abstract

Probabilistic databases are databases where the value of some attributes or the presence of some records are uncertain and known only with some probability. Applications in many areas such as information extraction, RFID and scientific data management, data cleaning, data integration, and financial risk assessment produce large volumes of uncertain data, which are best modeled and processed by a probabilistic database. This book presents the state of the art in representation formalisms and query processing techniques for probabilistic data. It starts by discussing the basic principles for representing large probabilistic databases, by decomposing them into tuple-independent tables, block-independent-disjoint tables, or U-databases. Then it discusses two classes of techniques for query evaluation on probabilistic databases. In extensional query evaluation, the entire probabilistic inference can be pushed into the database engine and, therefore, processed as effectively as the evaluation of standard SQL queries. The relational queries that can be evaluated this way are called safe queries. In intensional query evaluation, the probabilistic inference is performed over a propositional formula called lineage expression: every relational query can be evaluated this way, but the data complexity dramatically depends on the query being evaluated, and can be 'Number/Sharp P'-hard. The book also discusses some advanced topics in probabilistic data management such as top-k query processing, sequential probabilistic databases, indexing and materialized views, and Monte Carlo databases.

BibTeX key: SuciuOlteanuEtAl11
entry type: book
address: San Rafael, CA
year: 2011
publisher: Morgan & Claypool
series: Synthesis Lectures on Data Management
volume: 16
file: eBook:2011/SuciuOlteanuEtAl11.pdf:PDF;Amazon Search inside:http\://www.amazon.de/gp/reader/1608456803/:URL
issn: 2153-5418
isbn: 978-1-60845-680-2
groups: public
intrahash: 29e91910295619ba6360541ce550fb24
DOI: 10.2200/S00362ED1V01Y201105DTM016
timestamp: 2011.05.01
username: flint63

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

@book{SuciuOlteanuEtAl11, abstract = {Probabilistic databases are databases where the value of some attributes or the presence of some records are uncertain and known only with some probability. Applications in many areas such as information extraction, RFID and scientific data management, data cleaning, data integration, and financial risk assessment produce large volumes of uncertain data, which are best modeled and processed by a probabilistic database. This book presents the state of the art in representation formalisms and query processing techniques for probabilistic data. It starts by discussing the basic principles for representing large probabilistic databases, by decomposing them into tuple-independent tables, block-independent-disjoint tables, or U-databases. Then it discusses two classes of techniques for query evaluation on probabilistic databases. In extensional query evaluation, the entire probabilistic inference can be pushed into the database engine and, therefore, processed as effectively as the evaluation of standard SQL queries. The relational queries that can be evaluated this way are called safe queries. In intensional query evaluation, the probabilistic inference is performed over a propositional formula called lineage expression: every relational query can be evaluated this way, but the data complexity dramatically depends on the query being evaluated, and can be 'Number/Sharp P'-hard. The book also discusses some advanced topics in probabilistic data management such as top-k query processing, sequential probabilistic databases, indexing and materialized views, and Monte Carlo databases.}, added-at = {2017-05-14T09:24:25.000+0200}, address = {San Rafael, CA}, author = {Suciu, Dan and Olteanu, Dan and R{\'e}, Christopher and Koch, Christoph}, biburl = {https://www.bibsonomy.org/bibtex/2810318231c3450639f671d78fe60c240/flint63}, doi = {10.2200/S00362ED1V01Y201105DTM016}, file = {eBook:2011/SuciuOlteanuEtAl11.pdf:PDF;Amazon Search inside:http\://www.amazon.de/gp/reader/1608456803/:URL}, groups = {public}, interhash = {6f21ee43326b70484bab72a407616be7}, intrahash = {29e91910295619ba6360541ce550fb24}, isbn = {978-1-60845-680-2}, issn = {2153-5418}, keywords = {01624 103 book ai numerical knowledge processing database logic algorithm}, publisher = {Morgan \& Claypool}, series = {Synthesis Lectures on Data Management}, timestamp = {2017-05-14T10:12:23.000+0200}, title = {Probabilistic Databases}, username = {flint63}, volume = 16, year = 2011 }

BibSonomy

Probabilistic Databases

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on