copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Extending RDBMSs To Support Sparse Datasets Using An Interpreted Attribute Storage Format

J. Beckmann, A. Halverson, R. Krishnamurthy, and J. Naughton. Proceedings of the 22nd International Conference on Data Engineering, page 58--. Washington, DC, USA, IEEE Computer Society, (2006)
DOI: 10.1109/ICDE.2006.67

Abstract

"Sparse" data, in which relations have many attributes that are null for most tuples, presents a challenge for relational database management systems. If one uses the normal "horizontal" schema to store such data sets in any of the three leading commercial RDBMS, the result is tables that occupy vast amounts of storage, most of which is devoted to nulls. If one attempts to avoid this storage blowup by using a "vertical" schema, the storage utilization is indeed better, but query performance is orders of magnitude slower for certain classes of queries. In this paper, we argue that the proper way to handle sparse data is not to use a vertical schema, but rather to extend the RDBMS tuple storage format to allow the representation of sparse attributes as interpreted fields. The addition of interpreted storage allows for efficient and transparent querying of sparse data, uniform access to all attributes, and schema scalability. We show, through an implementation in PostgreSQL, that the interpreted storage approach dominates in query efficiency and ease-of-use over the current horizontal storage and vertical schema approaches over a wide range of queries and sparse data sets.

Description

Extending RDBMSs To Support Sparse Datasets Using An Interpreted Attribute Storage Format

Links and resources

BibTeX key: Beckmann:2006:ERS:1129754.1129918
entry type: inproceedings
address: Washington, DC, USA
booktitle: Proceedings of the 22nd International Conference on Data Engineering
year: 2006
pages: 58--
publisher: IEEE Computer Society
series: ICDE '06
acmid: 1129918
isbn: 0-7695-2570-9
DOI: 10.1109/ICDE.2006.67
url: http://dx.doi.org/10.1109/ICDE.2006.67

@sac's tags highlighted

Cite this publication

@inproceedings{Beckmann:2006:ERS:1129754.1129918, abstract = {"Sparse" data, in which relations have many attributes that are null for most tuples, presents a challenge for relational database management systems. If one uses the normal "horizontal" schema to store such data sets in any of the three leading commercial RDBMS, the result is tables that occupy vast amounts of storage, most of which is devoted to nulls. If one attempts to avoid this storage blowup by using a "vertical" schema, the storage utilization is indeed better, but query performance is orders of magnitude slower for certain classes of queries. In this paper, we argue that the proper way to handle sparse data is not to use a vertical schema, but rather to extend the RDBMS tuple storage format to allow the representation of sparse attributes as interpreted fields. The addition of interpreted storage allows for efficient and transparent querying of sparse data, uniform access to all attributes, and schema scalability. We show, through an implementation in PostgreSQL, that the interpreted storage approach dominates in query efficiency and ease-of-use over the current horizontal storage and vertical schema approaches over a wide range of queries and sparse data sets.}, acmid = {1129918}, added-at = {2012-05-07T13:27:39.000+0200}, address = {Washington, DC, USA}, author = {Beckmann, Jennifer L. and Halverson, Alan and Krishnamurthy, Rajasekar and Naughton, Jeffrey F.}, biburl = {https://www.bibsonomy.org/bibtex/2bacb6766270ac5cc5c44164e216aa5ad/sac}, booktitle = {Proceedings of the 22nd International Conference on Data Engineering}, description = {Extending RDBMSs To Support Sparse Datasets Using An Interpreted Attribute Storage Format}, doi = {10.1109/ICDE.2006.67}, interhash = {69bc090d280ef2377bd9ffd53934fddd}, intrahash = {bacb6766270ac5cc5c44164e216aa5ad}, isbn = {0-7695-2570-9}, keywords = {datasets rdbms sparse xml}, pages = {58--}, publisher = {IEEE Computer Society}, series = {ICDE '06}, timestamp = {2012-05-07T13:27:39.000+0200}, title = {Extending RDBMSs To Support Sparse Datasets Using An Interpreted Attribute Storage Format}, url = {http://dx.doi.org/10.1109/ICDE.2006.67}, year = 2006 }

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Extending RDBMSs To Support Sparse Datasets Using An Interpreted Attribute Storage Format

Abstract

Description

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Extending RDBMSs To Support Sparse Datasets Using An Interpreted Attribute Storage Format

Abstract

Description

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Extending RDBMSs To Support Sparse Datasets Using An Interpreted Attribute Storage Format

Comments and Reviews
(0)