Inproceedings,

Can RDB2RDF Tools Feasibly Expose Large Science Archives for Data Integration?

A. Gray, N. Gray, and I. Ounis.
6th Annual European Semantic Web Conference (ESWC2009), page 491-505. (June 2009)

Abstract

Many science archive centres publish very large volumes of image, simulation, and experiment data. In order to integrate and analyse the available data, scientists need to be able to (i) identify and locate all the data relevant to their work; (ii) understand the multiple heterogeneous data models in which the data is published; and (iii) interpret and process the data they retrieve. RDF has been shown to be a generally successful framework within which to perform such data integration work. It can be equally successful in the context of scientific data, if it is demonstrably practical to expose that data as RDF. In this paper we investigate the capabilities of RDF to enable the integration of scientific data sources. Specifically, we discuss the suitability of SPARQL for expressing scientific queries, and the performance of several triple stores and RDB2RDF tools for executing queries over a moderately sized sample of a large astronomical data set. We found that more research and improvements are required into SPARQL and RDB2RDF tools to efficiently expose existing science archives for data integration.

BibTeX key: tools2009
entry type: inproceedings
booktitle: 6th Annual European Semantic Web Conference (ESWC2009)
year: 2009
month: June
pages: 491-505
url: http://data.semanticweb.org/conference/eswc/2009/paper/163

BibSonomy

Can RDB2RDF Tools Feasibly Expose Large Science Archives for Data Integration?

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on