A Semantic Web Middleware for Virtual Data Integration on the Web
A. Langegger, W. Wöß, and M. Blöchl. Proceedings of the 5th European Semantic Web Conference, Berlin, Heidelberg, Springer Verlag, (June 2008)
Abstract
In this contribution a system is presented, which provides access to distributed data sources using Semantic Web technology. While it was primarily designed for data sharing and scientific collaboration, it is regarded as a base technology useful for many other Semantic Web applications. The proposed system allows to retrieve data using SPARQL queries, data sources can register and abandon freely, and all RDF Schema or OWL vocabularies can be used to describe their data, as long as they are accessible on the Web. Data heterogeneity is addressed by RDF-wrappers like D2R-Server placed on top of local information systems. A query does not directly refer to actual endpoints, instead it contains graph patterns adhering to a virtual data set. A mediator finally pulls and joins RDF data from different endpoints providing a transparent on-the-fly view to the end-user. The SPARQL protocol has been defined to enable systematic data access to remote endpoints. However, remote SPARQL queries require the explicit notion of endpoint URIs. The presented system allows users to execute queries without the need to specify target endpoints. Additionally, it is possible to execute join and union operations across different remote endpoints. The optimization of such distributed operations is a key factor concerning the performance of the overall system. Therefore, proven concepts from database research can be applied.
%0 Conference Paper
%1 langegger2008semantic
%A Langegger, Andreas
%A Wöß, Wolfram
%A Blöchl, Martin
%B Proceedings of the 5th European Semantic Web Conference
%C Berlin, Heidelberg
%D 2008
%E Hauswirth, Manfred
%E Koubarakis, Manolis
%E Bechhofer, Sean
%I Springer Verlag
%K processing data mediator integration distributed sparql query semantic web query-processing-2
%T A Semantic Web Middleware for Virtual Data Integration on the Web
%U http://data.semanticweb.org/conference/eswc/2008/papers/244
%X In this contribution a system is presented, which provides access to distributed data sources using Semantic Web technology. While it was primarily designed for data sharing and scientific collaboration, it is regarded as a base technology useful for many other Semantic Web applications. The proposed system allows to retrieve data using SPARQL queries, data sources can register and abandon freely, and all RDF Schema or OWL vocabularies can be used to describe their data, as long as they are accessible on the Web. Data heterogeneity is addressed by RDF-wrappers like D2R-Server placed on top of local information systems. A query does not directly refer to actual endpoints, instead it contains graph patterns adhering to a virtual data set. A mediator finally pulls and joins RDF data from different endpoints providing a transparent on-the-fly view to the end-user. The SPARQL protocol has been defined to enable systematic data access to remote endpoints. However, remote SPARQL queries require the explicit notion of endpoint URIs. The presented system allows users to execute queries without the need to specify target endpoints. Additionally, it is possible to execute join and union operations across different remote endpoints. The optimization of such distributed operations is a key factor concerning the performance of the overall system. Therefore, proven concepts from database research can be applied.
@inproceedings{langegger2008semantic,
abstract = {In this contribution a system is presented, which provides access to distributed data sources using Semantic Web technology. While it was primarily designed for data sharing and scientific collaboration, it is regarded as a base technology useful for many other Semantic Web applications. The proposed system allows to retrieve data using SPARQL queries, data sources can register and abandon freely, and all RDF Schema or OWL vocabularies can be used to describe their data, as long as they are accessible on the Web. Data heterogeneity is addressed by RDF-wrappers like D2R-Server placed on top of local information systems. A query does not directly refer to actual endpoints, instead it contains graph patterns adhering to a virtual data set. A mediator finally pulls and joins RDF data from different endpoints providing a transparent on-the-fly view to the end-user. The SPARQL protocol has been defined to enable systematic data access to remote endpoints. However, remote SPARQL queries require the explicit notion of endpoint URIs. The presented system allows users to execute queries without the need to specify target endpoints. Additionally, it is possible to execute join and union operations across different remote endpoints. The optimization of such distributed operations is a key factor concerning the performance of the overall system. Therefore, proven concepts from database research can be applied.},
added-at = {2008-05-28T14:50:00.000+0200},
address = {Berlin, Heidelberg},
author = {Langegger, Andreas and Wöß, Wolfram and Blöchl, Martin},
biburl = {https://www.bibsonomy.org/bibtex/2fc7f36c61174d07ed9bdb4608c250284/eswc2008},
booktitle = {Proceedings of the 5th European Semantic Web Conference},
editor = {Hauswirth, Manfred and Koubarakis, Manolis and Bechhofer, Sean},
interhash = {7d1e1c709cbbfb8f8f8f7865d7ca1632},
intrahash = {fc7f36c61174d07ed9bdb4608c250284},
keywords = {processing data mediator integration distributed sparql query semantic web query-processing-2},
month = {June},
publisher = {Springer Verlag},
series = {LNCS},
timestamp = {2008-05-28T14:50:01.000+0200},
title = {A Semantic Web Middleware for Virtual Data Integration on the Web},
url = {http://data.semanticweb.org/conference/eswc/2008/papers/244},
year = 2008
}