
The Sesame Lucene Sail: RDF Queries with Full-text Search

Technical Report, 2008-1. NEPOMUK Consortium, (February 2008)

Abstract

With the growth of the Semantic Web, the requirements for storing and querying RDF have become more sophisticated. When larger amounts of data have to be managed, queries in structured query languages such as SPARQL are not always powerful enough. Additional keywords in a query can narrow the result set towards the actually relevant answers. However, SPARQL only provides complete string matching or filtering based on regular expressions, which is a very slow operation. In contrast, state-of-the-art Information Retrieval (IR) techniques provide sophisticated features such as keyword search, lemmatisation, stemming and ranking. In this paper we present a combination of structured RDF queries and full-text search. It is implemented as an extension of an established RDF store (Sesame) that adds IR capabilities using the text search library Lucene, without requiring modifications to existing RDF query languages.
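The contrast drawn in the abstract can be illustrated with a minimal sketch. It is not the Sesame or Lucene API and not the report's implementation: the triples, the `ex:` names and every function below are hypothetical, and a plain Python dictionary stands in for a real full-text index. The sketch only shows why a regular-expression filter has to scan every literal, while a keyword index answers with a lookup that can then be intersected with a structured pattern.

```python
import re
from collections import defaultdict

# Toy RDF-like data as (subject, predicate, object) triples.
# All identifiers are hypothetical and exist only for this illustration.
TRIPLES = [
    ("ex:doc1", "rdf:type", "ex:Report"),
    ("ex:doc1", "ex:title", "Full-text search for RDF stores"),
    ("ex:doc2", "rdf:type", "ex:Report"),
    ("ex:doc2", "ex:title", "Ranking and stemming in retrieval"),
    ("ex:doc3", "rdf:type", "ex:Email"),
    ("ex:doc3", "ex:title", "Search meeting notes"),
]


def subjects_of_type(rdf_type):
    """Structured part: subjects matching the pattern (?s rdf:type <rdf_type>)."""
    return {s for s, p, o in TRIPLES if p == "rdf:type" and o == rdf_type}


def regex_filter(pattern):
    """Regex-style filtering: every title literal is scanned on each query."""
    rx = re.compile(pattern, re.IGNORECASE)
    return {s for s, p, o in TRIPLES if p == "ex:title" and rx.search(o)}


def build_index():
    """Build a tiny inverted index from lower-cased tokens to subjects."""
    index = defaultdict(set)
    for s, p, o in TRIPLES:
        if p == "ex:title":
            for token in re.findall(r"\w+", o.lower()):
                index[token].add(s)
    return index


INDEX = build_index()


def keyword_search(keyword):
    """Keyword part: a single dictionary lookup instead of a scan."""
    return INDEX.get(keyword.lower(), set())


def reports_matching(keyword):
    """Combine the structured pattern with the keyword lookup."""
    return subjects_of_type("ex:Report") & keyword_search(keyword)


if __name__ == "__main__":
    print(sorted(reports_matching("search")))
```

A real IR library would add the lemmatisation, stemming and ranking mentioned in the abstract; the toy index here only tokenises and lower-cases.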
