@cstrasser

An adaptation of the vector-space model for ontology-based information retrieval

, , and . IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 19 (2): 261--272 (February 2007)

Abstract

Semantic search has been one of the motivations of the Semantic Web since it was envisioned. We propose a model for the exploitation of ontology-based knowledge bases to improve search over large document repositories. In our view of Information Retrieval on the Semantic Web, a search engine returns documents rather than, or in addition to, exact values in response to user queries. For this purpose, our approach includes an ontology-based scheme for the semiautomatic annotation of documents and a retrieval system. The retrieval model is based on an adaptation of the classic vector-space model, including an annotation weighting algorithm, and a ranking algorithm. Semantic search is combined with conventional keyword-based retrieval to achieve tolerance to knowledge base incompleteness. Experiments are shown where our approach is tested on corpora of significant scale, showing clear improvements with respect to keyword-based search.

Links and resources

Tags

community

  • @davidlan
  • @mediadigits
  • @lepsky
  • @cstrasser
  • @mbjones.89
@cstrasser's tags highlighted