Large-Scale Named Entity Disambiguation Based on Wikipedia Data

Proceedings of EMNLP-CoNLL 2007, pages 708–716. (2007)

Abstract

This paper presents a large-scale system for the recognition and semantic disambiguation of named entities based on information extracted from a large encyclopedic collection and Web search results. It describes in detail the disambiguation paradigm employed and the information extraction process from Wikipedia. Through a process of maximizing the agreement between the contextual information extracted from Wikipedia and the context of a document, as well as the agreement among the category tags associated with the candidate entities, the implemented system shows high disambiguation accuracy on both news stories and Wikipedia articles.
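
The agreement-maximization idea in the abstract can be illustrated with a toy sketch. This is not the paper's implementation: the `disambiguate` function, the dictionary layout, the set-overlap scoring, the exhaustive search, and all of the example data are assumptions made for illustration only. It picks one candidate entity per mention so that the overlap with the document's context words, plus the overlap among the chosen entities' category tags, is as large as possible.

```python
from itertools import combinations, product


def disambiguate(mentions, candidates, doc_context):
    """Pick one entity per mention by brute-force agreement maximization.

    Hypothetical toy model: each candidate is a dict with a "name",
    a set of "contexts" words, and a set of "categories" tags.
    """
    best_score, best_assignment = -1, None
    # Try every combination of one candidate per mention.
    for assignment in product(*(candidates[m] for m in mentions)):
        # Agreement between each entity's context words and the document.
        context_score = sum(len(e["contexts"] & doc_context) for e in assignment)
        # Agreement among the category tags of every pair of chosen entities.
        category_score = sum(
            len(a["categories"] & b["categories"])
            for a, b in combinations(assignment, 2)
        )
        score = context_score + category_score
        if score > best_score:
            best_score, best_assignment = score, assignment
    return {m: e["name"] for m, e in zip(mentions, best_assignment)}


# Made-up example data, not taken from the paper.
candidates = {
    "Columbia": [
        {"name": "Space Shuttle Columbia",
         "contexts": {"nasa", "shuttle", "orbit"},
         "categories": {"spacecraft"}},
        {"name": "Columbia University",
         "contexts": {"university", "campus"},
         "categories": {"universities"}},
    ],
    "Discovery": [
        {"name": "Space Shuttle Discovery",
         "contexts": {"nasa", "shuttle", "launch"},
         "categories": {"spacecraft"}},
        {"name": "Discovery Channel",
         "contexts": {"television", "channel"},
         "categories": {"tv networks"}},
    ],
}
doc_context = {"nasa", "launch", "crew"}

result = disambiguate(["Columbia", "Discovery"], candidates, doc_context)
print(result)
```

Exhaustive search grows exponentially with the number of mentions, so a system operating at large scale would need a cheaper approximation; the sketch only shows the shape of the objective.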
