@jaeschke

YAGO: a core of semantic knowledge

, , and . Proceedings of the 16th international conference on World Wide Web, page 697--706. New York, NY, USA, ACM, (2007)
DOI: 10.1145/1242572.1242667

Abstract

We present YAGO, a light-weight and extensible ontology with high coverage and quality. YAGO builds on entities and relations and currently contains more than 1 million entities and 5 million facts. This includes the Is-A hierarchy as well as non-taxonomic relations between entities (such as HASONEPRIZE). The facts have been automatically extracted from Wikipedia and unified with WordNet, using a carefully designed combination of rule-based and heuristic methods described in this paper. The resulting knowledge base is a major step beyond WordNet: in <i>quality</i> by adding knowledge about individuals like persons, organizations, products, etc. with their semantic relationships - and in <i>quantity</i> by increasing the number of facts by more than an order of magnitude. Our empirical evaluation of fact correctness shows an accuracy of about 95%. YAGO is based on a logically clean model, which is decidable, extensible, and compatible with RDFS. Finally, we show how YAGO can be further extended by state-of-the-art information extraction techniques.

Links and resources

Tags

community

  • @jullybobble
  • @skarab
  • @thoni
  • @mchaves
  • @dblp
  • @mgns
  • @jaeschke
  • @chato
  • @lysander07
  • @yannik.bonda
  • @casi
  • @mhermans
  • @terraces
  • @brightbyte
  • @ngrandy
@jaeschke's tags highlighted