@mkroell

An Approach for Measuring Semantic Similarity between Words Using Multiple Information Sources

, , and . IEEE Trans. on Knowl. and Data Eng., 15 (4): 871--882 (2003)
DOI: http://dx.doi.org/10.1109/TKDE.2003.1209005

Abstract

Semantic similarity between words is becoming a generic problem for many applications of computational linguistics and artificial intelligence. This paper explores the determination of semantic similarity by a number of information sources, which consist of structural semantic information from a lexical taxonomy and information content from a corpus. To investigate how information sources could be used effectively, a variety of strategies for using various possible information sources are implemented. A new measure is then proposed which combines information sources nonlinearly. Experimental evaluation against a benchmark set of human similarity ratings demonstrates that the proposed measure significantly outperforms traditional similarity measures.

Links and resources

Tags

community

  • @mkroell
  • @sb3000
  • @wt_08
  • @dblp
@mkroell's tags highlighted