Inproceedings,

Learning to match and cluster entity names

, and .
ACM SIGIR'01 workshop on Mathematical /Formal Methods in IR, 2001., (2001)

Abstract

Introduction Information retrieval is, in large part, the study of methods for assessing the similarity of pairs of documents. Document similarity metrics have been used for many tasks including ad hoc document retrieval, text classification YC1994, and summarization GC1998,SSMB1997. Another problem area in which similarity metrics are central is record linkage (e.g., KA1985), where one wishes to determine if two database records taken from different source databases refer to the same...

Tags

Users

  • @brusilovsky
  • @aho
  • @sam_chapman

Comments and Reviews