Аннотация

Introduction Information retrieval is, in large part, the study of methods for assessing the similarity of pairs of documents. Document similarity metrics have been used for many tasks including ad hoc document retrieval, text classification YC1994, and summarization GC1998,SSMB1997. Another problem area in which similarity metrics are central is record linkage (e.g., KA1985), where one wishes to determine if two database records taken from different source databases refer to the same...

Линки и ресурсы

тэги

сообщество

  • @brusilovsky
  • @aho
  • @sam_chapman
@brusilovsky- тэги данного пользователя выделены