Relevance models for topic detection and tracking

Proceedings of the Second International Conference on Human Language Technology Research, pages 115-121. San Francisco, CA, USA, Morgan Kaufmann Publishers Inc. (2002)

Abstract

We extend relevance modeling to the link detection task of Topic Detection and Tracking (TDT) and show that it substantially improves performance. Relevance modeling, a statistical language modeling technique related to query expansion, is used to enhance the topic model estimate associated with a news story, boosting the probability of words that are associated with the story even when they do not appear in the story. To apply relevance modeling to TDT, it had to be extended to work with stories rather than short queries, and the similarity comparison had to be changed to a modified form of Kullback-Leibler. We demonstrate that relevance models result in very substantial improvements over the language modeling baseline. We also show how the use of relevance modeling makes it possible to choose a single parameter for within- and cross-mode comparisons of stories.
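The story comparison mentioned in the abstract can be illustrated with a minimal sketch. It is not the paper's method: the smoothing scheme (Jelinek-Mercer interpolation with a collection model), the weight `lam`, the plain symmetric Kullback-Leibler score, and all function names are assumptions chosen for illustration. The abstract does not specify the modified form of Kullback-Leibler, and the relevance modeling step itself is not reproduced here.

```python
import math
from collections import Counter


def background_model(stories):
    """Maximum-likelihood unigram model over all stories (the collection model)."""
    counts = Counter(token for story in stories for token in story)
    total = sum(counts.values())
    return {word: count / total for word, count in counts.items()}


def story_model(tokens, background, lam=0.5):
    """Unigram story model interpolated with the background model.

    The interpolation weight lam is an illustrative assumption. Every
    vocabulary word gets non-zero probability, which keeps the KL terms finite.
    """
    counts = Counter(tokens)
    total = sum(counts.values())
    return {
        word: lam * (counts[word] / total) + (1.0 - lam) * p_bg
        for word, p_bg in background.items()
    }


def kl_divergence(p, q):
    """KL(p || q) for two distributions defined over the same vocabulary."""
    return sum(pw * math.log(pw / q[word]) for word, pw in p.items() if pw > 0)


def symmetric_kl(p, q):
    """Symmetrised KL, used here as a stand-in dissimilarity between stories."""
    return kl_divergence(p, q) + kl_divergence(q, p)


# Toy stories (invented for this sketch, not data from the paper).
story_a = "earthquake hits city rescue teams arrive".split()
story_b = "rescue teams search city after earthquake".split()
story_c = "stock market falls as investors sell shares".split()

background = background_model([story_a, story_b, story_c])
model_a = story_model(story_a, background)
model_b = story_model(story_b, background)
model_c = story_model(story_c, background)

# A lower score suggests that two stories are more likely to share a topic.
same_topic_score = symmetric_kl(model_a, model_b)
different_topic_score = symmetric_kl(model_a, model_c)
```

In a link detection setting, a score like this would be compared against a threshold to decide whether two stories discuss the same topic. Per the abstract, relevance modeling would replace the simple story models above with estimates that also raise the probability of associated words absent from the story.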
