@brusilovsky

Text classification and named entities for new event detection

, and . SIGIR '04: Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval, page 297--304. New York, NY, USA, ACM Press, (2004)
DOI: 10.1145/1008992.1009044

Abstract

New Event Detection is a challenging task that still offers scope for great improvement after years of effort. In this paper we show how performance on New Event Detection (NED) can be improved by the use of text classification techniques as well as by using named entities in a new way. We explore modifications to the document representation in a vector space-based NED system. We also show that addressing named entities preferentially is useful only in certain situations. A combination of all the above results in a multi-stage NED system that performs much better than baseline single-stage NED systems.

Links and resources

Tags

community

  • @gerds0n
  • @brusilovsky
  • @lillejul
  • @aho
  • @dblp
  • @utahell
@brusilovsky's tags highlighted