@beate

Framework and algorithms for trend analysis in massive temporal data sets

, and . CIKM '04: Proceedings of the thirteenth ACM international conference on Information and knowledge management, page 168--177. New York, NY, USA, ACM, (2004)
DOI: http://doi.acm.org/10.1145/1031171.1031208

Abstract

Mining massive temporal data streams for significant trends, emerging buzz, and unusually high or low activity is an important problem with several commercial applications. In this paper, we propose a framework based on relational records and metric spaces to study such problems. Our framework provides the necessary mathematical underpinnings for this genre of problems, and leads to efficient algorithms in the stream/sort model of massive data sets (where the algorithm makes passes over the data, computes a new stream on the fly, and is allowed to sort the intermediate data). Our algorithm makes novel use of metric approximations in the data stream context, and highlights the role of hierarchical organization of large data sets in designing efficient algorithms in the stream/sort model.

Description

Framework and algorithms for trend analysis in massive temporal data sets

Links and resources

Tags

community

  • @hotho
  • @beate
  • @dblp
@beate's tags highlighted