J. Kleinberg. Data Mining and Knowledge Discovery, 7 (4):
373--397(October 2003)
Abstract
A fundamental problem in text data mining is to extract meaningful structure from document streams that arrive continuously over time. E-mail and news articles are two natural examples of such streams, each characterized by topics that appear, grow in intensity for a period of time, and then fade away. The published literature in a particular research field can be seen to exhibit similar phenomena over a much longer time scale. Underlying much of the text mining work in this area is the following intuitive premiseâthat the appearance of a topic in a document stream is signaled by a âburst of activity,â with certain features rising sharply in frequency as the topic emerges.
ER -
%0 Journal Article
%1 kleinberg2003
%A Kleinberg, Jon
%D 2003
%J Data Mining and Knowledge Discovery
%K analysis bayesian imported mail model multistates series stream time
%N 4
%P 373--397
%T Bursty and Hierarchical Structure in Streams
%U http://dx.doi.org/10.1023/A:1024940629314
%V 7
%X A fundamental problem in text data mining is to extract meaningful structure from document streams that arrive continuously over time. E-mail and news articles are two natural examples of such streams, each characterized by topics that appear, grow in intensity for a period of time, and then fade away. The published literature in a particular research field can be seen to exhibit similar phenomena over a much longer time scale. Underlying much of the text mining work in this area is the following intuitive premiseâthat the appearance of a topic in a document stream is signaled by a âburst of activity,â with certain features rising sharply in frequency as the topic emerges.
ER -
@article{kleinberg2003,
abstract = {A fundamental problem in text data mining is to extract meaningful structure from document streams that arrive continuously over time. E-mail and news articles are two natural examples of such streams, each characterized by topics that appear, grow in intensity for a period of time, and then fade away. The published literature in a particular research field can be seen to exhibit similar phenomena over a much longer time scale. Underlying much of the text mining work in this area is the following intuitive premiseâthat the appearance of a topic in a document stream is signaled by a âburst of activity,â with certain features rising sharply in frequency as the topic emerges.
ER -},
added-at = {2007-10-13T20:31:04.000+0200},
author = {Kleinberg, Jon},
biburl = {https://www.bibsonomy.org/bibtex/2162eb7b3982f6edd7a1322c661d7bed7/andreab},
description = {SpringerLink - Journal Article},
interhash = {85f02e6bdb8c5c56b7e71d4eb63bf855},
intrahash = {162eb7b3982f6edd7a1322c661d7bed7},
journal = {Data Mining and Knowledge Discovery},
keywords = {analysis bayesian imported mail model multistates series stream time},
month = {#oct#},
number = 4,
pages = {373--397},
timestamp = {2007-10-13T20:31:04.000+0200},
title = {Bursty and Hierarchical Structure in Streams},
url = {http://dx.doi.org/10.1023/A:1024940629314},
volume = 7,
year = 2003
}