Inproceedings,

Which Words Do You Remember? Temporal Properties of Language Use in Digital Archives

Theory and Practice of Digital Libraries, volume 7489 of Lecture Notes in Computer Science, pages 32-37. Berlin / Heidelberg, Springer (2012)
DOI: 10.1007/978-3-642-33290-6_4

Abstract

Knowing how terms behave in written texts can help us tailor models, algorithms, and resources to improve access to digital libraries and to answer information needs in archives that span long periods. In this paper, we investigate the behavior of written English in blogs in comparison with traditional texts from the New York Times, The Times Archive, and the British National Corpus. We show that user-generated content, like spoken content, differs in its characteristics from ‘professionally’ written text and exhibits more dynamic behavior.
