@ntran

Topic Cropping: Leveraging Latent Topics for the Analysis of Small Corpora

, , , , und . Research and Advanced Technology for Digital Libraries, Volume 8092 von Lecture Notes in Computer Science, Seite 297-308. Springer Berlin Heidelberg, (2013)
DOI: 10.1007/978-3-642-40501-3_30

Zusammenfassung

Topic modeling has gained a lot of popularity as a means for identifying and describing the topical structure of textual documents and whole corpora. There are, however, many document collections such as qualitative studies in the digital humanities that cannot easily benefit from this technology. The limited size of those corpora leads to poor quality topic models. Higher quality topic models can be learned by incorporating additional domain-specific documents with similar topical content. This, however, requires finding or even manually composing such corpora, requiring considerable effort. For solving this problem, we propose a fully automated adaptable process of

Beschreibung

Topic Cropping: Leveraging Latent Topics for the Analysis of Small Corpora - Springer

Links und Ressourcen

Tags

Community

  • @zerr
  • @kerstinbischoff
  • @dblp
  • @ntran
  • @niederee
@ntrans Tags hervorgehoben