
Latent Dirichlet Allocation with topic-in-set knowledge

Proceedings of the NAACL HLT 2009 Workshop on Semi-Supervised Learning for Natural Language Processing, pages 43-48. Stroudsburg, PA, USA, Association for Computational Linguistics, (2009)


Latent Dirichlet Allocation is an unsupervised graphical model which can discover latent topics in unlabeled data. We propose a mechanism for adding partial supervision, called topic-in-set knowledge, to latent topic modeling. This type of supervision can be used to encourage the recovery of topics which are more relevant to user modeling goals than the topics which would be recovered otherwise. Preliminary experiments on text datasets are presented to demonstrate the potential effectiveness of this method.
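
The abstract does not spell out how the supervision enters the model, so the following is only a minimal sketch of one way topic-in-set knowledge could be imposed. It assumes a collapsed Gibbs sampler and a hard constraint: each constrained token may only be assigned a topic from a user-supplied set. The function name `sample_topics`, the `allowed` dictionary, and the hyperparameter defaults are hypothetical and are not taken from the paper.

```python
import random


def sample_topics(docs, num_topics, vocab_size, allowed=None,
                  alpha=0.5, beta=0.1, iters=50, seed=0):
    """Collapsed Gibbs sampling for LDA with hard topic-in-set constraints.

    docs    : list of documents, each a list of integer word ids
    allowed : hypothetical dict {(doc_index, position): set of topic ids};
              tokens without an entry are unconstrained
    Returns the final topic assignment for every token.
    """
    rng = random.Random(seed)
    allowed = allowed or {}
    all_topics = list(range(num_topics))

    doc_topic = [[0] * num_topics for _ in docs]              # n(d, k)
    topic_word = [[0] * vocab_size for _ in range(num_topics)]  # n(k, w)
    topic_total = [0] * num_topics                            # n(k)
    z = []

    # Initialise each token with a topic drawn from its allowed set.
    for d, doc in enumerate(docs):
        z.append([])
        for i, w in enumerate(doc):
            k = rng.choice(sorted(allowed.get((d, i), all_topics)))
            z[d].append(k)
            doc_topic[d][k] += 1
            topic_word[k][w] += 1
            topic_total[k] += 1

    for _ in range(iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove the token's current assignment from the counts.
                k = z[d][i]
                doc_topic[d][k] -= 1
                topic_word[k][w] -= 1
                topic_total[k] -= 1

                # Standard collapsed-Gibbs weights, evaluated only over the
                # allowed topics; every other topic gets probability zero.
                candidates = sorted(allowed.get((d, i), all_topics))
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][w] + beta)
                    / (topic_total[t] + vocab_size * beta)
                    for t in candidates
                ]
                k = rng.choices(candidates, weights=weights)[0]

                # Record the new assignment.
                z[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][w] += 1
                topic_total[k] += 1
    return z
```

For example, calling `sample_topics(docs, 3, 5, allowed={(0, 0): {0}})` would pin the first token of the first document to topic 0 while leaving all other tokens free. A softer variant could down-weight disallowed topics instead of excluding them; that choice is likewise an assumption here.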


