copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Anchored Correlation Explanation: Topic Modeling with Minimal Domain Knowledge.

R. Gallagher, K. Reing, D. Kale, and G. Steeg. (2016)cite arxiv:1611.10277Comment: 21 pages, 7 figures.

Abstract

While generative models such as Latent Dirichlet Allocation (LDA) have proven fruitful in topic modeling, they often require detailed assumptions and careful specification of hyperparameters. Such model complexity issues only compound when trying to generalize generative models to incorporate human input. We introduce Correlation Explanation (CorEx), an alternative approach to topic modeling that does not assume an underlying generative model, and instead learns maximally informative topics through an information-theoretic framework. This framework naturally generalizes to hierarchical and semi-supervised extensions with no additional modeling assumptions. In particular, word-level domain knowledge can be flexibly incorporated within CorEx through anchor words, allowing topic separability and representation to be promoted with minimal human intervention. Across a variety of datasets, metrics, and experiments, we demonstrate that CorEx produces topics that are comparable in quality to those produced by unsupervised and semi-supervised variants of LDA.

Description

[1611.10277] Anchored Correlation Explanation: Topic Modeling with Minimal Domain Knowledge

Links and resources

BibTeX key: gallagher2016anchored
entry type: misc
year: 2016
url: http://arxiv.org/abs/1611.10277
note: cite arxiv:1611.10277Comment: 21 pages, 7 figures

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Anchored Correlation Explanation: Topic Modeling with Minimal Domain Knowledge.

Abstract

Description

Links and resources

Tags

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Anchored Correlation Explanation: Topic Modeling with Minimal Domain Knowledge.

Abstract

Description

Links and resources

Tags

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Anchored Correlation Explanation: Topic Modeling with Minimal Domain Knowledge.

Comments and Reviews
(0)