copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Finding scientific topics

T. Griffiths, and M. Steyvers. Proceedings of the National academy of Sciences, 101 (suppl 1): 5228--5235 (2004)

Abstract

A first step in identifying the content of a document is determining which topics that document addresses. We describe a generative model for documents, introduced by Blei, Ng, and Jordan Blei, D. M., Ng, A. Y. & Jordan, M. I. (2003) J. Machine Learn. Res. 3, 993-1022, in which each document is generated by choosing a distribution over topics and then choosing each word in the document from a topic selected according to this distribution. We then present a Markov chain Monte Carlo algorithm for inference in this model. We use this algorithm to analyze abstracts from PNAS by using Bayesian model selection to establish the number of topics. We show that the extracted topics capture meaningful structure in the data, consistent with the class designations pro- vided by the authors of the articles, and outline further applica- tions of this analysis, including identifying ‘‘hot topics’’ by exam- ining temporal dynamics and tagging abstracts to illustrate semantic content.

Links and resources

BibTeX key: griffiths2004finding
entry type: article
year: 2004
journal: Proceedings of the National academy of Sciences
number: suppl 1
pages: 5228--5235
publisher: National Acad Sciences
volume: 101
Document: http://www.pnas.org/content/101/suppl_1/5228.full.pdf

@huiyangsfsu's tags highlighted

Cite this publication

search on

Meta data

Last update 8 years ago
Created 8 years ago

Comments and Reviews
(0)

There is no review or comment yet. You can write one!

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Finding scientific topics

Abstract

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Finding scientific topics

Abstract

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Finding scientific topics

Comments and Reviews
(0)