
Joint Latent Topic Models for Text and Citations

Ramesh M. Nallapati, Amr Ahmed, Eric P. Xing, and William W. Cohen. Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 542--550. New York, NY, USA, ACM, (2008)
DOI: 10.1145/1401890.1401957

Abstract

In this work, we address the problem of joint modeling of text and citations in the topic modeling framework. We present two different models, called the Pairwise-Link-LDA and the Link-PLSA-LDA models. The Pairwise-Link-LDA model combines the ideas of LDA [4] and Mixed Membership Block Stochastic Models [1] and allows modeling arbitrary link structure. However, the model is computationally expensive, since it involves modeling the presence or absence of a citation (link) between every pair of documents. The second model solves this problem by assuming that the link structure is a bipartite graph. As the name indicates, the Link-PLSA-LDA model combines the LDA and PLSA models into a single graphical model. Our experiments on a subset of Citeseer data show that both these models are able to predict unseen data better than the baseline model of Erosheva and Lafferty [8], by capturing the notion of topical similarity between the contents of the cited and citing documents. Our experiments on two different data sets show that the Link-PLSA-LDA model performs best on the citation prediction task, while also remaining highly scalable. In addition, we present some interesting visualizations generated by each of the models.
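
The abstract does not spell out the generative processes themselves. As a rough, hedged illustration of why the Pairwise-Link-LDA approach is quadratic in the number of documents, the following Python sketch pairs an LDA-style text step with a simplified Bernoulli link for every ordered document pair. The variable names (theta, beta, link_strength) and the dot-product link probability are assumptions made for illustration only, not the paper's exact parameterization.

```python
import numpy as np

rng = np.random.default_rng(0)

K, D, V, N_WORDS = 5, 20, 100, 50   # topics, documents, vocabulary size, words per document
alpha, eta = 0.1, 0.01              # Dirichlet hyperparameters (illustrative values)

# LDA-style part: per-document topic mixtures and per-topic word distributions.
theta = rng.dirichlet(alpha * np.ones(K), size=D)   # shape (D, K)
beta = rng.dirichlet(eta * np.ones(V), size=K)      # shape (K, V)

docs = []
for d in range(D):
    z = rng.choice(K, size=N_WORDS, p=theta[d])            # topic for each word
    w = np.array([rng.choice(V, p=beta[k]) for k in z])    # word drawn from that topic
    docs.append(w)

# Simplified MMSB-flavoured part: a Bernoulli link variable for EVERY ordered
# pair of documents, with probability driven by the affinity of their topic
# mixtures. This double loop is the quadratic cost the abstract refers to.
link_strength = 0.9 * np.eye(K) + 0.01    # assumed topic-affinity matrix, entries in [0, 1]
links = np.zeros((D, D), dtype=int)
for i in range(D):
    for j in range(D):
        if i == j:
            continue
        p_link = theta[i] @ link_strength @ theta[j]   # scalar in [0, 1]
        links[i, j] = rng.random() < p_link

print("sampled", links.sum(), "citation links among", D * (D - 1), "document pairs")
```

Link-PLSA-LDA sidesteps this pairwise loop by restricting the link structure to a bipartite citing-to-cited graph, so only the endpoints of observed citations need to be generated, which is what makes it the more scalable of the two models.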
