Article,

Experimental explorations on short text topic mining between LDA and NMF based Schemes.

Y. Chen, H. Zhang, R. Liu, Z. Ye, and J. Lin.
Knowl. Based Syst., (January 2019)
DOI: https://doi.org/10.1016/j.knosys.2018.08.011

Abstract

Learning topics from short texts has become a critical and fundamental task for understanding the widely-spread streaming social messages, e.g., tweets, snippets and questions/answers. Up to date, there are two distinctive topic learning schemes: generative probabilistic graphical models and geometrically linear algebra approaches, with LDA and NMF being the representative works, respectively. Since these two methods both could uncover the latent topics hidden in the unstructured short texts, some interesting doubts are coming to our minds that which one is better and why? Are there any other more effective extensions? In order to explore valuable insights between LDA and NMF based learning schemes, we comprehensively conduct a series of experiments into two parts. Specifically, the basic LDA and NMF are compared with different experimental settings on several public short text datasets in the first part which would exhibit that NMF tends to perform better than LDA; in the second part, we propose a novel model called “Knowledge-guided Non-negative Matrix Factorization for Better Short Text Topic Mining” (abbreviated as KGNMF), which leverages external knowledge as a semantic regulator with low-rank formalizations, yielding up a time-efficient algorithm. Extensive experiments are conducted on three representative corpora with currently typical short text topic models to demonstrate the effectiveness of our proposed KGNMF. Overall, learning with NMF-based schemes is another effective manner in short text topic mining in addition to the popular LDA-based paradigms.

BibTeX key: journals/kbs/ChenZLYL19
entry type: article
year: 2019
month: January
journal: Knowl. Based Syst.
pages: 1-13
volume: 163
ee: https://doi.org/10.1016/j.knosys.2018.08.011
DOI: https://doi.org/10.1016/j.knosys.2018.08.011
url: https://www.sciencedirect.com/science/article/abs/pii/S0950705118304076

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

%0 Journal Article %1 journals/kbs/ChenZLYL19 %A Chen, Yong %A Zhang, Hui %A Liu, Rui %A Ye, Zhiwen %A Lin, Jianying %D 2019 %J Knowl. Based Syst. %K knowledge-guided-nmf nmf topic-modeling transfer-learning unsupervised %P 1-13 %R https://doi.org/10.1016/j.knosys.2018.08.011 %T Experimental explorations on short text topic mining between LDA and NMF based Schemes. %U https://www.sciencedirect.com/science/article/abs/pii/S0950705118304076 %V 163 %X Learning topics from short texts has become a critical and fundamental task for understanding the widely-spread streaming social messages, e.g., tweets, snippets and questions/answers. Up to date, there are two distinctive topic learning schemes: generative probabilistic graphical models and geometrically linear algebra approaches, with LDA and NMF being the representative works, respectively. Since these two methods both could uncover the latent topics hidden in the unstructured short texts, some interesting doubts are coming to our minds that which one is better and why? Are there any other more effective extensions? In order to explore valuable insights between LDA and NMF based learning schemes, we comprehensively conduct a series of experiments into two parts. Specifically, the basic LDA and NMF are compared with different experimental settings on several public short text datasets in the first part which would exhibit that NMF tends to perform better than LDA; in the second part, we propose a novel model called “Knowledge-guided Non-negative Matrix Factorization for Better Short Text Topic Mining” (abbreviated as KGNMF), which leverages external knowledge as a semantic regulator with low-rank formalizations, yielding up a time-efficient algorithm. Extensive experiments are conducted on three representative corpora with currently typical short text topic models to demonstrate the effectiveness of our proposed KGNMF. Overall, learning with NMF-based schemes is another effective manner in short text topic mining in addition to the popular LDA-based paradigms.

@article{journals/kbs/ChenZLYL19, abstract = {Learning topics from short texts has become a critical and fundamental task for understanding the widely-spread streaming social messages, e.g., tweets, snippets and questions/answers. Up to date, there are two distinctive topic learning schemes: generative probabilistic graphical models and geometrically linear algebra approaches, with LDA and NMF being the representative works, respectively. Since these two methods both could uncover the latent topics hidden in the unstructured short texts, some interesting doubts are coming to our minds that which one is better and why? Are there any other more effective extensions? In order to explore valuable insights between LDA and NMF based learning schemes, we comprehensively conduct a series of experiments into two parts. Specifically, the basic LDA and NMF are compared with different experimental settings on several public short text datasets in the first part which would exhibit that NMF tends to perform better than LDA; in the second part, we propose a novel model called “Knowledge-guided Non-negative Matrix Factorization for Better Short Text Topic Mining” (abbreviated as KGNMF), which leverages external knowledge as a semantic regulator with low-rank formalizations, yielding up a time-efficient algorithm. Extensive experiments are conducted on three representative corpora with currently typical short text topic models to demonstrate the effectiveness of our proposed KGNMF. Overall, learning with NMF-based schemes is another effective manner in short text topic mining in addition to the popular LDA-based paradigms.}, added-at = {2020-11-26T20:09:36.000+0100}, author = {Chen, Yong and Zhang, Hui and Liu, Rui and Ye, Zhiwen and Lin, Jianying}, biburl = {https://www.bibsonomy.org/bibtex/27171a9695fe1dd2d021f3c1d0223b9ca/ghagerer}, doi = {https://doi.org/10.1016/j.knosys.2018.08.011}, ee = {https://doi.org/10.1016/j.knosys.2018.08.011}, interhash = {c6db48253afb8d117eb840d48c050f2f}, intrahash = {7171a9695fe1dd2d021f3c1d0223b9ca}, journal = {Knowl. Based Syst.}, keywords = {knowledge-guided-nmf nmf topic-modeling transfer-learning unsupervised}, month = {January}, pages = {1-13}, timestamp = {2020-11-26T20:09:36.000+0100}, title = {Experimental explorations on short text topic mining between LDA and NMF based Schemes.}, url = {https://www.sciencedirect.com/science/article/abs/pii/S0950705118304076}, volume = 163, year = 2019 }

BibSonomy

Experimental explorations on short text topic mining between LDA and NMF based Schemes.

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on