@chris_o

Unsupervised Text Learning Based on Context Mixture Model with Dirichlet Prior

, , and . Advanced Web and NetworkTechnologies, and Applications, (2008)

Abstract

In this paper, we proposed a bayesian mixture model, in which introduce a context variable, which has Dirichlet prior, in a bayesian framework to model text multiple topics and then clustering. It is a novel unsupervised text learning algorithmto cluster large-scale web data. In addition, parameters estimation we adopt Maximum Likelihood (ML) and EM algorithm to estimatethe model parameters, and employed BIC principle to determine the number of clusters. Experimental results show that methodwe proposed distinctly outperformed baseline algorithms.

Description

SpringerLink Beta -

Links and resources

Tags

community

  • @dblp
  • @chris_o
@chris_o's tags highlighted