MALLET is an integrated collection of Java code useful for statistical natural language processing, document classification, clustering, information extraction, and other machine learning applications to text.
A. Takasu. JCDL '03: Proceedings of the 3rd ACM/IEEE-CS joint conference on Digital libraries, pages 49--60. Washington, DC, USA, IEEE Computer Society, (2003)
H. Han, C. Giles, E. Manavoglu, H. Zha, Z. Zhang, and E. Fox. JCDL '03: Proceedings of the 3rd ACM/IEEE-CS joint conference on Digital libraries, pages 37--48. Washington, DC, USA, IEEE Computer Society, (2003)
Y. Zhou, L. Wu, F. Weng, and H. Schmidt. Proceedings of the 2003 conference on Empirical methods in natural language processing, pages 153--159. Morristown, NJ, USA, Association for Computational Linguistics, (2003)
B. Gutmann and K. Kersting. Proceedings of the 15th European Conference on Machine Learning (ECML-2006), volume 4212 of LNAI (Lecture Notes in Artificial Intelligence), pages 174--185. Berlin, Germany, Springer, (September 2006)