MALLET is an integrated collection of Java code useful for statistical natural language processing, document classification, clustering, information extraction, and other machine learning applications to text.
A. Takasu. JCDL '03: Proceedings of the 3rd ACM/IEEE-CS joint conference on Digital libraries, page 49--60. Washington, DC, USA, IEEE Computer Society, (2003)
J. Lafferty, A. McCallum, and F. Pereira. Proc. 18th International Conf. on Machine Learning, page 282--289. Morgan Kaufmann, San Francisco, CA, (2001)