Bow (or libbow) is a library of C code useful for writing statistical text analysis, language modeling and information retrieval programs. The current distribution includes the library, as well as front-ends for document classification (rainbow), document
Weka is a collection of machine learning algorithms for data mining tasks. The algorithms can either be applied directly to a dataset or called from your own Java code. Weka contains tools for data pre-processing, classification, regression, clustering, a
R. Almeida and V. Almeida. WWW '04: Proceedings of the 13th International Conference on World Wide Web, pages 413--421. New York, NY, USA, ACM Press (2004)