Other articles where Thesaurus is discussed: library: Thesauri: A new use of the term thesaurus, now widespread, dates from the early 1950s in the work of H.P. Luhn, at International Business Machines Corporation (IBM), who was searching for a computer process that could create a list of authorized terms for the indexing…
Bow (or libbow) is a library of C code useful for writing statistical text analysis, language modeling and information retrieval programs. The current distribution includes the library, as well as front-ends for document classification (rainbow), document
Emily Drabinski, Queering the Catalog: Queer Theory and the Politics of Correction, The Library Quarterly: Information, Community, Policy, Vol. 83, No. 2 (April 2013), pp. 94-111
Libtextcat is a library with functions that implement the classification technique described in Cavnar & Trenkle, "N-Gram-Based Text Categorization" [1]. It was primarily developed for language guessing, a task on which it is known to perform with near-pe
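The rank-order n-gram profile comparison from Cavnar & Trenkle can be sketched in a few lines of Python. This is an illustrative sketch only, not libtextcat's C API: the function names (`ngram_profile`, `out_of_place`, `guess_category`), the whitespace tokenization, the `_` padding character, and the defaults `max_n=5` and `top_k=300` are all assumptions made for the example.

```python
from collections import Counter


def ngram_profile(text, max_n=5, top_k=300):
    """Return the top_k most frequent character n-grams, most frequent first.

    Assumption: words are split on whitespace and padded with '_' on both
    sides; the original paper tokenizes more carefully.
    """
    counts = Counter()
    for word in text.lower().split():
        padded = f"_{word}_"
        for n in range(1, max_n + 1):
            for i in range(len(padded) - n + 1):
                counts[padded[i:i + n]] += 1
    # Sort by descending count; break ties alphabetically so the result is deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [gram for gram, _ in ranked[:top_k]]


def out_of_place(doc_profile, cat_profile):
    """Sum of rank differences; n-grams missing from the category get a maximum penalty."""
    rank = {gram: r for r, gram in enumerate(cat_profile)}
    max_penalty = len(cat_profile)
    return sum(
        abs(r - rank[gram]) if gram in rank else max_penalty
        for r, gram in enumerate(doc_profile)
    )


def guess_category(text, category_profiles):
    """Return the name of the category whose profile is closest to the text's profile."""
    doc = ngram_profile(text)
    return min(category_profiles, key=lambda name: out_of_place(doc, category_profiles[name]))
```

In use, one profile would be built per language from a training sample, and `guess_category` picks the language with the smallest out-of-place distance. Real profiles need far more training text than a single sentence to be reliable.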
LIBLINEAR is a linear classifier for data with millions of instances and features. It supports L2-regularized logistic regression (LR), L2-loss linear SVM, and L1-loss linear SVM.
Main features of LIBLINEAR include:
* Same data format as LIBSVM, our general-purpose SVM solver, and also similar usage
* Multi-class classification: 1) one-vs-the-rest, 2) Crammer & Singer
* Cross validation for model selection
* Probability estimates (logistic regression only)
* Weights for unbalanced data
* MATLAB/Octave, Java interfaces
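The shared LIBSVM data format mentioned in the feature list stores one instance per line as a label followed by sparse `index:value` pairs, with feature indices starting at 1. A minimal Python sketch of reading such a line follows; the helper names `parse_libsvm_line` and `to_dense` are hypothetical and are not part of LIBLINEAR or its interfaces.

```python
def parse_libsvm_line(line):
    """Parse one line of the sparse format '<label> <index>:<value> ...'.

    Returns (label, features), where features maps a 1-based feature
    index to its value. Features that are absent are implicitly zero.
    """
    parts = line.split()
    if not parts:
        raise ValueError("empty line")
    label = float(parts[0])
    features = {}
    for item in parts[1:]:
        index, value = item.split(":", 1)
        features[int(index)] = float(value)
    return label, features


def to_dense(features, n_features):
    """Expand a sparse feature dict into a dense list of length n_features."""
    dense = [0.0] * n_features
    for index, value in features.items():
        dense[index - 1] = value  # indices in the file are 1-based
    return dense
```

For example, `parse_libsvm_line("+1 1:0.5 3:2")` yields the label `1.0` and the features `{1: 0.5, 3: 2.0}`. Keeping instances sparse is what makes the format practical for data with millions of features.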
W. Martins, M. Goncalves, A. Laender, and G. Pappa. Proceedings of the 9th ACM/IEEE-CS Joint Conference on Digital Libraries, pp. 193-202. New York, NY, USA, ACM (2009)