OpenNLP is an organizational center for open source projects related to natural language processing. It hosts a variety of Java-based NLP tools that perform sentence detection, tokenization, POS tagging, chunking and parsing, named-entity detection, and coreference resolution using the OpenNLP Maxent machine learning package.
ASV Toolbox is a modular collection of tools for the exploration of written language data. The tools work on either word lists or text and solve several linguistic classification and clustering tasks. The topics covered include language detection, POS tagging, base form reduction, named entity recognition, and terminology extraction.
MuNPEx is a multi-lingual noun phrase (NP) extraction component developed for the GATE architecture, implemented in JAPE. It currently supports English, German, French, and Spanish (in beta).
MuNPEx requires a part-of-speech (POS) tagger to work and can additionally use detected named entities (NEs) to improve chunking performance. Please read the documentation (or source code) for more details.
SentiWordNet is a lexical resource for opinion mining. It assigns three sentiment scores to each WordNet synset: positivity, negativity, and objectivity.
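As a rough illustration of the three-score scheme, the sketch below models one synset entry in Python. The class name `SynsetScores`, the identifier `example.a.01`, and the score values are hypothetical and are not taken from the SentiWordNet distribution. The sketch assumes the three scores of a synset sum to 1, so objectivity is derived from the other two.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SynsetScores:
    """Hypothetical container for the sentiment scores of one synset."""

    synset_id: str
    positivity: float
    negativity: float

    def __post_init__(self) -> None:
        # Each score must be non-negative and the pair must leave room
        # for an objectivity score between 0 and 1.
        if self.positivity < 0 or self.negativity < 0:
            raise ValueError("scores must be non-negative")
        if self.positivity + self.negativity > 1:
            raise ValueError("positivity + negativity must not exceed 1")

    @property
    def objectivity(self) -> float:
        # Assumption: the three scores sum to 1, so objectivity is the remainder.
        return 1.0 - (self.positivity + self.negativity)

    def polarity(self) -> str:
        # Simple illustrative rule: compare the two subjective scores.
        if self.positivity > self.negativity:
            return "positive"
        if self.negativity > self.positivity:
            return "negative"
        return "neutral"


# Made-up scores for a made-up synset identifier.
example = SynsetScores("example.a.01", positivity=0.75, negativity=0.0)
print(example.objectivity, example.polarity())
```

Deriving objectivity instead of storing it keeps the three values consistent by construction; a loader for the real resource file would need to follow that file's own column layout.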
The TreeTagger is a tool for annotating text with part-of-speech and lemma information. It was developed within the TC project at the Institute for Computational Linguistics of the University of Stuttgart. The TreeTagger has been successfully used to tag German, English, French, Italian, Dutch, Spanish, Bulgarian, Russian, Greek, Portuguese, Chinese, and Old French texts, and is easily adaptable to other languages if a lexicon and a manually tagged training corpus are available.
Online demo of the TreeTagger, a tool for annotating text with part-of-speech and lemma information, developed at the Institute for Computational Linguistics of the University of Stuttgart.
Shalmaneser is a supervised learning toolbox for shallow semantic parsing, i.e. the automatic assignment of semantic classes and roles to text. The system was developed for Frame Semantics; thus we use Frame Semantics terminology and call the classes frames and the roles frame elements. However, the architecture is reasonably general, and with a certain amount of adaptation, Shalmaneser should be usable for other paradigms (e.g., PropBank roles) as well. Shalmaneser caters both to end users and to researchers.
S. Bloehdorn, and A. Moschitti. CIKM '07: Proceedings of the sixteenth ACM conference on information and knowledge management, page 861--864. New York, NY, USA, ACM, (2007)
G. Siolas, and F. d'Alché Buc. IJCNN '00: Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks (IJCNN'00)-Volume 5, page 5205. Washington, DC, USA, IEEE Computer Society, (2000)
X. Li, and D. Roth. Proceedings of the 19th international conference on Computational linguistics, page 1--7. Morristown, NJ, USA, Association for Computational Linguistics, (2002)
M. Collins, and N. Duffy. ACL '02: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, page 263--270. Morristown, NJ, USA, Association for Computational Linguistics, (2002)
A. Moschitti, and F. Zanzotto. ICML '07: Proceedings of the 24th international conference on Machine learning, page 649--656. New York, NY, USA, ACM, (2007)
R. Swan, and J. Allan. CIKM '99: Proceedings of the eighth international conference on Information and knowledge management, page 38--45. New York, NY, USA, ACM, (1999)
J. Kleinberg. KDD '02: Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, page 91--101. New York, NY, USA, ACM, (2002)
E. Riloff. Connectionist, statistical, and symbolic approaches to learning for natural language processing, 1040, page 275--289. Heidelberg, DE, Springer Verlag, (1996)
P. Koomen, V. Punyakanok, D. Roth, and W. Yih. Proceedings of the Ninth Conference on Computational Natural Language Learning (CoNLL-2005), page 181--184. Ann Arbor, Michigan, Association for Computational Linguistics, (June 2005)
A. Devitt, and K. Ahmad. Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, page 984--991. Prague, Czech Republic, Association for Computational Linguistics, (June 2007)
M. Mittermayer, and G. Knolmayer. ICDM '06: Proceedings of the Sixth International Conference on Data Mining, page 1002--1007. Washington, DC, USA, IEEE Computer Society, (2006)
M. Mittermayer. HICSS '04: Proceedings of the 37th Annual Hawaii International Conference on System Sciences (HICSS'04) - Track 3, page 30064.2. Washington, DC, USA, IEEE Computer Society, (2004)
X. Wan, and J. Yang. SIGIR '07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, page 143--150. New York, NY, USA, ACM, (2007)
G. Fung, J. Yu, and W. Lam. Computational Intelligence for Financial Engineering, 2003. Proceedings. 2003 IEEE International Conference on, (20-23 March 2003)
A. Esuli, and F. Sebastiani. Proceedings of EACL-06, 11th Conference of the European Chapter of the Association for Computational Linguistics, Trento, IT., (2006)
M. Gamon. COLING '04: Proceedings of the 20th international conference on Computational Linguistics, page 841. Morristown, NJ, USA, Association for Computational Linguistics, (2004)