ASV Toolbox is a modular collection of tools for the exploration of written language data. They work either on word lists or text and solve several linguistic classification and clustering tasks. The topics covered contain language detection, POS-tagging, base form reduction, named entity recognition, and terminology extraction.
MuNPEx is a multi-lingual noun phrase (NP) extraction component developed for the GATE architecture, implemented in JAPE. It currently supports English, German, French, and Spanish (in beta).
MuNPEx requires a part-of-speech (POS) tagger to work and can additionally use detected named entities (NEs) to improve chunking performance. Please read the documentation (or source code) for more details.
J. Kleinberg. KDD '02: Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, page 91--101. New York, NY, USA, ACM, (2002)
X. Wan, and J. Yang. SIGIR '07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, page 143--150. New York, NY, USA, ACM, (2007)
A. Esuli, and F. Sebastiani. Proceedings of EACL-06, 11th Conference of the European Chapter of the Association for Computational Linguistics, Trento, IT., (2006)
R. Swan, and J. Allan. CIKM '99: Proceedings of the eighth international conference on Information and knowledge management, page 38--45. New York, NY, USA, ACM, (1999)
A. Moschitti, and F. Zanzotto. ICML '07: Proceedings of the 24th international conference on Machine learning, page 649--656. New York, NY, USA, ACM, (2007)
M. Mittermayer. HICSS '04: Proceedings of the Proceedings of the 37th Annual Hawaii International Conference on System Sciences (HICSS'04) - Track 3, page 30064.2. Washington, DC, USA, IEEE Computer Society, (2004)
P. Koomen, V. Punyakanok, D. Roth, and W. tau Yih. Proceedings of the Ninth Conference on Computational Natural Language Learning (CoNLL-2005), page 181--184. Ann Arbor, Michigan, Association for Computational Linguistics, (June 2005)
X. Li, and D. Roth. Proceedings of the 19th international conference on Computational linguistics, page 1--7. Morristown, NJ, USA, Association for Computational Linguistics, (2002)