Part-of-Speech Tagging, Phrase Chunking and Named Entity Recognition with Python NLTK. Taggers and chunkers trained on treebank, brown, conll2000, ieer.
Stanford CoreNLP provides a set of human language technology tools. It can give the base forms of words, their parts of speech, whether they are names of companies, people, etc., normalize dates, times, and numeric quantities, mark up the structure of sentences in terms of phrases and syntactic dependencies, indicate which noun phrases refer to the same entities, indicate sentiment, extract particular or open-class relations between entity mentions, get the quotes people said, etc.
Stanford CoreNLP provides a set of natural language analysis tools. It can give the base forms of words, their parts of speech, whether they are names of companies, people, etc., normalize dates, times, and numeric quantities, mark up the structure of sentences in terms of phrases and word dependencies, indicate which noun phrases refer to the same entities, indicate sentiment, extract open-class relations between mentions, etc.
To help researchers investigate relation extraction, we’re releasing a human-judged dataset of two relations about public figures on Wikipedia: nearly 10,000 examples of “place of birth”, and over 40,000 examples of “attended or graduated from an institution”. Each of these was judged by at least 5 raters, and can be used to train or evaluate relation extraction systems. We also plan to release more relations of new types in the coming months.
B. Klimek, M. Ackermann, A. Kirschenbaum, and S. Hellmann. Proceedings of the International Conference of the German Society for Computational Linguistics and Language Technology, German Society for Computational Linguistics and Language Technology, (2017)
M. Schwab, R. Jäschke, and F. Fischer. Proceedings of the 7th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, pages 110--115. Association for Computational Linguistics, (2023)
M. Peters, W. Ammar, C. Bhagavatula, and R. Power. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 1, pages 1756--1765. (2017)