To help researchers investigate relation extraction, we’re releasing a human-judged dataset of two relations about public figures on Wikipedia: nearly 10,000 examples of “place of birth”, and over 40,000 examples of “attended or graduated from an institution”. Each of these was judged by at least 5 raters, and can be used to train or evaluate relation extraction systems. We also plan to release more relations of new types in the coming months.
M. Baroni, и R. Zamparelli. Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, стр. 1183--1193. Stroudsburg, PA, USA, Association for Computational Linguistics, (2010)
R. Reichart, и A. Rappoport. Proceedings of the Thirteenth Conference on Computational Natural Language Learning, стр. 156--164. Stroudsburg, PA, USA, Association for Computational Linguistics, (2009)
C. Manning. Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part I, стр. 171--189. Berlin, Heidelberg, Springer-Verlag, (2011)