To help researchers investigate relation extraction, we’re releasing a human-judged dataset of two relations about public figures on Wikipedia: nearly 10,000 examples of “place of birth”, and over 40,000 examples of “attended or graduated from an institution”. Each of these was judged by at least 5 raters, and can be used to train or evaluate relation extraction systems. We also plan to release more relations of new types in the coming months.
J. Chu-Carroll, and J. Prager. CIKM '07: Proceedings of the sixteenth ACM conference on Conference on information and knowledge management, page 505--514. New York, NY, USA, ACM, (2007)
L. Lesmo, A. Mazzei, and D. Radicioni. HT '09: Proceedings of the Twentieth ACM Conference on Hypertext and Hypermedia, New York, NY, USA, ACM, (July 2009)
M. Romanello, M. Berti, A. Babeu, and G. Crane. HT '09: Proceedings of the Twentieth ACM Conference on Hypertext and Hypermedia, New York, NY, USA, ACM, (July 2009)