Snorkel is a system for programmatically building and managing training datasets without manual labeling. In Snorkel, users can develop large training datasets in hours or days rather than hand-labeling them over weeks or months.
T. Finin, W. Murnane, A. Karandikar, N. Keller, J. Martineau, and M. Dredze. Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk, page 80--88. Stroudsburg, PA, USA, Association for Computational Linguistics, (2010)
B. Pereira Nunes, R. Kawase, S. Dietze, D. Taibi, M. Casanova, and W. Nejdl. Proceedings of the Web of Linked Entities Workshop in conjuction with the 11th International Semantic Web Conference, volume 906 of CEUR-WS.org, page 45--57. (November 2012)