I've been thinking about the best approach to implement pure function verification in the Scala compiler. An approach similar to the one in D would fit a lot better than the one used in Haskell (which would break all existing code and cause some problems due to strict evaluation). A solution using annotations would be quite simple to implement:
Researchers at Google annotated English-language Web pages from the ClueWeb09 and ClueWeb12 corpora. The annotation process was automatic, and hence imperfect. However, the annotations are of generally high quality, as they strove for high precision (and, by necessity, lower recall). For each entity they recognized with high confidence, they provide the beginning and end byte offsets of the entity mention in the input text, its Freebase identifier (mid), and two confidence levels (computed differently, see below).
You might consider using this data in conjunction with the recently released Freebase annotations of several TREC query sets.
Concept search, full-text search and annotation structure search in one scaleable index: "Mímir is a multi-paradigm information management index and repository which can be used to index and search over text, annotations, semantic schemas (ontologies), and semantic meta-data (instance data). It allows queries that arbitrarily mix full-text, structural, linguistic and semantic queries and that can scale to gigabytes of text. A typical semantic annotation project deals with large quantities of data of different kinds. Mímir provides a framework for implementing indexing and search functionality across all these data type."
thumbtack collect, organize, share use thumbtack to collect a list of your favorite restaurants and share them with your friends plan a trip- collect information about places to stay and things to do research your next purchase- store, analyze and sift through your options in thumbtack take notes and share them with your team
J. Parvanova, V. Alexiev, и S. Kostadinov. International Workshop on Collaborative Annotations in Shared Environment: metadata, vocabularies and techniques in the Digital Humanities (DH-CASE 2013). Collocated with DocEng 2013, Florence, Italy, (сентября 2013)
R. Snow, B. O'Connor, D. Jurafsky, и A. Ng. Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, стр. 254--263. Honolulu, Hawaii, Association for Computational Linguistics, (октября 2008)
M. Sabou, K. Bontcheva, L. Derczynski, и A. Scharl. Proceedings of the Ninth International Conference on Language Resources and Evaluation, ŁREC\ 2014, Reykjavik, Iceland, May 26-31, 2014, стр. 859--866. European Language Resources Association \(ELRA)\, (2014)
B. USADEL, F. POREE, A. NAGEL, M. LOHSE, A. CZEDIK-EYSENBERG, и M. STITT. Plant Cell Environ, 32 (9):
1211-29(2009)Usadel, Bjorn Poree, Fabien Nagel, Axel Lohse, Marc Czedik-Eysenberg, Angelika Stitt, Mark Comparative Study Research Support, Non-U.S. Gov't United States Plant, cell & environment Plant Cell Environ. 2009 Sep;32(9):1211-29. Epub 2009 Mar 24..
D. Weber, A. Voit, G. Kollotzek, и N. Henze. Proceedings of the 18th International Conference on Mobile and Ubiquitous Multimedia, стр. 24:1--24:12. New York, NY, USA, ACM, (2019)
J. Kurhila, M. Miettinen, P. Nokelainen, P. Flor&\#233;en, и H. Tirri. ITiCSE '03: Proceedings of the 8th annual conference on Innovation and technology in computer science education, 35, стр. 173--177. New York, NY, USA, ACM Press, (сентября 2003)
C. Wang, и G. Chen. ITiCSE '04: Proceedings of the 9th annual SIGCSE conference on Innovation and technology in computer science education, 36, стр. 132--136. New York, NY, USA, ACM Press, (сентября 2004)
B. Brush, D. Bargeron, J. Grudin, и A. Gupta. CHI '02: Proceedings of the SIGCHI conference on Human factors in computing systems, стр. 89--96. New York, NY, USA, ACM Press, (2002)