Researchers at Google annotated English-language Web pages from the ClueWeb09 and ClueWeb12 corpora. The annotation process was automatic, and hence imperfect. However, the annotations are of generally high quality, as they strove for high precision (and, by necessity, lower recall). For each entity they recognized with high confidence, they provide the beginning and end byte offsets of the entity mention in the input text, its Freebase identifier (mid), and two confidence levels (computed differently, see below).
You might consider using this data in conjunction with the recently released Freebase annotations of several TREC query sets.
. In more recent times, scientist have harnessed the power of the public not only to collect data on a larger scale than perhaps would otherwise be possible, but also to analyse data gathered by professional researchers. Such data analysis projects include Zooniverse’s Galaxy Zoo and Cell Slider projects, whilst WheelMap, Wide Noise and the Opal Tree Health Survey focus on data collection.
Das schicke Bear County Brennholzlager CA1950 bietet Platz für bis zu 2,8 m³ Holz, ein Holzdach mit Dachpappe und ein druckimprägnierter Fußboden gehören zum Lieferumfang. ✓ Rechnungskauf ✓ Trusted Shops zertifiziert ✓ Montageservice
S. Siersdorfer, and S. Sizov. HT '09: Proceedings of the 20th ACM conference on Hypertext and hypermedia, page 261--270. New York, NY, USA, ACM, (2009)
A. Hotho, R. Jäschke, C. Schmitz, and G. Stumme. Proceedings of the First Conceptual Structures Tool Interoperability Workshop at the 14th International Conference on Conceptual Structures, page 87-102. Aalborg, Aalborg Universitetsforlag, (2006)
C. Ordonez. HIKM '06: Proceedings of the international workshop on Healthcare information and knowledge management, page 17--24. New York, NY, USA, ACM Press, (2006)
S. Niwa, T. Doi, and S. Honiden. Proceedings of the Third International Conference on Information Technology: New Generations (TNG'06), page 388-393. (2006)
G. Mishne. WWW '06: Proceedings of the 15th international conference on World Wide Web, page 953--954. New York, NY, USA, ACM Press, (2006)paper presented at the poster track.
L. Wu, X. Hua, N. Yu, W. Ma, and S. Li. MM '08: Proceeding of the 16th ACM international conference on Multimedia, page 31--40. New York, NY, USA, ACM, (2008)
I. Kim. Intelligent Agents: Specification, Modeling, and Applications. 4th Pacific Rim International Workshop on Multi-Agents, PRIMA 2001. Proceedings (Lecture Notes in Artificial Intelligence Vol.2132), page 210--21. Dept. of Comput. Sci., Kyonggi Univ., Suwon, South Korea, Springer-Verlag, (2001)