This collection consists of ~20M web queries collected from ~650k users over three months.
The data is sorted by anonymous user ID and sequentially arranged.
Here at Google Research we have been using word n-gram models for a variety of R&D projects, such as statistical machine translation, speech recognition, spelling correction, entity detection, information extraction, and others. While such models have usu
E. Kanemasu. (1994)Data set. Available on-line http://www.daac.ornl.gov from Oak Ridge National Laboratory Distributed Active Archive Center, Oak Ridge, Tennessee, U.S.A. doi:10.3334/ORNLDAAC/111. Also published in D. E. Strebel, D. R. Landis, K. F. Huemmrich, and B. W. Meeson (eds.), Collected Data of the First ISLSCP Field Experiment, Vol. 1: Surface Observations and Non-Image Data Sets. CD-ROM. National Aeronautics and Space Administration, Goddard Space Flight Center, Greenbelt, Maryland, U.S.A. (available from http://www.daac.ornl.gov)..
J. Pfister, K. Kobs, and A. Hotho. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, page 816-825. (June 2021)
S. Bowman, G. Angeli, C. Potts, and C. Manning. Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP), Association for Computational Linguistics, (2015)
X. Wang, Z. Wang, X. Han, W. Jiang, R. Han, Z. Liu, J. Li, P. Li, Y. Lin, and J. Zhou. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), page 1652--1671. Online, Association for Computational Linguistics, (November 2020)
J. McAuley, C. Targett, Q. Shi, and A. Van Den Hengel. Proceedings of the 38th international ACM SIGIR conference on research and development in information retrieval, page 43--52. (2015)
T. McCoy, E. Pavlick, and T. Linzen. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, page 3428--3448. Florence, Italy, Association for Computational Linguistics, (July 2019)
A. Jaiswal, S. Singh, and S. Tripathy. 2023 14th International Conference on Computing Communication and Networking Technologies (ICCCNT), page 1-6. IEEE, (July 2023)
S. Wunderlich, M. Ring, D. Landes, and A. Hotho. International Joint Conference: 12th International Conference on Computational Intelligence in Security for Information Systems (CISIS 2019) and 10th International Conference on EUropean Transnational Education (ICEUTE 2019) - Seville, Spain, May 13-15, 2019, Proceedings, volume 951 of Advances in Intelligent Systems and Computing, page 14--24. Springer, (2019)
R. Snow, B. O'Connor, D. Jurafsky, and A. Ng. EMNLP '08: Proceedings of the Conference on Empirical Methods in Natural Language Processing, page 254--263. Morristown, NJ, USA, Association for Computational Linguistics, (2008)