The Net Data Directory collects and shares information on different sources of data about the Internet. For more about the project, see our about page. To get started, use the search box below, or check out our quick start guide.
Web search engines have changed our lives - enabling instant access to information about subjects that are both deeply important to us, as well as passing whims. The search engines that provide answers to our search queries also log those queries, in order to improve their algorithms. Academic research on search queries has shown that they can provide valuable information on diverse topics including word and phrase similarity, topical seasonality and may even have potential for sociology, as well as providing a barometer of the popularity of many subjects. At the same time, individuals are rightly concerned about what the consequences of accidental leaking or deliberate sharing of this information may mean for their privacy. In this talk I will cover the applications which have benefited from mining query logs, the risks that privacy can be breached by sharing query logs, and current algorithms for mining logs in a way to prevent privacy breaches.
H. Zhang, A. Santos, and J. Freire. Proceedings of the 30th ACM International Conference on Information &$\mathsemicolon$ Knowledge Management, ACM, (October 2021)
M. Paris, and R. Jäschke. Proceedings of the 14th International Conference on Knowledge Science, Engineering and Management, volume 12816 of Lecture Notes in Artificial Intelligence, page 1--14. Springer, (2021)
R. Jäschke, and S. Rudolph. Contributions to the 11th International Conference on Formal Concept Analysis, page 19--34. Technische Universität Dresden, (May 2013)
F. Suchanek, G. Kasneci, and G. Weikum. Proceedings of the 16th international conference on World Wide Web, page 697--706. New York, NY, USA, ACM, (2007)
B. Pereira Nunes, R. Kawase, S. Dietze, D. Taibi, M. Casanova, and W. Nejdl. Proceedings of the Web of Linked Entities Workshop in conjuction with the 11th International Semantic Web Conference, volume 906 of CEUR-WS.org, page 45--57. (November 2012)
T. Joachims. Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, page 133--142. New York, NY, USA, ACM, (2002)
A. Brew, D. Greene, and P. Cunningham. Proceedings of the 19th European Conference on Artificial Intelligence, volume 215 of Frontiers in Artificial Intelligence and Applications, page 145--150. Amsterdam, The Netherlands, The Netherlands, IOS Press, (2010)
S. Tramp, P. Frischmuth, T. Ermilov, and S. Auer. Proceedings of the EKAW 2010 - Knowledge Engineering and Knowledge Management by the Masses; 11th October-15th October 2010 - Lisbon, Portugal, volume 6317 of Lecture Notes in Artificial Intelligence, page 135--149. Berlin / Heidelberg, Springer, (October 2010)
T. Joachims, L. Granka, B. Pan, H. Hembrooke, and G. Gay. SIGIR '05: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, page 154--161. New York, NY, USA, ACM, (2005)