Privacy-preserving Query Log Mining for Business Confidentiality Protection

ACM Trans. Web, 4 (3): 10:1--10:26 (July 2010)
DOI: 10.1145/1806916.1806919

Abstract

We introduce the concern of confidentiality protection of business information for the publication of search engine query logs and derived data. We study business confidentiality as the protection of nonpublic data from institutions, such as companies and people in the public eye. In particular, we relate this concern to the involuntary exposure of confidential Web site information, and we transfer this problem into the field of privacy-preserving data mining. We characterize the possible adversaries interested in disclosing Web site confidential data and the attack strategies that they could use. These attacks are based on different vulnerabilities found in query logs, and we present several anonymization heuristics to prevent them. We perform an experimental evaluation to estimate the remaining utility of the log after the application of our anonymization techniques. Our experimental results show that a query log can be anonymized against these specific attacks while retaining a significant volume of useful data.
