Article,

Enhancing Privacy of Confidential Data using K Anonymization

, and .
International Journal on Network Security, 1 (2): 4 (July 2010)

Abstract

Recent advances in the field of data collection and related technologies have inaugurated a new era of research where existing data mining algorithms should be reconsidered from a different point of view, this of privacy preservation. Much research has been done recently on privacy preserving data mining (PPDM) based on perturbation, randomization and secure multiparty computations and more recently on anonymity including k-anonymity and l-diversity. We use the technique of k-Anonymization to de-associate sensitive attributes from the corresponding identifiers. This is done by anonymizing the linking attributes so that at least k released records match each value combination of the linking attributes. This paper proposes a k- Anonymization solution for classification. The proposed method has been implemented and evaluated using UCI repository datasets. After the k-anonymization solution is determined for the original data, classification, a data mining technique using the ID3 algorithm, is applied on both the original table and the compressed table .The accuracy of the both is compared by determining the entropy and the information gain values. Experiments show that the quality of classification can be preserved even for highly restrictive anonymity requirements.

Tags

Users

  • @ideseditor

Comments and Reviews