,

Data Sharing Taxonomy Records for Security Conservation

, и .
Computer Science & Information Technology (CS & IT), 7 (2): 49-58 (января 2017)

Аннотация

Here, we discuss the Classification is a fundamental problem in data analysis. Training a classifier requires accessing a large collection of data. Releasing person-specific data, such as customer data or patient records, may pose a threat to an individual’s privacy. Even after removing explicit identifying information such as Name and SSN, it is still possible to link released records back to their identities by matching some combination of non identifying attributes such as Sex,Zip,Birthdate. A useful approach to combat such linking attacks, called k-anonymization is anonymizing the linking attributes so that at least k released records match each value combination of the linking attributes. Our goal is to find a k-anonymization which preserves the classification structure. Experiments of real-life data show that the quality of classification can be preserved even for highly restrictive anonymity requirements.

тэги

Пользователи данного ресурса

  • @laimbee

Комментарии и рецензии