"A Practical Anonymization Approach for Imbalanced Datasets"

Person-specific data owned by different data holders is usually anonymized before being shared with researchers or data-miners. Anonymization is a pertinent solution for releasing useful information while ensuring privacy. Many anonymization approaches have been proposed but majority of the existing approaches do not consider the influence of user’s attributes on privacy and utility. Consequently, privacy preservation and utility enhancement become challenging and particularly difficult when anonymizing imbalanced datasets that contain less heterogeneous values. To address these problems for imbalanced datasets, we propose a practical anonymization approach that effectively preserves users’ privacy while maintaining high utility of anonymous data. It quantifies the influence of attributes on the degree of user’s reidentification in order to protect user’s privacy. Data transformation is performed adjustably considering the influence of users’ attributes and their distributions. Experimental results obtained from real-world datasets show the efficacy of our approach and verify the abovementioned assertions.

Keywords: imbalanced datasets; anonymization; anonymization approach; practical anonymization; approach imbalanced

Journal Title: IT Professional
Year Published: 2022

Link to full text (if available)

Share on Social Media: Sign Up to like & get
recommendations!
1

LAUSR

You are not signed in:

Sign Up!

Related content

More Information News Social Media Video Recommended