There is a strong positive correlation between the development of deep learning and the amount of public data available. Not all data can be released in their raw form because of the risk to the privacy of the related individuals. The main objective of privacy-preserving data publication is to anonymize the data while maintaining their utility. In this paper, we propose a privacy-preserving semi-generative adversarial network (PPSGAN) that selectively adds noise to class-independent features of each image to enable the processed image to maintain its original class label. Our experiments on training classifiers with synthetic datasets anonymized with various methods confirm that PPSGAN shows better utility than other conventional methods, including blurring, noise-adding, filtering, and generation using GANs.
[1]
Xiaoqian Jiang,et al.
DPSynthesizer: Differentially Private Data Synthesizer for Privacy Preserving Data Sharing
,
2014,
Proc. VLDB Endow..
[2]
Zhiwei Steven Wu,et al.
Privacy-Preserving Generative Deep Neural Networks Support Clinical Data Sharing
,
2017,
bioRxiv.
[3]
Anand D. Sarwate,et al.
Differentially Private Empirical Risk Minimization
,
2009,
J. Mach. Learn. Res..
[4]
Geoffrey E. Hinton,et al.
Visualizing Data using t-SNE
,
2008
.
[5]
Jihoon Yang,et al.
Latent-Space-Level Image Anonymization With Adversarial Protector Networks
,
2019,
IEEE Access.