DP-GAN: Differentially private consecutive data publishing using generative adversarial nets

Abstract: In the era of big data, increasingly massive volumes of data are generated and published consecutively for both research and commercial purposes. The potential value of sensitive information also attracts interest from adversaries and thereby raises public concern. Current research mostly focuses on privacy-preserving data publishing in a static manner rather than taking the dynamics and correlation of context into consideration. Motivated by this, we propose a novel approach that combines differential privacy with generative adversarial nets. Generative adversarial nets and their extensions are used to generate a synthetic data set with indistinguishable statistical features, while differential privacy guarantees a trade-off between privacy protection and data utility. By employing a min-max game with three players, we devise a deep generative model, namely the DP-GAN model, for synthetic data generation that fulfills the privacy constraints in a differentially private manner. Extensive simulation results on a real-world data set demonstrate the superiority of the proposed model in terms of privacy protection, data utility, and efficiency.
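To make the privacy-utility trade-off mentioned above concrete, the following is a minimal sketch using the standard Laplace mechanism on a counting query. It is not the DP-GAN model, and the abstract does not state which noise mechanism the model uses. The function names, the count of 1000, and the sensitivity of 1 are illustrative assumptions.

```python
import random


def laplace_noise(scale, rng):
    """Draw one sample from Laplace(0, scale).

    The difference of two i.i.d. exponential variables with mean `scale`
    follows a Laplace distribution with that scale.
    """
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def private_count(true_count, epsilon, rng, sensitivity=1.0):
    """Release a count under epsilon-differential privacy (Laplace mechanism).

    Assumption: adding or removing one record changes the count by at most
    `sensitivity`, so the noise scale is sensitivity / epsilon.
    """
    if epsilon <= 0:
        raise ValueError("epsilon must be positive")
    return true_count + laplace_noise(sensitivity / epsilon, rng)


def mean_abs_error(true_count, epsilon, trials=20000, seed=0):
    """Estimate the average absolute error of the released count."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        total += abs(private_count(true_count, epsilon, rng) - true_count)
    return total / trials


if __name__ == "__main__":
    # Smaller epsilon means stronger privacy and a noisier (less useful) answer.
    for eps in (0.1, 1.0, 10.0):
        err = mean_abs_error(1000, eps)
        print(f"epsilon={eps}: mean absolute error ~ {err:.3f}")
```

The expected absolute error of this mechanism equals sensitivity / epsilon, so reducing the privacy budget by a factor of ten costs roughly a factor of ten in accuracy. That is the trade-off a differentially private generative model has to balance.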
