A Privacy Preserving Scheme for Big data Publishing in the Cloud using k-Anonymization and Hybridized Optimization Algorithm

One of the emerging research areas in the recent years is big data due to the enormous data flow in various fields, like hospitals, government records, social sites, etc. In this field, cloud computing has drawn significant importance as the user can transfer huge volume of data through the servers. Hence, it is necessary to protect the data so that the third party cannot access the information provided by the cloud users. This work introduces the k-anonymization model for privacy preservation in the cloud. The proposed scheme is driven by the newly developed optimization model, namely Dragon Particle Swarm Optimization (Dragon-PSO) which combines the Dragonfly Algorithm (DA) and Particle Swarm Optimization (PSO) algorithm. The proposed scheme derives the fitness function for the proposed Dragon-PSO algorithm attaining high value for privacy and utility. The proposed scheme is evaluated based on two metrics, Information Loss and Classification Accuracy.

[1]  S. K. Nagar,et al.  Particle swarm optimization algorithm and its parameters: A review , 2016, 2016 International Conference on Control, Computing, Communication and Materials (ICCCCM).

[2]  Suman Madan,et al.  Privacy preserving data publishing and data anonymization approaches: A review , 2017, 2017 International Conference on Computing, Communication and Automation (ICCCA).

[3]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[4]  Xindong Wu,et al.  Data mining with big data , 2014, IEEE Transactions on Knowledge and Data Engineering.

[5]  H. Zhen,et al.  A new anonymity model for privacy-preserving data publishing , 2014, China Communications.

[6]  Seyedali Mirjalili,et al.  Dragonfly algorithm: a new meta-heuristic optimization technique for solving single-objective, discrete, and multi-objective problems , 2015, Neural Computing and Applications.

[7]  Wenyong Wang,et al.  Efran (O): "Efficient Scalar Homomorphic Scheme on MapReduce for Data Privacy Preserving" , 2016, 2016 IEEE 3rd International Conference on Cyber Security and Cloud Computing (CSCloud).

[8]  Wei Tian,et al.  A Survey of Privacy Preserving Data Publishing using Generalization and Suppression , 2014 .

[9]  Philip S. Yu,et al.  Privacy-preserving data publishing: A survey of recent developments , 2010, CSUR.

[10]  Charu C. Aggarwal,et al.  Privacy-preserving big data publishing , 2015, SSDBM.

[11]  Chunxiao Jiang,et al.  Information Security in Big Data: Privacy and Data Mining , 2014, IEEE Access.

[12]  Peng Liu,et al.  An Improved Privacy Preserving Algorithm for Publishing Social Network Data , 2013, 2013 IEEE 10th International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing.

[13]  Yufei Tao,et al.  Anatomy: simple and effective privacy preservation , 2006, VLDB.

[14]  Jun Yang,et al.  New Data Publishing Framework in the Big Data Environments , 2014, 2014 Ninth International Conference on P2P, Parallel, Grid, Cloud and Internet Computing.

[15]  Ji Zhang,et al.  On Efficient and Robust Anonymization for Privacy Protection on Massive Streaming Categorical Information , 2017, IEEE Transactions on Dependable and Secure Computing.

[16]  Suman Madan,et al.  Hybrid privacy preservation model for big data publishing on cloud , 2018 .

[17]  Hong Ding,et al.  Practical m-k-Anonymization for Collaborative Data Publishing without Trusted Third Party , 2017, Secur. Commun. Networks.

[18]  Philip S. Yu,et al.  Anonymizing Classification Data for Privacy Preservation , 2007, IEEE Transactions on Knowledge and Data Engineering.

[19]  Rajkumar Buyya,et al.  Ensuring Security and Privacy Preservation for Cloud Data Services , 2016, ACM Comput. Surv..

[20]  A. K. Ilavarasi,et al.  An evolutionary feature set decomposition based anonymization for classification workloads: Privacy Preserving Data Mining , 2017, Cluster Computing.