An Approach for Collaborative Data Publishing Using Self-adaptive Genetic Grey Wolf Optimizer

This paper introduces an algorithm, termed self-adaptive genetic grey wolf optimizer (self-adaptive genetic GWO), for privacy preservation using a C-mixture factor. The C-mixture factor improves the privacy of data, in which the data does not satisfy the privacy constraints, such as l-diversity, m-privacy, and k-anonymity. Experimentation is carried out using the adult dataset, and the effectiveness of the proposed self-adaptive genetic GWO is checked depending on the information loss and the average equivalence class metric values and is evaluated to be the best when compared to other existing techniques with low information loss value as 0.1724 and average equivalence class value as 0.71, respectively.

[1]  Philip S. Yu,et al.  Differentially Private Data Publishing and Analysis: A Survey , 2017, IEEE Transactions on Knowledge and Data Engineering.

[2]  Sagar S. Badhiye,et al.  Preserving privacy in collaborative data publishing from heterogeneity attack , 2016, 2016 World Conference on Futuristic Trends in Research and Innovation for Social Welfare (Startup Conclave).

[3]  Charu C. Aggarwal,et al.  Managing dimensionality in data privacy anonymization , 2015, Knowledge and Information Systems.

[4]  Jingyu Hua,et al.  Privacy-Preserving Utility Verification of the Data Published by Non-Interactive Differentially Private Mechanisms , 2016, IEEE Transactions on Information Forensics and Security.

[5]  Bradley Malin,et al.  COAT: COnstraint-based anonymization of transactions , 2010, Knowledge and Information Systems.

[6]  Elisa Bertino,et al.  Efficient systematic clustering method for k-anonymization , 2011, Acta Informatica.

[7]  Qi Wang,et al.  On the privacy preserving properties of random data perturbation techniques , 2003, Third IEEE International Conference on Data Mining.

[8]  Qingshan Jiang,et al.  A General Framework for Privacy Preserving Sequential Data Publishing , 2017, 2017 31st International Conference on Advanced Information Networking and Applications Workshops (WAINA).

[9]  J. McCall,et al.  Genetic algorithms for modelling and optimisation , 2005 .

[10]  Philip S. Yu,et al.  Differentially private query learning: From data publishing to model publishing , 2017, 2017 IEEE International Conference on Big Data (Big Data).

[11]  Ali Kashif Bashir,et al.  Improved Generalization for Secure Data Publishing , 2018, IEEE Access.

[12]  Deepali Vora,et al.  PRIVACY preservation in big data using anonymization techniques , 2017, 2017 International Conference on Data Management, Analytics and Innovation (ICDMAI).

[13]  Jiajin Le,et al.  A self-adaptation data publishing algorithm framework , 2011, 2011 International Conference on Mechatronic Science, Electric Engineering and Computer (MEC).

[14]  Ai-qiang Gao,et al.  Privacy preservation for attribute order sensitive workload in medical data publishing , 2009, 2009 IEEE International Symposium on IT in Medicine & Education.

[15]  Xiaojiang Du,et al.  Towards Privacy Preserving Publishing of Set-Valued Data on Hybrid Cloud , 2018, IEEE Transactions on Cloud Computing.

[16]  Andrew Lewis,et al.  Grey Wolf Optimizer , 2014, Adv. Eng. Softw..

[17]  T. Senthil Murugan,et al.  C-mixture and multi-constraints based genetic algorithm for collaborative data publishing , 2016, J. King Saud Univ. Comput. Inf. Sci..

[18]  Benjamin C. M. Fung,et al.  m-Privacy for Collaborative Data Publishing , 2011, CollaborateCom 2011.

[19]  Suman Madan,et al.  Privacy preserving data publishing and data anonymization approaches: A review , 2017, 2017 International Conference on Computing, Communication and Automation (ICCCA).