Privacy-preserving association rule mining for horizontally partitioned healthcare data: a case study on the heart diseases

In recent years, a trend of electronic health record (EHR) system can be seen increasingly in the hospitals, which has generated huge amount of electronically stored data of patients. Association rule mining technique is very helpful in the numerous applications of healthcare (e.g., correlation between disease and symptoms, disease and offering effective treatment and predicting risks of disease based on the historical data, etc.). The data collected by an EHR system are very important for the medical research. Currently, a patient health report is derived on the basis of a physician’s own experience and on the association rule mining results of a local EHR system maintained by a particular hospital. Association rule mining results will be more accurate if the data of all local EHR systems are integrated and association rule mining is performed. Integration of local EHR systems requires the sharing of local EHR data. Sharing of patient records violates the privacy of patients. Hence, medical research is focused on the problem of mining association rules without sharing of local private EHR data. Privacy-preserving distributed association rule mining (PPDARM) solves this issue by mining the association rules while preserving the privacy of patients. In this paper, an approach for the PPDARM is proposed for collaboratively performing association rule mining by all local EHR systems while preserving the privacy. The proposed approach is also analysed with the heart disease dataset.

[1]  Jimeng Sun,et al.  Publishing data from electronic health records while preserving privacy: A survey of algorithms , 2014, J. Biomed. Informatics.

[2]  Xiang-Yang Li,et al.  Collusion-Tolerable Privacy-Preserving Sum and Product Calculation without Secure Channel , 2015, IEEE Transactions on Dependable and Secure Computing.

[3]  George Hripcsak,et al.  Electronic Health Record Systems , 2014 .

[4]  Sameh A. Salem,et al.  Privacy-Preserving Data Mining (PPDM) Method for Horizontally Partitioned Data , 2012 .

[5]  Latanya Sweeney,et al.  Matching Known Patients to Health Records in Washington State Data , 2013, ArXiv.

[6]  Carlos Ordonez,et al.  Association rule discovery with the train and test approach for heart disease prediction , 2006, IEEE Transactions on Information Technology in Biomedicine.

[7]  Chris Clifton,et al.  Privacy preserving data mining over vertically partitioned data , 2004 .

[8]  Chris Clifton,et al.  Defining Privacy for Data Mining , 2002 .

[9]  Hoai Bac Le,et al.  An Enhanced Scheme for Privacy-Preserving Association Rules Mining on Horizontally Distributed Databases , 2012, 2012 IEEE RIVF International Conference on Computing & Communication Technologies, Research, Innovation, and Vision for the Future.

[10]  Li Yan,et al.  Privacy-preserving distributed association rule mining based on the secret sharing technique , 2010, The 2nd International Conference on Software Engineering and Data Mining.

[11]  Murat Kantarcioglu,et al.  A Survey of Privacy-Preserving Methods Across Horizontally Partitioned Data , 2008, Privacy-Preserving Data Mining.

[12]  Yehuda Lindell,et al.  Privacy Preserving Data Mining , 2000, Journal of Cryptology.

[13]  Sellappan Palaniappan,et al.  Intelligent heart disease prediction system using data mining techniques , 2008, 2008 IEEE/ACS International Conference on Computer Systems and Applications.

[14]  Elaine B. Barker,et al.  SP 800-57. Recommendation for Key Management, Part 1: General (revised) , 2007 .

[15]  Zhengxin Chen,et al.  Privacy-Preserving Data Mining of Medical Data Using Data Separation-Based Techniques , 2007, Data Sci. J..

[16]  Elaine B. Barker Recommendation for Key Management, Part 1: General , 2016 .

[17]  Chris Clifton,et al.  Privacy-preserving distributed mining of association rules on horizontally partitioned data , 2004, IEEE Transactions on Knowledge and Data Engineering.

[18]  Nirali R. Nanavati,et al.  A novel privacy-preserving scheme for collaborative frequent itemset mining across vertically partitioned data , 2015, Secur. Commun. Networks.

[19]  Devesh C. Jinwala,et al.  Analysis and Evaluation of Schemes for Secure Sum in Collaborative Frequent Itemset Mining across Horizontally Partitioned Data , 2014 .

[20]  Chris Clifton,et al.  Privacy Preserving Naïve Bayes Classifier for Vertically Partitioned Data , 2004, SDM.

[21]  K. Elavarasi,et al.  Privacy preserving mining of association rulesin horizontally distributed databases , 2014 .

[22]  Xun Yi,et al.  Classification of Privacy-preserving Distributed Data Mining protocols , 2011, 2011 Sixth International Conference on Digital Information Management.

[23]  M. B. Malik,et al.  Privacy Preserving Data Mining Techniques: Current Scenario and Future Prospects , 2012, 2012 Third International Conference on Computer and Communication Technology.

[24]  Tzung-Pei Hong,et al.  Reducing Side Effects of Hiding Sensitive Itemsets in Privacy Preserving Data Mining , 2014, TheScientificWorldJournal.

[25]  J Haraldsdóttir,et al.  Tracking of cardiovascular disease risk factors including maximal oxygen uptake and physical activity from late teenage to adulthood An 8‐year follow‐up study , 1993, Journal of internal medicine.

[26]  Philip S. Yu,et al.  A General Survey of Privacy-Preserving Data Mining Models and Algorithms , 2008, Privacy-Preserving Data Mining.

[27]  Ahmed Z. Emam,et al.  Comprehensive Survey on Privacy Preserving Association Rule Mining: Models, Approaches, Techniques and Algorithms , 2014, Int. J. Artif. Intell. Tools.

[28]  Aris Gkoulalas-Divanis,et al.  A Survey of Association Rule Hiding Methods for Privacy , 2008, Privacy-Preserving Data Mining.

[29]  Gyeong Ho Lee,et al.  Diagnostic Analysis of Patients with Essential Hypertension Using Association Rule Mining , 2010, Healthcare informatics research.

[30]  Antoine Joux,et al.  Separating Decision Diffie–Hellman from Computational Diffie–Hellman in Cryptographic Groups , 2003, Journal of Cryptology.

[31]  Xiao Ming Zhu Research on Privacy Preserving Data Mining Association Rules Protocol , 2013 .

[32]  M. Cevdet Ince,et al.  An expert system for detection of breast cancer based on association rules and neural network , 2009, Expert Syst. Appl..

[33]  Yehuda Lindell,et al.  Secure Multiparty Computation for Privacy-Preserving Data Mining , 2009, IACR Cryptol. ePrint Arch..

[34]  Kun Liu,et al.  Multi-party, Privacy-Preserving Distributed Data Mining Using a Game Theoretic Framework , 2007, PKDD.

[35]  Jouni Lampinen,et al.  A Classification method based on principal component analysis and differential evolution algorithm applied for prediction diagnosis from clinical EMR heart data sets , 2010 .

[36]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.