PPDP: An efficient and privacy-preserving disease prediction scheme in cloud-based e-Healthcare system

Abstract Disease prediction systems have played an important role in people’s life, since predicting the risk of diseases is essential for people to lead a healthy life. The recent proliferation of data mining techniques has given rise to disease prediction systems. Specifically, with the vast amount of medical data generated every day, Single-Layer Perceptron can be utilized to obtain valuable information to construct a disease prediction system. Although the disease prediction system is quite promising, many challenges may limit it in practical use, including information security and prediction efficiency. In this paper, we propose an efficient and privacy-preserving disease prediction system, called PPDP. In PPDP, patients’ historical medical data are encrypted and outsourced to the cloud server, which can be further utilized to train prediction models by using Single-Layer Perceptron learning algorithm in a privacy-preserving way. The risk of diseases for new coming medical data can be computed based on the prediction models. In particular, PPDP builds on new medical data encryption, disease learning and disease prediction algorithms that novelly utilize random matrices. Security analysis indicates that PPDP offers a required level of privacy protection. In addition, real experiments on different datasets show that computation costs of data encryption, disease learning and disease prediction are several magnitudes lower than existing disease prediction schemes.

[1]  Jin Liu,et al.  An Energy Efficient Data Transmission Mechanism for Middleware of Wireless Sensor Network , 2014, 2014 Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing.

[2]  Zoran Obradovic,et al.  A privacy-preserving framework for distributed clinical decision support , 2011, 2011 IEEE 1st International Conference on Computational Advances in Bio and Medical Sciences (ICCABS).

[3]  Jian Wang,et al.  Collusion-resisting secure nearest neighbor query over encrypted data in cloud, revisited , 2016, 2016 IEEE/ACM 24th International Symposium on Quality of Service (IWQoS).

[4]  Andrew P. Bradley,et al.  Intelligible Support Vector Machines for Diagnosis of Diabetes Mellitus , 2010, IEEE Transactions on Information Technology in Biomedicine.

[5]  Kui Ren,et al.  CloudBI: Practical Privacy-Preserving Outsourcing of Biometric Identification in the Cloud , 2015, ESORICS.

[6]  Nelson G. Durdle,et al.  A support vectors classifier approach to predicting the risk of progression of adolescent idiopathic scoliosis , 2005, IEEE Transactions on Information Technology in Biomedicine.

[7]  Yvonne Vergouwe,et al.  Prediction models for clustered data: comparison of a random intercept and standard regression model , 2013, BMC Medical Research Methodology.

[8]  Ming Li,et al.  Securing Personal Health Records in Cloud Computing: Patient-Centric and Fine-Grained Data Access Control in Multi-owner Settings , 2010, SecureComm.

[9]  Elif Derya Übeyli,et al.  Multiclass Support Vector Machines for EEG-Signals Classification , 2007, IEEE Trans. Inf. Technol. Biomed..

[10]  Giovanni Parmigiani,et al.  PancPRO: risk assessment for individuals with a family history of pancreatic cancer. , 2007, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[11]  Xiaoxia Liu,et al.  Efficient and Privacy-Preserving Online Medical Prediagnosis Framework Using Nonlinear SVM , 2017, IEEE Journal of Biomedical and Health Informatics.

[12]  Naixue Xiong,et al.  A Lightweight Encryption Scheme Combined with Trust Management for Privacy-Preserving in Body Sensor Networks , 2015, Journal of Medical Systems.

[13]  Michael Naehrig,et al.  Improved Security for a Ring-Based Fully Homomorphic Encryption Scheme , 2013, IMACC.

[14]  Shen Yan,et al.  Security Analysis on Privacy-Preserving Cloud Aided Biometric Identification Schemes , 2016, ACISP.

[15]  Yuchen Zhang,et al.  HEALER: homomorphic computation of ExAct Logistic rEgRession for secure rare disease variants analysis in GWAS , 2015, Bioinform..

[16]  Nikos Mamoulis,et al.  Secure kNN computation on encrypted databases , 2009, SIGMOD Conference.

[17]  Laurence T. Yang,et al.  Privacy Preserving Deep Computation Model on Cloud for Big Data Feature Learning , 2016, IEEE Transactions on Computers.

[18]  Sushil Jajodia,et al.  Over-encryption: Management of Access Control Evolution on Outsourced Data , 2007, VLDB.

[19]  Muttukrishnan Rajarajan,et al.  Privacy-Preserving Clinical Decision Support System Using Gaussian Kernel-Based Classification , 2014, IEEE Journal of Biomedical and Health Informatics.

[20]  Michael Naehrig,et al.  Private Predictive Analysis on Encrypted Medical Data , 2014, IACR Cryptol. ePrint Arch..

[21]  Wei Jiang,et al.  k-Nearest Neighbor Classification over Semantically Secure Encrypted Relational Data , 2014, IEEE Transactions on Knowledge and Data Engineering.

[22]  Shucheng Yu,et al.  Efficient privacy-preserving biometric identification in cloud computing , 2013, 2013 Proceedings IEEE INFOCOM.

[23]  P. K. Anooj,et al.  Clinical decision support system: Risk level prediction of heart disease using weighted fuzzy rules , 2012, J. King Saud Univ. Comput. Inf. Sci..

[24]  Mu-Chen Chen,et al.  Prediction model building and feature selection with support vector machines in breast cancer diagnosis , 2008, Expert Syst. Appl..

[25]  Yoav Freund,et al.  Large Margin Classification Using the Perceptron Algorithm , 1998, COLT.

[26]  Cheng Huang,et al.  PSLP: Privacy-preserving single-layer perceptron learning for e-Healthcare , 2015, 2015 10th International Conference on Information, Communications and Signal Processing (ICICS).

[27]  Xiaofeng Wang,et al.  A Fuzzy Control Theory and Neural Network Based Sensor Network Control System , 2014, 2014 Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing.

[28]  Kun Liu,et al.  An Attacker's View of Distance Preserving Maps for Privacy Preserving Data Mining , 2006, PKDD.

[29]  Yiwei Thomas Hou,et al.  Privacy-preserving multi-keyword fuzzy search over encrypted data in the cloud , 2014, IEEE INFOCOM 2014 - IEEE Conference on Computer Communications.

[30]  Jianfeng Ma,et al.  Privacy-Preserving Patient-Centric Clinical Decision Support System on Naïve Bayesian Classification , 2016, IEEE Journal of Biomedical and Health Informatics.

[31]  Richard Y. K. Fung,et al.  Simulation-Based Optimization for Surgery Scheduling in Operation Theatre Management Using Response Surface Method , 2015, Journal of Medical Systems.