A High Accuracy Integrated Bagging-Fuzzy-GBDT Prediction Algorithm for Heart Disease Diagnosis

Associated with high morbidity and mortality, heart disease has become a severe threat to peoples health throughout the world. The recent development of Internet of Things (IoT) and machine learning in e-healthcare have contributed to the monitoring, prediction and diagnosis of heart disease. Particularly, the heart disease prediction can effectively facilitate disease prevention, diagnosis and timely treatment. However, traditional prediction models are weak in accuracy and generalization. In this paper, we propose a high accuracy integrated prediction algorithm for heart disease diagnosis. The fuzzy logic and Bootstrap Aggregating (Bagging) algorithm based on Gradient Boosting Decision Tree (GBDT) algorithm are combined to process heart disease data and generate multiple weak classifiers. At first, we integrate the fuzzy logic with GBDT to reduce the complexity of data. Moreover, we develop the Fuzzy-GBDT model integrated Bagging algorithm to avoid the interference of sensitive points and achieve partial parallelism. The simulation results show the proposed Fuzzy-Bagging-GBDT algorithm improves the accuracy and recall of heart disease prediction compared with GBDT.

[1]  Soni Jyoti,et al.  Predictive Data Mining for Medical Diagnosis: An Overview of Heart Disease Prediction , 2011 .

[2]  Lili Zhang,et al.  The IoT-based heart disease monitoring system for pervasive healthcare service , 2017, KES.

[3]  Paul Rad,et al.  Distributed machine learning cloud teleophthalmology IoT for predicting AMD disease progression , 2019, Future Gener. Comput. Syst..

[4]  A. F. Isakovic,et al.  Graphene oxide: Nylon ECG sensors for wearable IoT healthcare—nanomaterial and SoC interface , 2018, 2017 Sensors Networks Smart and Emerging Technologies (SENSET).

[5]  William Herrington,et al.  Association Between Diabetes and Cause-Specific Mortality in Rural and Urban Areas of China , 2017, JAMA.

[6]  Jennifer Neville,et al.  Predicting Prefix Availability in the Internet , 2010, 2010 Proceedings IEEE INFOCOM.

[7]  Gholam Ali Montazer,et al.  A fuzzy-evidential hybrid inference engine for coronary heart disease risk assessment , 2010, Expert Syst. Appl..

[8]  T. Sasikala,et al.  Healthcare Monitoring System Based on IoT Using AMQP Protocol , 2019, ICCNCT - 2019.

[9]  Tie-Yan Liu,et al.  LightGBM: A Highly Efficient Gradient Boosting Decision Tree , 2017, NIPS.

[10]  Yanjiao Chen,et al.  InPrivate Digging: Enabling Tree-based Distributed Data Mining with Differential Privacy , 2018, IEEE INFOCOM 2018 - IEEE Conference on Computer Communications.

[11]  Bin Li,et al.  Robust Dynamic Spectrum Access in Uncertain Channels: A Fuzzy Payoffs Game Approach , 2017, GLOBECOM 2017 - 2017 IEEE Global Communications Conference.

[12]  Ali Taghipour,et al.  hs-CRP is strongly associated with coronary heart disease (CHD): A data mining approach using decision tree algorithm , 2017, Comput. Methods Programs Biomed..

[13]  Jianxin Li,et al.  Modeling and clustering attacker activities in IoT through machine learning techniques , 2019, Inf. Sci..