A Hybrid Intelligent System Framework for the Prediction of Heart Disease Using Machine Learning Algorithms

Heart disease is one of the most critical human diseases in the world and affects human life very badly. In heart disease, the heart is unable to push the required amount of blood to other parts of the body. Accurate and on time diagnosis of heart disease is important for heart failure prevention and treatment. The diagnosis of heart disease through traditional medical history has been considered as not reliable in many aspects. To classify the healthy people and people with heart disease, noninvasive-based methods such as machine learning are reliable and efficient. In the proposed study, we developed a machine-learning-based diagnosis system for heart disease prediction by using heart disease dataset. We used seven popular machine learning algorithms, three feature selection algorithms, the cross-validation method, and seven classifiers performance evaluation metrics such as classification accuracy, specificity, sensitivity, Matthews’ correlation coefficient, and execution time. The proposed system can easily identify and classify people with heart disease from healthy people. Additionally, receiver optimistic curves and area under the curves for each classifier was computed. We have discussed all of the classifiers, feature selection algorithms, preprocessing methods, validation method, and classifiers performance evaluation metrics used in this paper. The performance of the proposed system has been validated on full features and on a reduced set of features. The features reduction has an impact on classifiers performance in terms of accuracy and execution time of classifiers. The proposed machine-learning-based decision support system will assist the doctors to diagnosis heart patients efficiently.

[1]  Anil K. Jain,et al.  Statistical Pattern Recognition: A Review , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  K Vanisree,et al.  Decision Support System for Congenital Heart Disease Diagnosis based on Signs and Symptoms using Neural Networks , 2011 .

[3]  Nir Friedman,et al.  Bayesian Network Classifiers , 1997, Machine Learning.

[4]  Naftali Tishby,et al.  Margin based feature selection - theory and algorithms , 2004, ICML.

[5]  Kapil Wankhade,et al.  Decision support system for heart disease based on support vector machine and Artificial Neural Network , 2010, 2010 International Conference on Computer and Communication Technology (ICCCT).

[6]  Abdulkadir Sengür,et al.  Effective diagnosis of heart disease through neural networks ensembles , 2009, Expert Syst. Appl..

[7]  Qeethara Al-Shayea Artificial Neural Networks in Medical Diagnosis , 2024, International Journal of Research Publication and Reviews.

[8]  Randal S. Olson,et al.  Relief-Based Feature Selection: Introduction and Review , 2017, J. Biomed. Informatics.

[9]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[10]  Philip S. Yu,et al.  Top 10 algorithms in data mining , 2007, Knowledge and Information Systems.

[11]  Oyebade K. Oyedotun,et al.  Heart Diseases Diagnosis Using Neural Networks Arbitration , 2015 .

[12]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[13]  Oluwarotimi Williams Samuel,et al.  An integrated decision support system based on ANN and Fuzzy_AHP for heart failure risk prediction , 2017, Expert Syst. Appl..

[14]  B. Edmonds Using Localised ‘Gossip’ to Structure Distributed Learning , 2005 .

[15]  P. Warner Ordinal logistic regression , 2008, Journal of Family Planning and Reproductive Health Care.

[16]  Lala Septem Riza,et al.  Birthmark-Based Software Classification Using Rough Sets , 2017 .

[17]  Novruz Allahverdi,et al.  Design of a hybrid system for the diabetes and heart diseases , 2008, Expert Syst. Appl..

[18]  B. L. Deekshatulu,et al.  Classification of Heart Disease using Artificial Neural Network and Feature Subset Selection , 2013 .

[19]  Dayou Liu,et al.  A support vector machine classifier with rough set-based feature selection for breast cancer diagnosis , 2011, Expert Syst. Appl..

[20]  Ratna Babu Chinnam,et al.  mr2PSO: A maximum relevance minimum redundancy feature selection method based on swarm intelligence for support vector machine classification , 2011, Inf. Sci..

[21]  G. Fonarow,et al.  Epidemiology and risk profile of heart failure , 2011, Nature Reviews Cardiology.

[22]  Philip H. W. Leong,et al.  Grammar-Based Feature Generation for Time-Series Prediction , 2015 .

[23]  R. Detrano,et al.  International application of a new probability algorithm for the diagnosis of coronary artery disease. , 1989, The American journal of cardiology.

[24]  Jane Labadin,et al.  Feature selection based on mutual information , 2015, 2015 9th International Conference on IT in Asia (CITA).

[25]  Pankaj Kumar,et al.  EARLY HEART DISEASE PREDICTION USING DATA MINING TECHNIQUES , 2014, CSE 2014.

[26]  Janaina Mourão Miranda,et al.  Classifying brain states and determining the discriminating activation patterns: Support Vector Machine on functional MRI data , 2005, NeuroImage.

[27]  R. Tibshirani,et al.  Regression shrinkage and selection via the lasso: a retrospective , 2011 .

[28]  Sameh Ghwanmeh,et al.  Innovative Artificial Neural Networks-Based Decision Support System for Heart Diseases Diagnosis , 2013 .

[29]  Robert P. W. Duin,et al.  STATISTICAL PATTERN RECOGNITION , 2005 .

[30]  Peter Waiganjo Wagacha Induction of Decision Trees , 2003 .

[31]  A. Khera,et al.  Forecasting the Future of Cardiovascular Disease in the United States: A Policy Statement From the American Heart Association , 2011, Circulation.

[32]  Peter Bühlmann Regression shrinkage and selection via the Lasso: a retrospective (Robert Tibshirani): Comments on the presentation , 2011 .

[33]  V. Roger,et al.  The Heart Failure Epidemic , 2010, International journal of environmental research and public health.

[34]  Shah Nazir,et al.  Fuzzy logic based decision support system for component security evaluation , 2018, Int. Arab J. Inf. Technol..

[35]  A. V.DavidSánchez,et al.  Advanced support vector machines and kernel methods , 2003, Neurocomputing.

[36]  K. Larsen,et al.  Interpreting Parameters in the Logistic Regression Model with Random Effects , 2000, Biometrics.

[37]  Murthy V. Jonnalagedda,et al.  Clustering based on correlation fractal dimension over an evolving data stream , 2018, Int. Arab J. Inf. Technol..

[38]  Ms. Ishtake " Intelligent Heart Disease Prediction System Using Data Mining Techniques " , .