Prediction of Coronary Heart Disease using Machine Learning: An Experimental Analysis

The field of medical analysis is often regarded as a valuable source of rich information. Coronary Heart Disease (CHD) is one of the major causes of death worldwide, so early detection of CHD can help reduce death rates. The challenge for conventional prediction techniques lies in the complexity of the data and the correlations within it. The aim of this research is to use historical medical data to predict CHD with Machine Learning (ML) techniques. The scope of this research is limited to three supervised learning techniques, namely Naïve Bayes (NB), Support Vector Machine (SVM) and Decision Tree (DT), which are used to discover correlations in CHD data that might help improve the prediction rate. Models are derived by these ML techniques from the South African Heart Disease dataset of 462 instances and evaluated using 10-fold cross-validation. Empirical results across different performance evaluation measures indicate that the probabilistic models derived by NB are promising for detecting CHD.
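
The evaluation protocol described above (a probabilistic NB model scored with 10-fold cross-validation on 462 instances) can be sketched in plain Python. This is only an illustrative sketch, not the authors' implementation: the data is synthetic (two made-up Gaussian features with an arbitrary class balance, not the South African Heart Disease dataset), the Gaussian form of NB is an assumption, and every function name is hypothetical.

```python
import math
import random


def k_fold_indices(n, k, seed=0):
    """Shuffle indices 0..n-1 and split them into k folds of near-equal size."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def fit_gaussian_nb(X, y):
    """Estimate, per class, the prior and each feature's mean and variance."""
    model = {}
    for label in set(y):
        rows = [x for x, t in zip(X, y) if t == label]
        n = len(rows)
        means = [sum(col) / n for col in zip(*rows)]
        # A small constant keeps the variance strictly positive.
        variances = [sum((v - m) ** 2 for v in col) / n + 1e-9
                     for col, m in zip(zip(*rows), means)]
        model[label] = (n / len(y), means, variances)
    return model


def predict_gaussian_nb(model, x):
    """Return the class with the highest log posterior under the NB assumption."""
    def log_posterior(label):
        prior, means, variances = model[label]
        total = math.log(prior)
        for v, m, s2 in zip(x, means, variances):
            total += -0.5 * math.log(2 * math.pi * s2) - (v - m) ** 2 / (2 * s2)
        return total
    return max(model, key=log_posterior)


def cross_validate(X, y, k=10, seed=0):
    """Train on k-1 folds, test on the held-out fold, and pool the accuracy."""
    correct = 0
    for fold in k_fold_indices(len(X), k, seed):
        held_out = set(fold)
        train_X = [X[i] for i in range(len(X)) if i not in held_out]
        train_y = [y[i] for i in range(len(X)) if i not in held_out]
        model = fit_gaussian_nb(train_X, train_y)
        correct += sum(predict_gaussian_nb(model, X[i]) == y[i] for i in fold)
    return correct / len(X)


def make_synthetic(n=462, seed=1):
    """Synthetic stand-in data (assumption): two features shifted by class."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        label = 1 if rng.random() < 0.35 else 0  # arbitrary class balance
        shift = 1.5 * label
        X.append([rng.gauss(shift, 1.0), rng.gauss(shift, 1.0)])
        y.append(label)
    return X, y


X, y = make_synthetic()
accuracy = cross_validate(X, y, k=10)
print(f"10-fold CV accuracy on synthetic data: {accuracy:.3f}")
```

Each instance is held out exactly once, so the pooled accuracy uses all 462 predictions. Comparing NB against SVM and DT under the same protocol would mean reusing the same folds for each model.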
