Adapting rough-fuzzy classifier to solve class imbalance problem in heart disease prediction using FCM

The main objective of this research is to develop a heart disease prediction technique that addresses the class imbalance problem. Class imbalance severely degrades prediction performance when the distribution of the data is not clearly defined. To overcome this problem, the proposed technique is divided into three steps. First, the input data is given to the fuzzy c-means (FCM) clustering algorithm, which converts the original data into an equal number of samples for every class. Second, rules are generated using rough set theory. Third, these rules are used for prediction with a fuzzy classifier. For testing, each test sample is converted into the relevant space by matching it against the original cluster centres and is then classified with the rough-fuzzy classifier. In the experiments, the proposed technique achieved an accuracy of 81% on the Cleveland dataset and 80% on the Hungarian dataset.
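The balancing step can be illustrated with a minimal sketch. The abstract does not say how FCM is used to equalize the classes, so the code below makes an assumption: each class is clustered separately with standard fuzzy c-means, and its cluster centres serve as the balanced samples. The function names (`fuzzy_c_means`, `balance_by_class`, `match_to_centres`), the fuzzifier `m = 2`, and the nearest-centre matching rule are all hypothetical choices for illustration, not the authors' implementation.

```python
import numpy as np

def fuzzy_c_means(X, n_clusters, m=2.0, n_iter=100, tol=1e-6, seed=0):
    """Standard fuzzy c-means; returns (centres, memberships)."""
    rng = np.random.default_rng(seed)
    # Random initial memberships; each row sums to 1.
    U = rng.random((X.shape[0], n_clusters))
    U /= U.sum(axis=1, keepdims=True)
    for _ in range(n_iter):
        W = U ** m
        # Centres are membership-weighted means of the samples.
        centres = (W.T @ X) / W.sum(axis=0)[:, None]
        # Euclidean distance from every sample to every centre.
        d = np.linalg.norm(X[:, None, :] - centres[None, :, :], axis=2)
        d = np.fmax(d, 1e-12)  # guard against division by zero
        # Membership update: u_ij proportional to d_ij^(-2/(m-1)).
        inv = d ** (-2.0 / (m - 1.0))
        U_new = inv / inv.sum(axis=1, keepdims=True)
        converged = np.abs(U_new - U).max() < tol
        U = U_new
        if converged:
            break
    return centres, U

def balance_by_class(X, y, n_per_class, **fcm_kwargs):
    """Assumed balancing scheme: represent every class by the same
    number of FCM cluster centres."""
    Xb, yb = [], []
    for label in np.unique(y):
        centres, _ = fuzzy_c_means(X[y == label], n_per_class, **fcm_kwargs)
        Xb.append(centres)
        yb.append(np.full(n_per_class, label))
    return np.vstack(Xb), np.concatenate(yb)

def match_to_centres(x, centres):
    """Assumed test-time step: index of the closest cluster centre."""
    return int(np.argmin(np.linalg.norm(centres - x, axis=1)))
```

Under these assumptions, calling `balance_by_class(X, y, n_per_class=3)` on a dataset with 40 samples of one class and 10 of another returns three representative points per class, which could then feed the rule-generation step.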
