Heart disease prediction system using k-Nearest neighbor algorithm with simplified patient's health parameters

Heart disease is the primary cause of death nowadays. Treatments of heart disease patients have been advanced, for example with machine-to-machine (M2M) technology to enable remote patient monitoring. To use M2M to take care remote heart disease patient, his/her medical condition should be measured periodically at home. Thus, it is difficult to perform complex tests which need physicians to help. Meanwhile, heart disease can be predicted by analysing some of patient's health parameters. With help of data mining techniques, heart disease prediction can be improved. There are some algorithms that have been used for this purpose like Naive Bayes, Decision Tree, and k-Nearest Neighbor (KNN). This study aims to use data mining techniques in heart disease prediction, with simplifying parameters to be used, so they can be used in M2M remote patient monitoring purpose. KNN is used with parameter weighting method to improve accuracy. Only 8 parameters are used (out of 13 parameters recommended), since they are simple and instant parameters that can be measured at home. The result shows that the accuracy of these 8 parameters using KNN algorithm are good enough, comparing to 13 parameters with KNN, or even other algorithms like Naive Bayes and Decision Tree.

[1]  Yong Hu,et al.  The application of data mining techniques in financial fraud detection: A classification framework and an academic review of literature , 2011, Decis. Support Syst..

[2]  Ian H. Witten,et al.  Data mining in bioinformatics using Weka , 2004, Bioinform..

[3]  M. R. Shafiee-Chafi,et al.  A Novel Fuzzy Based Method for Heart Rate Variability Prediction , 2014 .

[4]  Zhong Fan,et al.  M2M communications for e-health: Standards, enabling technologies, and research challenges , 2012, 2012 6th International Symposium on Medical Information and Communication Technology (ISMICT).

[5]  Mostefa Mesbah,et al.  Newborn EEG seizure detection based on interspike space distribution in the time-frequency domain , 2007 .

[6]  Saurabh Pal,et al.  Early Prediction of Heart Diseases Using Data Mining Techniques , 2013 .

[7]  G. D'Agostini,et al.  A Multidimensional unfolding method based on Bayes' theorem , 1995 .

[8]  Geert Wets,et al.  A data mining framework for optimal product selection in retail supermarket data: the generalized PROFSET model , 2000, KDD '00.

[9]  Irina Rish,et al.  An empirical study of the naive Bayes classifier , 2001 .

[10]  R. Suganya,et al.  Data Mining Concepts and Techniques , 2010 .

[11]  M. J. del Jesus,et al.  Web usage mining to improve the design of an e-commerce website: OrOliveSur.com , 2012, Expert Syst. Appl..

[12]  Cristina Elena Turcu,et al.  The 2 nd International Conference on Integrated Information Internet of Things as Key Enabler for Sustainable Healthcare Delivery , 2013 .

[13]  Sungmo Jung,et al.  An Optimization Scheme for M2M-Based Patient Monitoring in Ubiquitous Healthcare Domain , 2012, Int. J. Distributed Sens. Networks.

[14]  Majid Ahmadi,et al.  Investigating the Performance of Naive- Bayes Classifiers and K- Nearest Neighbor Classifiers , 2007, 2007 International Conference on Convergence Information Technology (ICCIT 2007).

[15]  Ahmad Basheer Hassanat,et al.  Solving the Problem of the K Parameter in the KNN Classifier Using an Ensemble Learning Approach , 2014, ArXiv.

[16]  P. Libby,et al.  Braunwald's Heart Disease: A Textbook of Cardiovascular Medicine, 2-Volume Set, 9th Edition Expert Consult Premium Edition €“ Enhanced Online Features , 2011 .

[17]  G.Karthiga,et al.  Heart Disease Analysis System Using DataMining Techniques , 2014 .

[18]  Y Zare Mehrjerdi,et al.  A Novel Continuous KNN Prediction Algorithm to Improve Manufacturing Policies in a VMI Supply Chain , 2014 .

[19]  K. Usha Rani,et al.  ENSEMBLE DECISION TREE CLASSIFIER FOR BREAST CANCER DATA , 2012 .

[20]  Philip S. Yu,et al.  Top 10 algorithms in data mining , 2007, Knowledge and Information Systems.

[21]  D. Gunawan,et al.  Designing machine-to-machine (M2M) system in health-cure modeling for cardiovascular disease patients: Initial study , 2015, 2015 3rd International Conference on Information and Communication Technology (ICoICT).

[22]  Abdul R. Shaikh,et al.  INTELLIGENT HEART DISEASE PREDICTION SYSTEM WITH MONGODB , 2015 .

[23]  B. Thuraisingham A primer for understanding and applying data mining , 2000 .

[24]  Qin Ding,et al.  k-nearest Neighbor Classification on Spatial Data Streams Using P-trees , 2002, PAKDD.

[25]  Leif E. Peterson K-nearest neighbor , 2009, Scholarpedia.

[26]  Rong-Ho Lin,et al.  An intelligent model for liver disease diagnosis , 2009, Artif. Intell. Medicine.

[27]  David C. Yen,et al.  Applying data mining to telecom churn management , 2006, Expert Syst. Appl..

[28]  D. Mozaffarian,et al.  Executive summary: heart disease and stroke statistics--2012 update: a report from the American Heart Association. , 2012, Circulation.

[29]  Victor C. M. Leung,et al.  A Survey of Recent Developments in Home M2M Networks , 2014, IEEE Commun. Surv. Tutorials.