Application of knowledge discovery process on the prediction of stroke

OBJECTIVE Stroke is a prominent life-threatening disease in the world. The current study was performed to predict the outcome of stroke using knowledge discovery process (KDP) methods, artificial neural networks (ANN) and support vector machine (SVM) models. MATERIALS AND METHODS The records of 297 (130 sick and 167 healthy) individuals were acquired from the databases of the department of emergency medicine. Nine predictors (coronary artery disease, diabetes mellitus, hypertension, history of cerebrovascular disease, atrial fibrillation, smoking, the findings of carotid Doppler ultrasonography [normal, plaque, plaque+stenosis≥50%], the levels of cholesterol and C-reactive protein) were used for predicting the stroke. Feature selection based on the Cramer's V test was carried out for reducing the predictors. Multilayer perceptron (MLP) ANN and SVM with radial basis function (RBF) kernel were used for the prediction based on the selected predictors. RESULTS The accuracy values were 81.82% for ANN and 80.38% for SVM in the training dataset (n=209), and 85.9% for ANN and 84.62% for SVM in the testing dataset (n=78), respectively. ANN and SVM models yielded area under curve (AUC) values of 0.905 and 0.899 in the training dataset, and 0.928 and 0.91 in the testing dataset, consecutively. CONCLUSION The findings of the current study pointed out that ANN had more predictive performance when compared with SVM in predicting stroke. The proposed ANN model would be useful when making clinical decisions regarding stroke.

[1]  Lionel Tarassenko,et al.  "Stroke Nav": A wireless data collection and review system to support stroke care delivery , 2012, Comput. Methods Programs Biomed..

[2]  M. Luby,et al.  Magnetic resonance imaging and computed tomography in emergency assessment of patients with suspected acute stroke: a prospective comparison , 2007, The Lancet.

[3]  Gokmen Zararsiz,et al.  Drug/nondrug classification using Support Vector Machines with various feature selection strategies , 2014, Comput. Methods Programs Biomed..

[4]  Mehdi Khashei,et al.  A novel hybrid classification model of artificial neural networks and multiple linear regression models , 2012, Expert Syst. Appl..

[5]  L. Hui,et al.  Evaluation of an artificial neural network to ascertain why there is a high incidence of hepatitis B in the Chinese population after vaccination , 2013, Comput. Biol. Medicine.

[6]  Chun-An Cheng,et al.  Prediction of the Prognosis of Ischemic Stroke Patients after Intravenous Thrombolysis Using Artificial Neural Networks , 2014, ICIMTH.

[7]  Lucio Soibelman,et al.  Factor selection for delay analysis using Knowledge Discovery in Databases , 2008 .

[8]  Kulthida Tuamsuk,et al.  Data Mining and Its Applications for Knowledge Management: A Literature Review from 2007 to 2012 , 2012, ArXiv.

[9]  Muin J. Khoury,et al.  Application of support vector machine modeling for prediction of common diseases: the case of diabetes and pre-diabetes , 2010, BMC Medical Informatics Decis. Mak..

[10]  O. Oshinaike,et al.  Glucose and lipid assessment in patients with acute stroke , 2014, International archives of medicine.

[11]  Cemil Colak,et al.  Predicting coronary artery disease using different artificial neural network models. , 2008, Anadolu kardiyoloji dergisi : AKD = the Anatolian journal of cardiology.

[12]  Y. Çelik,et al.  Prediction of mortality in stroke patients using multilayer perceptron neural networks , 2012, Turkish Journal of Medical Sciences.

[13]  A. Arauz,et al.  Vascular Cognitive Disorders and Depression After First-Ever Stroke: The Fogarty-Mexico Stroke Cohort , 2014, Cerebrovascular Diseases.

[14]  Sazali Yaacob,et al.  Normal and hypoacoustic infant cry signal classification using time-frequency analysis and general regression neural network , 2012, Comput. Methods Programs Biomed..

[15]  Marcos Gestal,et al.  Exploring patterns of epigenetic information with data mining techniques. , 2012, Current pharmaceutical design.

[16]  D. Rueckert,et al.  Prediction of stroke thrombolysis outcome using CT brain machine learning , 2014, NeuroImage: Clinical.

[17]  Amr Sharawy,et al.  Computer aided detection system for micro calcifications in digital mammograms , 2014, Comput. Methods Programs Biomed..

[18]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery in Databases , 1996, AI Mag..