Assessment of Cardiovascular Disease Risk Prediction Models: Evaluation Methods

This paper uses a real world anaesthesia time-series monitoring data in the prediction of cardiovascular disease risk in a manner similar to exercise electrocardiography. Models derived using the entire anaesthesia population and subgroups based on pre-anaesthesia likelihood of complications are compared in an attempt to ascertain which model performance measures are best suited to populations with differing pre-test probability of disease. Misclassification rate (MR) as a measure of prediction model performance was compared with Kappa statistic, sensitivity, specificity, positive and negative predictive values and area under the receiver operating characteristic curve (AUC). In this medical application of data mining, MR is shown to be dependent on the prevalence of disease within the population studied but AUC and Kappa statistic are shown to be independent of disease prevalence.

[1]  A. Sharrett,et al.  Enhanced Risk Assessment in Asymptomatic Individuals With Exercise Testing and Framingham Risk Scores , 2005, Circulation.

[2]  Maurice Bruynooghe,et al.  Mining data from intensive care patients , 2007, Adv. Eng. Informatics.

[3]  Guy Lapalme,et al.  A systematic analysis of performance measures for classification tasks , 2009, Inf. Process. Manag..

[4]  James F Sallis,et al.  AHA Guidelines for Primary Prevention of Cardiovascular Disease and Stroke: 2002 Update: Consensus Panel Guide to Comprehensive Risk Reduction for Adult Patients Without Coronary or Other Atherosclerotic Vascular Diseases. American Heart Association Science Advisory and Coordinating Committee. , 2002, Circulation.

[5]  Blaz Zupan,et al.  Predictive data mining in clinical medicine: Current issues and guidelines , 2008, Int. J. Medical Informatics.

[6]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[7]  Sun I. Kim,et al.  Application of irregular and unbalanced data to predict diabetic nephropathy using visualization and feature selection methods , 2008, Artif. Intell. Medicine.

[8]  Krzysztof J. Cios,et al.  Uniqueness of medical data mining , 2002, Artif. Intell. Medicine.

[9]  Oscar Luaces,et al.  Predicting the probability of survival in intensive care unit patients from a small number of variables and training examples , 2009, Artif. Intell. Medicine.

[10]  Jaques Reifman,et al.  Decision tool for the early diagnosis of trauma patient hypovolemia , 2008, J. Biomed. Informatics.