Prediction of Cerebral Aneurysm Rupture Using Hemodynamic, Morphologic and Clinical Features: A Data Mining Approach

Cerebral aneurysms pose a major clinical threat and the current practice upon diagnosis is a complex, lengthy, and costly, multicriteria analysis, which to date is not fully understood. This paper reports the development of several classifiers predicting whether a given clinical case is likely to rupture taking into account available information of the patient and characteristics of the aneurysm. The dataset used included 157 cases, with 294 features each. The broad range of features include basic demographics and clinical information, morphological characteristics computed from the patient's medical images, as well as results gained from personalised blood flow simulations. In this premiere attempt the wealth of aneurysm-related information gained from multiple heterogeneous sources and complex simulation processes is used to systematically apply different data-mining algorithms and assess their predictive accuracy in this domain. The promising results show up to 95% classification accuracy. Moreover, the analysis also enables to confirm or reject risk factors commonly accepted or suspected in the domain.

[1]  Alejandro F. Frangi,et al.  @neurIST: Infrastructure for Advanced Disease Management Through Integration of Heterogeneous Data, Computing, and Complex Processing Services , 2010, IEEE Transactions on Information Technology in Biomedicine.

[2]  Yen H. Le,et al.  A supervised classification-based method for coronary calcium detection in non-contrast CT , 2010, The International Journal of Cardiovascular Imaging.

[3]  Alejandro F Frangi,et al.  The @neurIST project: towards understanding cerebral aneurysms , 2007 .

[4]  Alejandro F Frangi,et al.  @neurIST complex information processing toolchain for the integrated management of cerebral aneurysms , 2011, Interface Focus.

[5]  Antoine Geissbühler,et al.  Design of a Decentralized Reusable Research Database Architecture to Support Data Acquisition in Large Research Projects , 2007, MedInfo.

[6]  Charu C. Aggarwal,et al.  On the Use of Conceptual Reconstruction for Mining Massively Incomplete Data Sets , 2003, IEEE Trans. Knowl. Data Eng..

[7]  Michael R. Berthold,et al.  Constructive training of probabilistic neural networks , 1998, Neurocomputing.

[8]  J. Mocco,et al.  Hemodynamic–Morphologic Discriminants for Intracranial Aneurysm Rupture , 2011, Stroke.

[9]  D. Rubin,et al.  Statistical Analysis with Missing Data , 1988 .

[10]  Keun Ho Ryu,et al.  A Data Mining Approach for Coronary Heart Disease Prediction using HRV Features and Carotid Arterial Wall Thickness , 2008, 2008 International Conference on BioMedical Engineering and Informatics.

[11]  Osmar R. Zaïane,et al.  Application of Data Mining Techniques for Medical Image Classification , 2001, MDM/KDD.

[12]  Roderick J. A. Little,et al.  Statistical Analysis with Missing Data: Little/Statistical Analysis with Missing Data , 2002 .

[13]  Charles B. Wilson,et al.  Endovascular and surgical treatment of unruptured cerebral aneurysms: Comparison of risks , 2000, Annals of neurology.

[14]  J. Sayre,et al.  Quantitative Hemodynamic Analysis of Brain Aneurysms at Different Locations , 2009, American Journal of Neuroradiology.

[15]  Alejandro F Frangi,et al.  Morphological descriptors as rupture indicators in middle cerebral artery aneurysms , 2010, 2010 Annual International Conference of the IEEE Engineering in Medicine and Biology.

[16]  Agma J. M. Traina,et al.  How to Improve Medical Image Diagnosis through Association Rules: The IDEA Method , 2008, 2008 21st IEEE International Symposium on Computer-Based Medical Systems.

[17]  Alejandro F. Frangi,et al.  Efficient 3D Geometric and Zernike Moments Computation from Unstructured Surface Meshes , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[19]  Jiawei Han,et al.  CPAR: Classification based on Predictive Association Rules , 2003, SDM.

[20]  Michael R. Berthold,et al.  Mixed fuzzy rule formation , 2003, Int. J. Approx. Reason..

[21]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques with Java implementations , 2002, SGMD.

[22]  L. Beenen,et al.  Direct Costs of Modern Treatment of Aneurysmal Subarachnoid Hemorrhage in the First Year After Diagnosis , 2002, Stroke.

[23]  Kristin P. Bennett,et al.  Support vector machines: hype or hallelujah? , 2000, SKDD.

[24]  Agma J. M. Traina,et al.  Mining Statistical Association Rules to Select the Most Relevant Medical Image Features , 2009, Mining Complex Data.

[25]  Haiwei Pan,et al.  Domain Knowledge-Driven Association Pattern Mining Algorithm on Medical Images , 2009, 2009 Fourth International Conference on Internet Computing for Science and Engineering.

[26]  Alejandro F. Frangi,et al.  @neurIST - Chronic Disease Management through Integration of Heterogeneous Data and Computer-interpretable Guideline Services , 2008, HealthGrid.

[27]  F. Mut,et al.  Association of Hemodynamic Characteristics and Cerebral Aneurysm Rupture , 2011, American Journal of Neuroradiology.

[28]  Djamel A. Zighed,et al.  Mining Complex Data, ECML/PKDD 2007 Third International Workshop, MCD 2007, Warsaw, Poland, September 17-21, 2007, Revised Selected Papers , 2008, MCD.

[29]  Wei-Pang Yang,et al.  A discretization algorithm based on Class-Attribute Contingency Coefficient , 2008, Inf. Sci..