Feature Selection Based on the SVM Weight Vector for Classification of Dementia

Computer-aided diagnosis of dementia using a support vector machine (SVM) can be improved with feature selection. The relevance of individual features can be quantified from the SVM weights as a significance map (p-map). Although these p-maps previously showed clusters of relevant voxels in dementia-related brain regions, they have not yet been used for feature selection. Therefore, we introduce two novel feature selection methods based on p-maps using a direct approach (filter) and an iterative approach (wrapper). To evaluate these p-map feature selection methods, we compared them with methods based on the SVM weight vector directly, t-statistics, and expert knowledge. We used MRI data from the Alzheimer's disease neuroimaging initiative classifying Alzheimer's disease (AD) patients, mild cognitive impairment (MCI) patients who converted to AD (MCIc), MCI patients who did not convert to AD (MCInc), and cognitively normal controls (CN). Features for each voxel were derived from gray matter morphometry. Feature selection based on the SVM weights gave better results than t-statistics and expert knowledge. The p-map methods performed slightly better than those using the weight vector. The wrapper method scored better than the filter method. Recursive feature elimination based on the p-map improved most for AD-CN: the area under the receiver-operating-characteristic curve (AUC) significantly increased from 90.3% without feature selection to 92.0% when selecting 1.5%-3% of the features. This feature selection method also improved the other classifications: AD-MCI 0.1% improvement in AUC (not significant), MCI-CN 0.7%, and MCIc-MCInc 0.1% (not significant). Although the performance improvement due to feature selection was limited, the methods based on the p-map generally had the best performance, and were therefore better in estimating the relevance of individual features.

[1]  Linda Teri,et al.  Clinico‐Neuropathological Correlation of Alzheimer's Disease in a Community‐Based Case Series , 1999, Journal of the American Geriatrics Society.

[2]  Sophie Paquerault,et al.  Battle against Alzheimer's disease: the scope and potential value of magnetic resonance imaging biomarkers. , 2012, Academic radiology.

[3]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[4]  Marie Chupin,et al.  Spatial and anatomical regularization of SVM for brain image analysis , 2010, NIPS.

[5]  Marie Chupin,et al.  Automatic classi fi cation of patients with Alzheimer ' s disease from structural MRI : A comparison of ten methods using the ADNI database , 2010 .

[6]  Wlodzislaw Duch Filter methods , 2004 .

[7]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[8]  Juha Koikkalainen,et al.  Optimizing the diagnosis of early Alzheimer's disease in mild cognitive impairment subjects. , 2012, Journal of Alzheimer's disease : JAD.

[9]  Jieping Ye,et al.  Sparse learning and stability selection for predicting MCI to AD conversion using baseline ADNI data , 2012, BMC Neurology.

[10]  G. Frisoni,et al.  Detection of grey matter loss in mild Alzheimer's disease with voxel based morphometry , 2002, Journal of neurology, neurosurgery, and psychiatry.

[11]  Alain Rakotomamonjy,et al.  Variable Selection Using SVM-based Criteria , 2003, J. Mach. Learn. Res..

[12]  J. Ramírez,et al.  Feature selection using factor analysis for Alzheimer's diagnosis using 18F-FDG PET images. , 2010, Medical physics.

[13]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[14]  M. Folstein,et al.  Clinical diagnosis of Alzheimer's disease: Report of the NINCDS—ADRDA Work Group under the auspices of Department of Health and Human Services Task Force on Alzheimer's Disease , 2011, Neurology.

[15]  Editors-in-chief,et al.  Encyclopedia of statistics in behavioral science , 2005 .

[16]  Bilwaj Gaonkar,et al.  Analytic estimation of statistical significance maps for support vector machine based multi-variate image analysis and classification , 2013, NeuroImage.

[17]  A. Wimo,et al.  The global prevalence of dementia: A systematic review and metaanalysis , 2013, Alzheimer's & Dementia.

[18]  Aixia Guo,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2014 .

[19]  G. Houston,et al.  Diagnostic classification of arterial spin labeling and structural MRI in presenile early stage dementia , 2014, Human brain mapping.

[20]  Dinggang Shen,et al.  COMPARE: Classification of Morphological Patterns Using Adaptive Regional Elements , 2007, IEEE Transactions on Medical Imaging.

[21]  Frederik Barkhof,et al.  Pathological Aging of the Brain: An Overview , 2004, Topics in magnetic resonance imaging : TMRI.

[22]  D. Louis Collins,et al.  MRI-Based Automated Computer Classification of Probable AD Versus Normal Controls , 2008, IEEE Transactions on Medical Imaging.

[23]  Stephen M Smith,et al.  Fast robust automated brain extraction , 2002, Human brain mapping.

[24]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[25]  Alzheimer’s Association,et al.  2011 Alzheimer’s disease facts and figures , 2011, Alzheimer's & Dementia.

[26]  L. Lapham,et al.  Clinicopathologic Correlates in Alzheimer Disease: Assessment of Clinical and Pathologic Diagnostic Criteria , 1993, Alzheimer disease and associated disorders.

[27]  Paul Suetens,et al.  Construction of a Brain Template from MR Images Using State-of-the-Art Registration and Segmentation Techniques , 2004, MICCAI.

[28]  Michael Unser,et al.  Optimization of mutual information for multiresolution image registration , 2000, IEEE Trans. Image Process..

[29]  Nick C Fox,et al.  Automatic classification of MR scans in Alzheimer's disease. , 2008, Brain : a journal of neurology.

[30]  Alexander Hammers,et al.  Three‐dimensional maximum probability atlas of the human brain, with particular reference to the temporal lobe , 2003, Human brain mapping.

[31]  Yi Liu,et al.  FS_SFS: A novel feature selection method for support vector machines , 2006, Pattern Recognit..

[32]  Stefan Klein,et al.  Feature Selection Based on SVM Significance Maps for Classification of Dementia , 2014, MLMI.

[33]  Lenore J. Launer,et al.  Accuracy of clinical criteria for AD in the Honolulu-Asia Aging Study, a population-based study. , 2001 .

[34]  Daniel Rueckert,et al.  Automatic anatomical brain MRI segmentation combining label propagation and decision fusion , 2006, NeuroImage.

[35]  Daniel Rueckert,et al.  Automatic segmentation of brain MRIs of 2-year-olds into 83 regions of interest , 2008, NeuroImage.

[36]  Karl J. Friston,et al.  Voxel-Based Morphometry—The Methods , 2000, NeuroImage.

[37]  J. Baron,et al.  Mapping gray matter loss with voxel-based morphometry in mild cognitive impairment , 2002, Neuroreport.

[38]  Karl J. Friston,et al.  Unified segmentation , 2005, NeuroImage.

[39]  Clifford R. Jack,et al.  Diagnostic neuroimaging across diseases , 2011, NeuroImage.

[40]  J. Ramírez,et al.  Feature selection using factor analysis for Alzheimer's diagnosis using 18F-FDG PET images. , 2010, Medical physics.

[41]  John Shawe-Taylor,et al.  SCoRS—A Method Based on Stability for Feature Selection and Mapping in Neuroimaging , 2014, IEEE Transactions on Medical Imaging.

[42]  Nick C Fox,et al.  The Alzheimer's disease neuroimaging initiative (ADNI): MRI methods , 2008, Journal of magnetic resonance imaging : JMRI.

[43]  Mert R. Sabuncu,et al.  The Relevance Voxel Machine (RVoxM): A Self-Tuning Bayesian Model for Informative Image-Based Prediction , 2012, IEEE Transactions on Medical Imaging.

[44]  Janaina Mourão Miranda,et al.  Classifying brain states and determining the discriminating activation patterns: Support Vector Machine on functional MRI data , 2005, NeuroImage.

[45]  Daoqiang Zhang,et al.  Multimodal classification of Alzheimer's disease and mild cognitive impairment , 2011, NeuroImage.

[46]  Verónica Bolón-Canedo,et al.  A review of feature selection methods on synthetic data , 2013, Knowledge and Information Systems.

[47]  G. Frisoni,et al.  A voxel based morphometry study on mild cognitive impairment , 2004, Journal of Neurology, Neurosurgery & Psychiatry.

[48]  Martin Guha,et al.  Encyclopedia of Statistics in Behavioral Science , 2006 .

[49]  K. Jellinger,et al.  Accuracy of clinical criteria for AD in the Honolulu-Asia Aging Study, a population-based study. , 2002, Neurology.

[50]  John A. Detre,et al.  Support vector machine learning-based fMRI data group analysis , 2007, NeuroImage.

[51]  Bilwaj Gaonkar,et al.  Feature ranking based nested support vector machine ensemble for medical image classification , 2012, 2012 9th IEEE International Symposium on Biomedical Imaging (ISBI).

[52]  Peter A. Bandettini,et al.  Does feature selection improve classification accuracy? Impact of sample size and feature selection on classification using anatomical magnetic resonance images , 2012, NeuroImage.

[53]  Max A. Viergever,et al.  elastix: A Toolbox for Intensity-Based Medical Image Registration , 2010, IEEE Transactions on Medical Imaging.

[54]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[55]  Marko Grobelnik,et al.  Feature selection using linear classifier weights: interaction with classification models , 2004, SIGIR '04.

[56]  A. Simmons,et al.  Multivariate data analysis and machine learning in Alzheimer's disease with a focus on structural magnetic resonance imaging. , 2014, Journal of Alzheimer's disease : JAD.