Neuropsychological predictors of conversion from mild cognitive impairment to Alzheimer’s disease: a feature selection ensemble combining stability and predictability

Background: Predicting progression from Mild Cognitive Impairment (MCI) to Alzheimer’s Disease (AD) is a major open issue in AD-related research. Neuropsychological assessment has proven useful in identifying MCI patients who are likely to convert to dementia. However, the large battery of neuropsychological tests (NPTs) performed in clinical practice and the limited number of training examples are a challenge for machine learning when learning prognostic models. In this context, it is paramount to pursue approaches that effectively seek reduced sets of relevant features. Subsets of NPTs from which prognostic models can be learnt should be not only good predictors but also stable, promoting generalizable and explainable models.

Methods: We propose a feature selection (FS) ensemble combining stability and predictability to choose the most relevant NPTs for prognostic prediction in AD. First, we combine the outcome of multiple (filter and embedded) FS methods. Then, we use a wrapper-based approach optimizing both stability and predictability to compute the number of selected features. We use two large prospective studies (ADNI and the Portuguese Cognitive Complaints Cohort, CCC) to evaluate the approach and assess the predictive value of a large number of NPTs.

Results: The best subsets of features include approximately 30 and 20 features (from the original 79 and 40) for ADNI and CCC data, respectively, yielding stability above 0.89 and 0.95, and AUC above 0.87 and 0.82. Most NPTs selected by the proposed feature selection ensemble have been identified in the literature as strong predictors of conversion from MCI to AD.

Conclusions: The FS ensemble approach was able to 1) identify subsets of stable and relevant predictors from a consensus of multiple FS methods using baseline NPTs and 2) learn reliable prognostic models of conversion from MCI to AD using these subsets of features.
The machine learning models learnt from these features outperformed models trained without FS and achieved competitive results compared to commonly used FS algorithms. Furthermore, because the selected features are derived from a consensus of methods, they are more robust, and users are relieved of having to choose the most appropriate FS method for their classification task.
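The two ingredients named in the Methods (a consensus over several FS methods and a stability score for the resulting subsets) can be illustrated with a minimal sketch. The abstract does not say how rankings are combined or which stability measure is used, so the choices below are assumptions: mean-rank aggregation stands in for the consensus step, and the Kuncheva consistency index stands in for the stability measure. The function names `aggregate_ranks`, `kuncheva_index`, and `mean_stability` are hypothetical.

```python
from itertools import combinations


def aggregate_ranks(rankings):
    """Combine several feature rankings by mean rank (assumed consensus rule).

    Each ranking is a list of the same feature names, ordered best to worst.
    Returns the features sorted by their average position across rankings.
    """
    features = rankings[0]
    mean_rank = {
        f: sum(r.index(f) for r in rankings) / len(rankings) for f in features
    }
    # Ties are broken alphabetically so the result is deterministic.
    return sorted(features, key=lambda f: (mean_rank[f], f))


def kuncheva_index(subset_a, subset_b, n_features):
    """Kuncheva consistency index for two equal-size feature subsets.

    I = (r * n - k^2) / (k * (n - k)), where r is the overlap size,
    k the subset size, and n the total number of features.
    """
    k = len(subset_a)
    if len(subset_b) != k or not 0 < k < n_features:
        raise ValueError("subsets must share a size k with 0 < k < n_features")
    r = len(set(subset_a) & set(subset_b))
    return (r * n_features - k * k) / (k * (n_features - k))


def mean_stability(subsets, n_features):
    """Average pairwise Kuncheva index over subsets selected on resampled data."""
    pairs = list(combinations(subsets, 2))
    return sum(kuncheva_index(a, b, n_features) for a, b in pairs) / len(pairs)
```

In a wrapper-style search, one would presumably repeat the selection on resampled training data for each candidate subset size, compute `mean_stability` over the resulting top-k subsets, and weigh it against a cross-validated predictability score such as AUC; how the two are traded off in the paper is not stated in the abstract.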
