Deep Learning in current Neuroimaging: a multivariate approach with power and type I error control but arguable generalization ability

Discriminative analysis in neuroimaging by means of deep/machine learning techniques is usually tested with validation techniques such as K-fold, hold-out, and resubstitution, whereas the associated statistical significance remains largely under-developed due to their computational complexcarmenj@ugr.es javierrp@ugr.es js369@cam.ac.uk Jonathan.Voeglein@med.uni-muenchen.de Johannes.Levin@med.uni-muenchen.de gorriz@ugr.es, jg528@cam.ac.uk Data used in preparation of this article were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). As such, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data but did not participate in analysis or writing of this report. A complete listing of ADNI investigators can be found at: http://adni.loni.usc.edu/wp-content/uploads/how_to_apply/ADNI_Acknowledgement_List.pdf Deep Learning in current Neuroimaging: a multivariate approach with power and type I error control but arguable generalization ability A PREPRINT ity. In this work, a non-parametric framework is proposed that estimates the statistical significance of classifications using deep learning architectures that are increasingly used in neuroimaging. In particular, a combination of autoencoders (AE) and support vector machines (SVM) is applied to: (i) a one-condition, within-group designs often of normal controls (NC) and; (ii) a two-condition, between-group designs which contrast, for example, Alzheimer’s disease (AD) patients with NC (the extension to multi-class analyses is also included). A random-effects inference based on a label permutation test is proposed in both studies using cross-validation (CV) and resubstitution with upper bound correction (RUB) as validation methods. This allows both false positives and classifier overfitting to be detected as well as estimating the statistical power of the test. Several experiments were carried out using the Alzheimer’s Disease Neuroimaging Initiative (ADNI) dataset, the Dominantly Inherited Alzheimer Network (DIAN) dataset, and a MCI prediction dataset. We found in the permutation test that CV and RUB methods offer a false positive rate close to the significance level and an acceptable statistical power (although lower using cross-validation). A large separation between training and test accuracies using CV was observed, especially in one-condition designs. This implies a low generalization ability as the model fitted in training is not informative with respect to the test set. We propose as solution by applying RUB, whereby similar results are obtained to those of the CV test set, but considering the whole set and with a lower computational cost per iteration.

[1]  Venkatapavani Pallavi Punugu Machine Learning in Neuroimaging , 2017 .

[2]  Sayan Mukherjee,et al.  Permutation Tests for Classification , 2005, COLT.

[3]  Nick C Fox,et al.  Clinical phenotype and genetic associations in autosomal dominant familial Alzheimer’s disease: a case series , 2016, The Lancet Neurology.

[4]  J. M. GÓRRIZ,et al.  Case-Based Statistical Learning: A Non-Parametric Implementation With a Conditional-Error Rate SVM , 2017, IEEE Access.

[5]  Ayman El-Baz,et al.  Alzheimer's disease diagnostics by a 3D deeply supervised adaptable convolutional network. , 2018, Frontiers in bioscience.

[6]  Christos Davatzikos,et al.  Machine learning in neuroimaging: Progress and challenges , 2019, NeuroImage.

[7]  M. Sugishita,et al.  [Clinical Dementia Rating (CDR)]. , 2011, Nihon rinsho. Japanese journal of clinical medicine.

[8]  Leslie Pack Kaelbling,et al.  Generalization in Deep Learning , 2017, ArXiv.

[9]  Polina Golland,et al.  Permutation Tests for Classification: Towards Statistical Significance in Image-Based Studies , 2003, IPMI.

[10]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[11]  Ninon Burgos,et al.  Convolutional Neural Networks for Classification of Alzheimer's Disease: Overview and Reproducible Evaluation , 2019, Medical Image Anal..

[12]  Juan Manuel Górriz,et al.  Spatial component analysis of MRI data for Alzheimer's disease diagnosis: a Bayesian network approach , 2014, Front. Comput. Neurosci..

[13]  Li Su,et al.  Autosomal dominantly inherited alzheimer disease: Analysis of genetic subgroups by machine learning , 2020, Inf. Fusion.

[14]  Ryota Tomioka,et al.  Norm-Based Capacity Control in Neural Networks , 2015, COLT.

[15]  Nick C Fox,et al.  Modeling autosomal dominant Alzheimer's disease with machine learning , 2021, Alzheimer's & dementia : the journal of the Alzheimer's Association.

[16]  F J Martinez-Murcia,et al.  A Spherical Brain Mapping of MR Images for the Detection of Alzheimer's Disease. , 2016, Current Alzheimer research.

[17]  J. Morris,et al.  Clinical Dementia Rating: A Reliable and Valid Diagnostic and Staging Measure for Dementia of the Alzheimer Type , 1997, International Psychogeriatrics.

[18]  Gaël Varoquaux,et al.  Cross-validation failure: Small sample sizes lead to large error bars , 2017, NeuroImage.

[19]  J. Ramírez,et al.  Machine-learning neuroimaging challenge for automated diagnosis of mild cognitive impairment: Lessons learnt , 2018, Journal of Neuroscience Methods.

[20]  Gemma C. Garriga,et al.  Permutation Tests for Studying Classifier Performance , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[21]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[22]  P. Bouvry,et al.  Reducing Overfitting and Improving Generalization in Training Convolutional Neural Network (CNN) under Limited Sample Sizes in Image Recognition , 2020, 2020 - 5th International Conference on Information Technology (InCIT).

[23]  Andrés Ortiz,et al.  Ensembles of Deep Learning Architectures for the Early Diagnosis of the Alzheimer's Disease , 2016, Int. J. Neural Syst..

[24]  G. Schellenberg,et al.  Candidate gene for the chromosome 1 familial Alzheimer's disease locus , 1995, Science.

[25]  for the Alzheimer's Disease Neuroimaging Initiative,et al.  Random forest feature selection, fusion and ensemble strategy: Combining multiple morphological MRI measures to discriminate among healhy elderly, MCI, cMCI and alzheimer’s disease patients: From the alzheimer’s disease neuroimaging initiative (ADNI) database , 2017, Journal of Neuroscience Methods.

[26]  I. Álvarez,et al.  SVM-based CAD system for early detection of the Alzheimer's disease using kernel PCA and LDA , 2009, Neuroscience Letters.

[27]  Antonio Cerasa,et al.  Editorial on special issue: Machine learning on MCI , 2018, Journal of Neuroscience Methods.

[28]  Andrew J. Saykin,et al.  Deep Learning in Alzheimer's Disease: Diagnostic Classification and Prognostic Prediction Using Neuroimaging Data , 2019, Front. Aging Neurosci..

[29]  Nathan Srebro,et al.  Exploring Generalization in Deep Learning , 2017, NIPS.

[30]  Feng Zhang,et al.  Automated MRI-Based Deep Learning Model for Detection of Alzheimer's Disease Process , 2020, Int. J. Neural Syst..

[31]  Juan Manuel Górriz,et al.  Independent Component Analysis-Support Vector Machine-Based Computer-Aided Diagnosis System for Alzheimer's with Visual Support , 2017, Int. J. Neural Syst..

[32]  John Suckling,et al.  On the computation of distribution-free performance bounds: Application to small sample sizes in neuroimaging , 2019, Pattern Recognit..

[33]  Reisa A. Sperling,et al.  Longitudinal Change in CSF Biomarkers in Autosomal-Dominant Alzheimer’s Disease , 2014, Science Translational Medicine.

[34]  Juan Manuel Górriz,et al.  A comparative study of feature extraction methods for the diagnosis of Alzheimer's disease using the ADNI database , 2012, Neurocomputing.

[35]  Hans Knutsson,et al.  Cluster failure: Why fMRI inferences for spatial extent have inflated false-positive rates , 2016, Proceedings of the National Academy of Sciences.

[36]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[37]  Stefan J. Teipel,et al.  The relative importance of imaging markers for the prediction of Alzheimer's disease dementia in mild cognitive impairment — Beyond classical regression , 2015, NeuroImage: Clinical.

[38]  Mark E. Schmidt,et al.  The Alzheimer’s Disease Neuroimaging Initiative: A review of papers published since its inception , 2012, Alzheimer's & Dementia.

[39]  Karl J. Friston,et al.  Unified segmentation , 2005, NeuroImage.

[40]  Jelle J. Goeman,et al.  Better-than-chance classification for signal detection. , 2016, Biostatistics.

[41]  Shai Ben-David,et al.  Understanding Machine Learning: From Theory to Algorithms , 2014 .

[42]  M. Kenward,et al.  An Introduction to the Bootstrap , 2007 .

[43]  Yann LeCun,et al.  Measuring the VC-Dimension of a Learning Machine , 1994, Neural Computation.

[44]  Peter L. Bartlett,et al.  Vapnik-Chervonenkis dimension of neural nets , 2003 .

[45]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[46]  Tom M. Mitchell,et al.  Machine learning classifiers and fMRI: A tutorial overview , 2009, NeuroImage.

[47]  A M Dale,et al.  Measuring the thickness of the human cerebral cortex from magnetic resonance images. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[48]  C. Keysers,et al.  An introduction to anatomical ROI-based fMRI classification analysis , 2009, Brain Research.

[49]  Giovanni Montana,et al.  Predicting Alzheimer's disease: a neuroimaging study with 3D convolutional neural networks , 2015, ICPRAM 2015.

[50]  P. Good,et al.  Permutation Tests: A Practical Guide to Resampling Methods for Testing Hypotheses , 1995 .

[51]  Mutation of the Alzheimer's disease amyloid gene in hereditary cerebral hemorrhage, Dutch type. , 1990, Science.

[52]  N. Tzourio-Mazoyer,et al.  Automated Anatomical Labeling of Activations in SPM Using a Macroscopic Anatomical Parcellation of the MNI MRI Single-Subject Brain , 2002, NeuroImage.

[53]  Roman Rosipal,et al.  Overview and Recent Advances in Partial Least Squares , 2005, SLSFS.

[54]  Samy Bengio,et al.  Understanding deep learning requires rethinking generalization , 2016, ICLR.

[55]  Bruce W. Suter,et al.  The multilayer perceptron as an approximation to a Bayes optimal discriminant function , 1990, IEEE Trans. Neural Networks.

[56]  Diego Castillo-Barnes,et al.  Studying the Manifold Structure of Alzheimer's Disease: A Deep Learning Approach Using Convolutional Autoencoders , 2020, IEEE Journal of Biomedical and Health Informatics.

[57]  Diego Castillo-Barnes,et al.  Optimized One vs One Approach in Multiclass Classification for Early Alzheimer’s Disease and Mild Cognitive Impairment Diagnosis , 2020, IEEE Access.

[58]  J.M. Gorriz,et al.  Statistical agnostic mapping: a framework in neuroimaging based on concentration inequalities , 2019, bioRxiv.

[59]  Arthur W. Toga,et al.  Sharing data in the global alzheimer's association interactive network , 2016, NeuroImage.

[60]  Pierre Baldi,et al.  Autoencoders, Unsupervised Learning, and Deep Architectures , 2011, ICML Unsupervised and Transfer Learning.

[61]  Philip T. Reiss,et al.  Cross-validation and hypothesis testing in neuroimaging: An irenic comment on the exchange between Friston and Lindquist et al. , 2015, NeuroImage.

[62]  Taghi M. Khoshgoftaar,et al.  A survey on Image Data Augmentation for Deep Learning , 2019, Journal of Big Data.

[63]  Seong-Whan Lee,et al.  Latent feature representation with stacked auto-encoder for AD/MCI diagnosis , 2013, Brain Structure and Function.

[64]  Yi Chen,et al.  Statistical inference and multiple testing correction in classification-based multi-voxel pattern analysis (MVPA): Random permutations and cluster size control , 2011, NeuroImage.

[65]  Karl J. Friston,et al.  Statistical parametric maps in functional imaging: A general linear approach , 1994 .

[66]  Francisco Jesús Martínez-Murcia,et al.  Convolutional Neural Networks for Neuroimaging in Parkinson's Disease: Is Preprocessing Needed? , 2018, Int. J. Neural Syst..

[67]  Nick C Fox,et al.  Predicting sporadic Alzheimer’s disease progression via inherited Alzheimer’s disease-informed machine-learning , 2020, Alzheimer's & dementia : the journal of the Alzheimer's Association.

[68]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[69]  Vijaya L. Melnick,et al.  Alzheimer’s Dementia , 1985, Contemporary Issues in Biomedicine, Ethics, and Society.

[70]  D. Louis Collins,et al.  Early Prediction of Alzheimer's Disease Progression Using Variational Autoencoders , 2019, MICCAI.