Statistical agnostic mapping: a framework in neuroimaging based on concentration inequalities

In the 70s a novel branch of statistics emerged focusing its effort in selecting a function in the pattern recognition problem, which fulfils a definite relationship between the quality of the approximation and its complexity. These data-driven approaches are mainly devoted to problems of estimating dependencies with limited sample sizes and comprise all the empirical out-of sample generalization approaches, e.g. cross validation (CV) approaches. Although the latter are not designed for testing competing hypothesis or comparing different models in neuroimaging, there are a number of theoretical developments within this theory which could be employed to derive a Statistical Agnostic (non-parametric) Mapping (SAM) at voxel or multi-voxel level. Moreover, SAMs could relieve i) the problem of instability in limited sample sizes when estimating the actual risk via the CV approaches, e.g. large error bars, and provide ii) an alternative way of Family-wise-error (FWE) corrected p-value maps in inferential statistics for hypothesis testing. In this sense, we propose a novel framework in neuroimaging based on concentration inequalities, which results in (i) a rigorous development for model validation with a small sample/dimension ratio, and (ii) a less-conservative procedure than FWE p-value correction, to determine the brain significance maps from the inferences made using small upper bounds of the actual risk.

[1]  J. Morris,et al.  The diagnosis of dementia due to Alzheimer’s disease: Recommendations from the National Institute on Aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer's disease , 2011, Alzheimer's & Dementia.

[2]  John Suckling,et al.  On the computation of distribution-free performance bounds: Application to small sample sizes in neuroimaging , 2019, Pattern Recognit..

[3]  Shai Ben-David,et al.  Understanding Machine Learning: From Theory to Algorithms , 2014 .

[4]  Jonathan Flint,et al.  Confidence and precision increase with high statistical power , 2013, Nature Reviews Neuroscience.

[5]  G. Lugosi,et al.  Data-dependent margin-based generalization bounds for classification , 2003 .

[6]  Karl J. Friston,et al.  Statistical parametric maps in functional imaging: A general linear approach , 1994 .

[7]  David Haussler,et al.  Decision Theoretic Generalizations of the PAC Model for Neural Net and Other Learning Applications , 1992, Inf. Comput..

[8]  Karl J. Friston Ten ironic rules for non-statistical reviewers , 2012, NeuroImage.

[9]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[10]  Karl J. Friston,et al.  Human Brain Function, Second Edition , 2004 .

[11]  J. Morris,et al.  The diagnosis of dementia due to Alzheimer's disease: Recommendations from the National Institute on Aging and the Alzheimer's Association workgroup , 2011 .

[12]  J. V. Haxby,et al.  Spatial Pattern Analysis of Functional Brain Images Using Partial Least Squares , 1996, NeuroImage.

[13]  Martin A. Lindquist,et al.  Ironing out the statistical wrinkles in “ten ironic rules” , 2013, NeuroImage.

[14]  Diego Castillo-Barnes,et al.  Studying the Manifold Structure of Alzheimer's Disease: A Deep Learning Approach Using Convolutional Autoencoders , 2020, IEEE Journal of Biomedical and Health Informatics.

[15]  S. Shelah A combinatorial problem; stability and order for models and theories in infinitary languages. , 1972 .

[16]  C. Jack,et al.  NIA-AA Research Framework: Toward a biological definition of Alzheimer’s disease , 2018, Alzheimer's & Dementia.

[17]  Karl J. Friston Sample size and the fallacies of classical inference , 2013, NeuroImage.

[18]  Gaël Varoquaux,et al.  Cross-validation failure: Small sample sizes lead to large error bars , 2017, NeuroImage.

[19]  Karl J. Friston,et al.  Dynamic causal modeling for EEG and MEG , 2009, Human brain mapping.

[20]  Thomas M. Cover,et al.  Geometrical and Statistical Properties of Systems of Linear Inequalities with Applications in Pattern Recognition , 1965, IEEE Trans. Electron. Comput..

[21]  Philip T. Reiss,et al.  Cross-validation and hypothesis testing in neuroimaging: An irenic comment on the exchange between Friston and Lindquist et al. , 2015, NeuroImage.

[22]  Vanessa Gómez-Verdejo,et al.  Sign-consistency based variable importance for machine learning in brain imaging , 2017 .

[23]  Mathukumalli Vidyasagar Applications to Neural Networks , 2003 .

[24]  Vladimir Vapnik,et al.  Estimation of Dependences Based on Empirical Data: Springer Series in Statistics (Springer Series in Statistics) , 1982 .

[25]  Francisco Jesús Martínez-Murcia,et al.  A Machine Learning Approach to Reveal the NeuroPhenotypes of Autisms , 2019, Int. J. Neural Syst..

[26]  H. Zaidi,et al.  Quantitative Analysis in Nuclear Medicine Imaging , 2007, Journal of Nuclear Medicine.

[27]  M. Hallett Human Brain Function , 1998, Trends in Neurosciences.

[28]  Janaina Mourão Miranda,et al.  Classifying brain states and determining the discriminating activation patterns: Support Vector Machine on functional MRI data , 2005, NeuroImage.

[29]  John Shawe-Taylor,et al.  Correction to "SCoRS - A Method Based on Stability for Feature Selection and Mapping in Neuroimaging" , 2014, IEEE Trans. Medical Imaging.

[30]  F Segovia,et al.  Automatic assistance to Parkinson's disease diagnosis in DaTSCAN SPECT imaging. , 2012, Medical physics.

[31]  N. Tzourio-Mazoyer,et al.  Automated Anatomical Labeling of Activations in SPM Using a Macroscopic Anatomical Parcellation of the MNI MRI Single-Subject Brain , 2002, NeuroImage.

[32]  Danilo Bzdok,et al.  Classical Statistics and Statistical Learning in Imaging Neuroscience , 2016, Front. Neurosci..

[33]  Roman Rosipal,et al.  Overview and Recent Advances in Partial Least Squares , 2005, SLSFS.

[34]  Norbert Sauer,et al.  On the Density of Families of Sets , 1972, J. Comb. Theory A.

[35]  Alan C. Evans,et al.  Prediction of brain maturity based on cortical thickness at different spatial resolutions , 2015, NeuroImage.

[36]  Rainer Goebel,et al.  Combining multivariate voxel selection and support vector machines for mapping and classification of fMRI spatial patterns , 2008, NeuroImage.

[37]  Vladimir Vapnik,et al.  Chervonenkis: On the uniform convergence of relative frequencies of events to their probabilities , 1971 .

[38]  P. Massart Some applications of concentration inequalities to statistics , 2000 .

[39]  Colin McDiarmid,et al.  Surveys in Combinatorics, 1989: On the method of bounded differences , 1989 .

[40]  John Shawe-Taylor,et al.  SCoRS—A Method Based on Stability for Feature Selection and Mapping in Neuroimaging , 2014, IEEE Transactions on Medical Imaging.

[41]  J. Ioannidis Why Most Published Research Findings Are False , 2005, PLoS medicine.