Cluster failure: Why fMRI inferences for spatial extent have inflated false-positive rates

Significance Functional MRI (fMRI) is 25 years old, yet surprisingly its most common statistical methods have not been validated using real data. Here, we used resting-state fMRI data from 499 healthy controls to conduct 3 million task group analyses. Using this null data with different experimental designs, we estimate the incidence of significant results. In theory, we should find 5% false positives (for a significance threshold of 5%), but instead we found that the most common software packages for fMRI analysis (SPM, FSL, AFNI) can result in false-positive rates of up to 70%. These results question the validity of a number of fMRI studies and may have a large impact on the interpretation of weakly significant neuroimaging results. The most widely used task functional magnetic resonance imaging (fMRI) analyses use parametric statistical methods that depend on a variety of assumptions. In this work, we use real resting-state data and a total of 3 million random task group analyses to compute empirical familywise error rates for the fMRI software packages SPM, FSL, and AFNI, as well as a nonparametric permutation method. For a nominal familywise error rate of 5%, the parametric statistical methods are shown to be conservative for voxelwise inference and invalid for clusterwise inference. Our results suggest that the principal cause of the invalid cluster inferences is spatial autocorrelation functions that do not follow the assumed Gaussian shape. By comparison, the nonparametric permutation test is found to produce nominal results for voxelwise as well as clusterwise inference. These findings speak to the need of validating the statistical methods being used in the field of neuroimaging.

[1]  Stephen M. Smith,et al.  Threshold-free cluster enhancement: Addressing problems of smoothing, threshold dependence and localisation in cluster inference , 2009, NeuroImage.

[2]  Andreas Meyer-Lindenberg,et al.  False positives in imaging genetics , 2008, NeuroImage.

[3]  Satrajit S. Ghosh,et al.  Data sharing in neuroimaging research , 2012, Front. Neuroinform..

[4]  Karl J. Friston,et al.  A unified statistical approach for determining significant signals in images of cerebral activation , 1996, Human brain mapping.

[5]  Christian Windischberger,et al.  Toward discovery science of human brain function , 2010, Proceedings of the National Academy of Sciences.

[6]  Karl J. Friston,et al.  Assessing the significance of focal activations using their spatial extent , 1994, Human brain mapping.

[7]  William A. Cunningham,et al.  Type I and Type II error concerns in fMRI research: re-balancing the scale. , 2009, Social cognitive and affective neuroscience.

[8]  Krzysztof J. Gorgolewski,et al.  Making big data open: data sharing in neuroimaging , 2014, Nature Neuroscience.

[9]  A. Mechelli,et al.  False positive rates in Voxel-based Morphometry studies of the human brain: Should we be worried? , 2015, Neuroscience & Biobehavioral Reviews.

[10]  N. Logothetis What we can do and what we cannot do with fMRI , 2008, Nature.

[11]  Anders Eklund,et al.  BROCCOLI: Software for fast fMRI analysis on many-core CPUs and GPUs , 2014, Front. Neuroinform..

[12]  Nick C Fox,et al.  The Alzheimer's disease neuroimaging initiative (ADNI): MRI methods , 2008, Journal of magnetic resonance imaging : JMRI.

[13]  Johan Wessberg,et al.  A Monte Carlo method for locally multivariate brain mapping , 2011, NeuroImage.

[14]  Anjali Krishnan,et al.  Cluster-extent based thresholding in fMRI analyses: Pitfalls and recommendations , 2014, NeuroImage.

[15]  Thomas E. Nichols,et al.  Post-hoc power estimation for topological inference in fMRI , 2014, NeuroImage.

[16]  Thomas E. Nichols,et al.  Validating cluster size inference: random field and permutation methods , 2003, NeuroImage.

[17]  John Suckling,et al.  Global, voxel, and cluster tests, by theory and permutation, for a difference between two groups of structural MR images of the brain , 1999, IEEE Transactions on Medical Imaging.

[18]  Sabrina M. Tom,et al.  The Neural Basis of Loss Aversion in Decision-Making Under Risk , 2007, Science.

[19]  Ravi S. Menon,et al.  Intrinsic signal changes accompanying sensory stimulation: functional brain mapping with magnetic resonance imaging. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[20]  Thomas E. Nichols,et al.  Nonparametric permutation tests for functional neuroimaging: A primer with examples , 2002, Human brain mapping.

[21]  C. Jack,et al.  Alzheimer's Disease Neuroimaging Initiative , 2008 .

[22]  David A. Leopold,et al.  Dynamic functional connectivity: Promise, issues, and interpretations , 2013, NeuroImage.

[23]  Karsten Mueller,et al.  Commentary: Cluster failure: Why fMRI inferences for spatial extent have inflated false-positive rates , 2017, Front. Hum. Neurosci..

[24]  Mark W. Woolrich,et al.  FSL , 2012, NeuroImage.

[25]  Essa Yacoub,et al.  The WU-Minn Human Connectome Project: An overview , 2013, NeuroImage.

[26]  Stephen M. Smith,et al.  Permutation inference for the general linear model , 2014, NeuroImage.

[27]  Mark W. Woolrich,et al.  Multilevel linear modelling for FMRI group analysis using Bayesian inference , 2004, NeuroImage.

[28]  Joshua Carp,et al.  The secret lives of experiments: Methods reporting in the fMRI literature , 2012, NeuroImage.

[29]  Andrea Bergmann,et al.  Statistical Parametric Mapping The Analysis Of Functional Brain Images , 2016 .

[30]  B. Biswal,et al.  Functional connectivity in the motor cortex of resting human brain using echo‐planar mri , 1995, Magnetic resonance in medicine.

[31]  Joseph T. Devlin,et al.  Consistency and variability in functional localisers , 2009, NeuroImage.

[32]  Hans Knutsson,et al.  Fast Random Permutation Tests Enable Objective Evaluation of Methods for Single-Subject fMRI Analysis , 2011, Int. J. Biomed. Imaging.

[33]  Thomas E. Nichols,et al.  False positives in neuroimaging genetics using voxel-based morphometry data , 2011, NeuroImage.

[34]  Clare Kelly Toward Discovery Science of Human Brain Function: Development , 2010 .

[35]  Oluwasanmi Koyejo,et al.  Toward open sharing of task-based fMRI data: the OpenfMRI project , 2013, Front. Neuroinform..

[36]  Daniel S. Margulies,et al.  NeuroVault.org: A repository for sharing unthresholded statistical maps, parcellations, and atlases of the human brain , 2016, NeuroImage.

[37]  Jeffrey M. Zacks,et al.  Searchlight analysis: Promise, pitfalls, and potential , 2013, NeuroImage.

[38]  J. Ioannidis Why Most Published Research Findings Are False , 2005, PLoS medicine.

[39]  John Ashburner,et al.  SPM: A history , 2012, NeuroImage.

[40]  R W Cox,et al.  AFNI: software for analysis and visualization of functional magnetic resonance neuroimages. , 1996, Computers and biomedical research, an international journal.

[41]  Hans Knutsson,et al.  Does Parametric Fmri Analysis with Spm Yield Valid Results? -an Empirical Study of 1484 Rest Datasets Does Parametric Fmri Analysis with Spm Yield Valid Results? - an Empirical Study of 1484 Rest Datasets , 2022 .

[42]  Hans Knutsson,et al.  Adaptive analysis of fMRI data , 2003, NeuroImage.

[43]  Thomas E. Nichols,et al.  Adjusting the effect of nonstationarity in cluster-based and TFCE inference , 2011, NeuroImage.

[44]  松田 佳尚 Organization for Human Brain Mapping(OHBM)2010の報告 , 2010 .

[45]  Keith J. Worsley,et al.  Applications of Random Fields in Human Brain Mapping , 2001 .

[46]  Hui Zhang,et al.  Cluster mass inference via random field theory , 2009, NeuroImage.

[47]  Yves Rosseel,et al.  A Review of fMRI Simulation Studies , 2014, PloS one.

[48]  Brian A. Nosek,et al.  Power failure: why small sample size undermines the reliability of neuroscience , 2013, Nature Reviews Neuroscience.

[49]  Karl J. Friston,et al.  Voxel-Based Morphometry—The Methods , 2000, NeuroImage.

[50]  Hans Knutsson,et al.  Empirically investigating the statistical validity of SPM, FSL and AFNI for single subject fMRI analysis , 2015, 2015 IEEE 12th International Symposium on Biomedical Imaging (ISBI).

[51]  Abraham Z. Snyder,et al.  A method for using blocked and event-related fMRI data to study “resting state” functional connectivity , 2007, NeuroImage.

[52]  Thomas E. Nichols,et al.  Nonstationary cluster-size inference with random field and permutation methods , 2004, NeuroImage.

[53]  Thomas J. Santner,et al.  Testing for Activation in Data from FMRI Experiments , 2021, Journal of Data Science.

[54]  Nikolaus Kriegeskorte,et al.  Artifactual time‐course correlations in echo‐planar fMRI with implications for studies of brain function , 2008, Int. J. Imaging Syst. Technol..

[55]  Jonathan D. Cohen,et al.  Improved Assessment of Significant Activation in Functional Magnetic Resonance Imaging (fMRI): Use of a Cluster‐Size Threshold , 1995, Magnetic resonance in medicine.

[56]  S C Williams,et al.  Generic brain activation mapping in functional magnetic resonance imaging: a nonparametric approach. , 1997, Magnetic resonance imaging.

[57]  Giuseppe Sartori,et al.  When the single matters more than the group: Very high false positive rates in single case Voxel Based Morphometry , 2013, NeuroImage.

[58]  Anders Eklund,et al.  Medical image processing on the GPU - Past, present and future , 2013, Medical Image Anal..

[59]  Thomas E. Nichols,et al.  Controlling the familywise error rate in functional neuroimaging: a comparative review , 2003, Statistical methods in medical research.

[60]  Tim Curran,et al.  Extending Local Canonical Correlation Analysis to Handle General Linear Contrasts for fMRI Data , 2012, Int. J. Biomed. Imaging.

[61]  Karl J. Friston,et al.  Robust Smoothness Estimation in Statistical Parametric Maps Using Standardized Residuals from the General Linear Model , 1999, NeuroImage.