Exploring the impact of analysis software on task fMRI results

A wealth of analysis tools is available to fMRI researchers for extracting patterns of task variation and, ultimately, understanding cognitive function. However, this ‘methodological plurality’ comes with a drawback. While conceptually similar, two different analysis pipelines applied to the same dataset may not produce the same scientific results. Differences in methods, in implementations across software packages, and even in operating systems or software versions all contribute to this variability. Consequently, attention in the field has recently been directed to reproducibility and data sharing. Neuroimaging is currently experiencing a surge in initiatives to improve research practices and ensure that all conclusions inferred from an fMRI study are replicable. In this work, our goal is to understand how the choice of software package affects analysis results. We use publicly shared data from three published task fMRI neuroimaging studies, reanalyzing each study with the three main neuroimaging software packages, AFNI, FSL and SPM, using both parametric and nonparametric inference. We obtain all information on how to process, analyze, and model each dataset from the publications. We make quantitative and qualitative comparisons between our replications to gauge the scale of variability in our results and to assess the fundamental differences between the software packages. While qualitatively we find broad similarities between packages, we also discover marked differences, such as Dice similarity coefficients ranging from 0.000 to 0.684 in comparisons of thresholded statistic maps between software packages. We discuss the challenges involved in trying to reanalyze the published studies, and highlight our own efforts to make this research reproducible.
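For readers unfamiliar with the Dice similarity coefficient reported above, the following is a minimal sketch of how such a value can be computed for two thresholded statistic maps. It is an illustration under stated assumptions, not the authors' pipeline: the function name `dice_coefficient`, the toy arrays, and the threshold of 3.1 are all hypothetical, and the maps are assumed to be NumPy arrays already resampled to the same voxel grid. The coefficient is twice the number of voxels that survive thresholding in both maps, divided by the total number of surviving voxels across the two maps.

```python
import numpy as np


def dice_coefficient(map_a, map_b, threshold):
    """Dice overlap of two statistic maps after thresholding.

    Hypothetical helper for illustration: both maps are assumed to be
    arrays of the same shape, defined on the same voxel grid.
    """
    mask_a = np.asarray(map_a) > threshold
    mask_b = np.asarray(map_b) > threshold
    if mask_a.shape != mask_b.shape:
        raise ValueError("maps must have the same shape")

    total = int(mask_a.sum()) + int(mask_b.sum())
    if total == 0:
        # Neither map has suprathreshold voxels, so Dice is undefined.
        return float("nan")

    overlap = int(np.logical_and(mask_a, mask_b).sum())
    return 2.0 * overlap / total


# Toy 2x3 "statistic maps" standing in for results from two packages.
stat_map_a = np.array([[0.0, 3.5, 4.0],
                       [2.0, 5.0, 0.1]])
stat_map_b = np.array([[0.0, 3.2, 1.0],
                       [3.3, 4.1, 0.0]])

dice = dice_coefficient(stat_map_a, stat_map_b, threshold=3.1)
print(f"Dice = {dice:.3f}")
```

A value of 0 means the two thresholded maps share no suprathreshold voxels, and a value of 1 means they are identical after thresholding.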
