Feeding the machine: Challenges to reproducible predictive modeling in resting-state connectomics

In this critical review, we examine the application of predictive models, e.g., classifiers, trained using machine learning (ML) to assist in the interpretation of functional neuroimaging data. Our primary goal is to summarize how ML is being applied and to critically assess common practices. Our review covers 250 published studies that used ML and resting-state functional MRI (fMRI) to infer various dimensions of the human functional connectome. Hold-out (“lockbox”) performance was, on average, ~13% less accurate than performance measured through cross-validation alone, highlighting the importance of lockbox data, which were included in only 16% of the studies. There was also a concerning lack of transparency across the key steps in training and evaluating predictive models. The summary of this literature underscores the importance of using a lockbox and highlights several methodological pitfalls that can be addressed by the imaging community. We argue that, ideally, studies are motivated both by the reproducibility and generalizability of findings and by the potential clinical significance of the insights. We offer recommendations for the principled integration of ML into the clinical neurosciences, with the goal of advancing imaging biomarkers of brain disorders, understanding causative determinants of health risks, and parsing heterogeneous patient outcomes.
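The distinction between cross-validation and a lockbox can be illustrated with a minimal sketch. It is an illustration under stated assumptions rather than the procedure of any reviewed study: the function names `lockbox_split` and `k_fold`, the 20% lockbox fraction, and the five folds are all hypothetical choices. The point it shows is that the lockbox indices are set aside once, before any model selection, and never enter a cross-validation fold.

```python
import random


def lockbox_split(n_samples, lockbox_fraction=0.2, seed=0):
    """Shuffle sample indices once and set aside a lockbox (hypothetical helper).

    Returns (development, lockbox). The lockbox is meant to be evaluated a
    single time, after all model selection on the development data is done.
    """
    if n_samples < 2:
        raise ValueError("need at least two samples")
    if not 0.0 < lockbox_fraction < 1.0:
        raise ValueError("lockbox_fraction must be between 0 and 1")
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Keep at least one sample on each side of the split.
    n_lockbox = min(n_samples - 1, max(1, round(n_samples * lockbox_fraction)))
    return indices[n_lockbox:], indices[:n_lockbox]


def k_fold(development, k=5):
    """Yield (train, validation) index lists drawn only from development data."""
    folds = [development[i::k] for i in range(k)]
    for i, validation in enumerate(folds):
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, validation


# Assumed sizes: 100 subjects, 20% lockbox, 5-fold cross-validation.
development, lockbox = lockbox_split(100, lockbox_fraction=0.2, seed=42)
splits = list(k_fold(development, k=5))

# No lockbox subject ever appears in a training or validation fold.
for train, validation in splits:
    assert not set(lockbox) & set(train + validation)
```

In a real analysis, every data-dependent step (feature selection, hyperparameter tuning, thresholding) would be fit inside the cross-validation loop on `development`, and the final model would be scored on `lockbox` only once.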
