Toward a unified framework for interpreting machine-learning models in neuroimaging

Machine learning is a powerful tool for creating computational models relating brain function to behavior, and its use is becoming widespread in neuroscience. However, these models are complex and often hard to interpret, making it difficult to evaluate their neuroscientific validity and contribution to understanding the brain. For neuroimaging-based machine-learning models to be interpretable, they should (i) be comprehensible to humans, (ii) provide useful information about what mental or behavioral constructs are represented in particular brain pathways or regions, and (iii) demonstrate that they are based on relevant neurobiological signal, not artifacts or confounds. In this protocol, we introduce a unified framework that consists of model-, feature- and biology-level assessments to provide complementary results that support the understanding of how and why a model works. Although the framework can be applied to different types of models and data, this protocol provides practical tools and examples of selected analysis methods for a functional MRI dataset and multivariate pattern-based predictive models. A user of the protocol should be familiar with basic programming in MATLAB or Python. This protocol will help build more interpretable neuroimaging-based machine-learning models, contributing to the cumulative understanding of brain mechanisms and brain health. Although the analyses provided here constitute a limited set of tests and take a few hours to days to complete, depending on the size of data and available computational resources, we envision the process of annotating and interpreting models as an open-ended process, involving collaborative efforts across multiple studies and laboratories. Neuroimaging-based machine-learning models should be interpretable to neuroscientists and users in applied settings. This protocol describes how to assess the interpretability of models based on fMRI.

[1]  Christopher L. Asplund,et al.  The organization of the human cerebellum estimated by intrinsic functional connectivity. , 2011, Journal of neurophysiology.

[2]  Viktor K. Jirsa,et al.  The Virtual Brain Integrates Computational Modeling and Multimodal Neuroimaging , 2013, Brain Connect..

[3]  Gennady Erlikhman,et al.  Decoding information about dynamically occluded objects in visual cortex , 2017, NeuroImage.

[4]  Klaus-Robert Müller,et al.  Feature Importance Measure for Non-linear Learning Algorithms , 2016, ArXiv.

[5]  Luke J. Chang,et al.  A Sensitive and Specific Neural Signature for Picture-Induced Negative Affect , 2015, PLoS biology.

[6]  Clifford R. Jack,et al.  Antemortem MRI based STructural Abnormality iNDex (STAND)-scores correlate with postmortem Braak neurofibrillary tangle stage , 2008, NeuroImage.

[7]  Alexander Binder,et al.  On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation , 2015, PloS one.

[8]  Masa-aki Sato,et al.  Sparse estimation automatically selects voxels relevant for the decoding of fMRI activity patterns , 2008, NeuroImage.

[9]  Dustin Scheinost,et al.  Ten simple rules for predictive modeling of individual differences in neuroimaging , 2019, NeuroImage.

[10]  Martin P. Paulus,et al.  Pragmatism Instead of Mechanism: A Call for Impactful Biological Psychiatry. , 2015, JAMA psychiatry.

[11]  Sara E. Berger,et al.  The indirect pathway of the nucleus accumbens shell amplifies neuropathic pain , 2015, Nature Neuroscience.

[12]  Jin Fan,et al.  Somatic and vicarious pain are represented by dissociable multivariate brain patterns , 2016, eLife.

[13]  A. Anderson,et al.  Respiratory effects in human functional magnetic resonance imaging due to bulk susceptibility changes. , 2001, Physics in medicine and biology.

[14]  John P. A. Ioannidis,et al.  Exploration, Inference, and Prediction in Neuroscience and Biomedicine , 2019, Trends in Neurosciences.

[15]  Fei-Fei Li,et al.  Visualizing and Understanding Recurrent Networks , 2015, ArXiv.

[16]  H. Francis Song,et al.  Machine Theory of Mind , 2018, ICML.

[17]  John-Dylan Haynes,et al.  The Neural Representation of Voluntary Task-Set Selection in Dynamic Environments. , 2015, Cerebral cortex.

[18]  Lina M. Tran,et al.  Chemogenetic Interrogation of a Brain-wide Fear Memory Network in Mice , 2017, Neuron.

[19]  Jouko Lampinen,et al.  Reproducibility of importance extraction methods in neural network based fMRI classification , 2017, NeuroImage.

[20]  Cynthia Rudin,et al.  Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead , 2018, Nature Machine Intelligence.

[21]  Janaina Mourão Miranda,et al.  Stability-Based Multivariate Mapping Using SCoRS , 2013, 2013 International Workshop on Pattern Recognition in Neuroimaging.

[22]  Danielle S Bassett,et al.  Mitigating head motion artifact in functional connectivity MRI , 2018, Nature Protocols.

[23]  Danilo Bzdok,et al.  Hierarchical Region-Network Sparsity for High-Dimensional Inference in Brain Imaging , 2017, IPMI.

[24]  Joachim M. Buhmann,et al.  Decoding the perception of pain from fMRI using multivariate pattern analysis , 2012, NeuroImage.

[25]  Daniel S Pine,et al.  Biomarkers With a Mechanistic Focus. , 2015, JAMA psychiatry.

[26]  L. K. Hansen,et al.  Activation pattern reproducibility: Measuring the effects of group size and data analysis models , 1997, Human brain mapping.

[27]  Y Kamitani,et al.  Neural Decoding of Visual Imagery During Sleep , 2013, Science.

[28]  J. Haynes A Primer on Pattern-Based Approaches to fMRI: Principles, Pitfalls, and Perspectives , 2015, Neuron.

[29]  Arvind Narayanan,et al.  Semantics derived automatically from language corpora contain human-like biases , 2016, Science.

[30]  Andrew T. Drysdale,et al.  Resting-state connectivity biomarkers define neurophysiological subtypes of depression , 2016, Nature Medicine.

[31]  Nikolaus Kriegeskorte,et al.  Deep Supervised, but Not Unsupervised, Models May Explain IT Cortical Representation , 2014, PLoS Comput. Biol..

[32]  Claudia Plant,et al.  Decoding an individual's sensitivity to pain from the multivariate analysis of EEG data. , 2012, Cerebral cortex.

[33]  César Caballero-Gaudes,et al.  Methods for cleaning the BOLD fMRI signal , 2016, NeuroImage.

[34]  Thomas E. Nichols,et al.  A Bayesian Model of Category-Specific Emotional Brain Responses , 2015, PLoS Comput. Biol..

[35]  Krzysztof J. Gorgolewski,et al.  Making big data open: data sharing in neuroimaging , 2014, Nature Neuroscience.

[36]  Jieping Ye,et al.  Sparse learning and stability selection for predicting MCI to AD conversion using baseline ADNI data , 2012, BMC Neurology.

[37]  Krzysztof J. Gorgolewski,et al.  MRIQC: Advancing the automatic prediction of image quality in MRI from unseen sites , 2016, bioRxiv.

[38]  Yarimar Carrasquillo,et al.  Hemispheric lateralization of a molecular signal for pain modulation in the amygdala , 2008, Molecular pain.

[39]  Andrew Zisserman,et al.  Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps , 2013, ICLR.

[40]  Wolfgang M. Pauli,et al.  Regional specialization within the human striatum for diverse psychological functions , 2016, Proceedings of the National Academy of Sciences.

[41]  Tor D. Wager,et al.  Emotion schemas are embedded in the human visual system , 2018, Science Advances.

[42]  David Hinkley,et al.  Bootstrap Methods: Another Look at the Jackknife , 2008 .

[43]  Luke J. Chang,et al.  Building better biomarkers: brain models in translational neuroimaging , 2017, Nature Neuroscience.

[44]  Carlos Guestrin,et al.  "Why Should I Trust You?": Explaining the Predictions of Any Classifier , 2016, ArXiv.

[45]  Yael Niv,et al.  The Two Cultures of Computational Psychiatry. , 2019, JAMA psychiatry.

[46]  Luke J. Chang,et al.  Multivariate Brain Prediction of Heart Rate and Skin Conductance Responses to Social Threat , 2016, The Journal of Neuroscience.

[47]  Been Kim,et al.  Towards A Rigorous Science of Interpretable Machine Learning , 2017, 1702.08608.

[48]  Giovanni Montana,et al.  Predicting brain age with deep learning from raw imaging data results in a reliable and heritable biomarker , 2016, NeuroImage.

[49]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[50]  Vaidehi S. Natu,et al.  Category-Specific Cortical Activity Precedes Retrieval During Memory Search , 2005, Science.

[51]  Jonathan D. Power,et al.  Recent progress and outstanding issues in motion correction in resting state fMRI , 2015, NeuroImage.

[52]  Sean M. Polyn,et al.  Beyond mind-reading: multi-voxel pattern analysis of fMRI data , 2006, Trends in Cognitive Sciences.

[53]  H. Pashler,et al.  Puzzlingly High Correlations in fMRI Studies of Emotion, Personality, and Social Cognition 1 , 2009, Perspectives on psychological science : a journal of the Association for Psychological Science.

[54]  Sara E. Berger,et al.  Parceling Human Accumbens into Putative Core and Shell Dissociates Encoding of Values for Reward and Pain , 2013, The Journal of Neuroscience.

[55]  M. Breakspear Dynamic models of large-scale brain activity , 2017, Nature Neuroscience.

[56]  Kenji Leibnitz,et al.  Classification and characterisation of brain network changes in chronic back pain: A multicenter study , 2017, bioRxiv.

[57]  Gunnar Rätsch,et al.  The Feature Importance Ranking Measure , 2009, ECML/PKDD.

[58]  Leonie Koban,et al.  Representation, Pattern Information, and Brain Signatures: From Neurons to Neuroimaging , 2018, Neuron.

[59]  R Cameron Craddock,et al.  Disease state prediction from resting state functional connectivity , 2009, Magnetic resonance in medicine.

[60]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[61]  Michael J. Frank,et al.  By Carrot or by Stick: Cognitive Reinforcement Learning in Parkinsonism , 2004, Science.

[62]  Dimitri Van De Ville,et al.  Principal components of functional connectivity: A new approach to study dynamic brain connectivity during rest , 2013, NeuroImage.

[63]  J. DiCarlo,et al.  Using goal-driven deep learning models to understand sensory cortex , 2016, Nature Neuroscience.

[64]  Rainer Goebel,et al.  Information-based functional brain mapping. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[65]  Daniel S. Kermany,et al.  Identifying Medical Diagnoses and Treatable Diseases by Image-Based Deep Learning , 2018, Cell.

[66]  Alexander Mordvintsev,et al.  Inceptionism: Going Deeper into Neural Networks , 2015 .

[67]  Philip A. Kragel,et al.  Decoding Spontaneous Emotional States in the Human Brain , 2016, PLoS biology.

[68]  Paulo J. G. Lisboa,et al.  Making machine learning models interpretable , 2012, ESANN.

[69]  Chen Su,et al.  Activation of Corticostriatal Circuitry Relieves Chronic Neuropathic Pain , 2015, The Journal of Neuroscience.

[70]  Marisa O. Hollinshead,et al.  The organization of the human cerebral cortex estimated by intrinsic functional connectivity. , 2011, Journal of neurophysiology.

[71]  Klaus-Robert Müller,et al.  iNNvestigate neural networks! , 2018, J. Mach. Learn. Res..

[72]  Richard D Riley,et al.  Minimum sample size for developing a multivariable prediction model: Part I – Continuous outcomes , 2018, Statistics in medicine.

[73]  M. Lindquist,et al.  An fMRI-based neurologic signature of physical pain. , 2013, The New England journal of medicine.

[74]  A. Ishai,et al.  Distributed and Overlapping Representations of Faces and Objects in Ventral Temporal Cortex , 2001, Science.

[75]  Scott Lundberg,et al.  A Unified Approach to Interpreting Model Predictions , 2017, NIPS.

[76]  J. S. Guntupalli,et al.  Decoding neural representational spaces using multivariate pattern analysis. , 2014, Annual review of neuroscience.

[77]  Richard D Riley,et al.  Minimum sample size for developing a multivariable prediction model: PART II ‐ binary and time‐to‐event outcomes , 2018, Statistics in medicine.

[78]  M. Chun,et al.  A neuromarker of sustained attention from whole-brain functional connectivity , 2015, Nature Neuroscience.

[79]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[80]  R. Tibshirani,et al.  Sparse Principal Component Analysis , 2006 .

[81]  Tor D Wager,et al.  What reliability can and cannot tell us about pain report and pain neuroimaging , 2016, Pain.

[82]  Essa Yacoub,et al.  The WU-Minn Human Connectome Project: An overview , 2013, NeuroImage.

[83]  P. Elliott,et al.  UK Biobank: An Open Access Resource for Identifying the Causes of a Wide Range of Complex Diseases of Middle and Old Age , 2015, PLoS medicine.

[84]  Ethan Kross,et al.  Discriminating Neural Representations of Physical and Social Pains: How Multivariate Statistics Challenge the 'shared Representation' Theory of Pain Rogachov a Hanna Jr, and Wager Td. Separate Neural Representations for Physical Pain and Social Rejection , 2022 .

[85]  Zachary Chase Lipton The mythos of model interpretability , 2016, ACM Queue.

[86]  Avanti Shrikumar,et al.  Learning Important Features Through Propagating Activation Differences , 2017, ICML.

[87]  Martin A. Lindquist,et al.  Group-regularized individual prediction: theory and application to pain , 2017, NeuroImage.

[88]  Razvan Pascanu,et al.  Vector-based navigation using grid-like representations in artificial agents , 2018, Nature.

[89]  R. O’Reilly Biologically Based Computational Models of High-Level Cognition , 2006, Science.

[90]  Davide Castelvecchi,et al.  Can we open the black box of AI? , 2016, Nature.

[91]  Lars Kai Hansen,et al.  Model sparsity and brain pattern interpretation of classification models in neuroimaging , 2012, Pattern Recognit..

[92]  Andres Hoyos Idrobo,et al.  Assessing and tuning brain decoders: Cross-validation, caveats, and guidelines , 2016, NeuroImage.

[93]  Takeo Watanabe,et al.  A small number of abnormal brain connections predicts adult autism spectrum disorder , 2016, Nature Communications.

[94]  Andrei Irimia,et al.  Multivariate morphological brain signatures predict patients with chronic abdominal pain from healthy control subjects , 2015, Pain.

[95]  M. B. Nebel,et al.  Automated diagnoses of attention deficit hyperactive disorder using magnetic resonance imaging , 2012, Front. Syst. Neurosci..

[96]  Scott M. Lundberg,et al.  Explainable machine-learning predictions for the prevention of hypoxaemia during surgery , 2018, Nature Biomedical Engineering.

[97]  Tor D. Wager,et al.  Empathic Care and Distress: Predictive Brain Markers and Dissociable Brain Systems , 2017, Neuron.

[98]  James V. Haxby,et al.  Multivariate pattern analysis of fMRI: The early beginnings , 2012, NeuroImage.

[99]  David Borsook,et al.  The human amygdala and pain: Evidence from neuroimaging , 2014, Human brain mapping.

[100]  V. Calhoun,et al.  Temporal lobe and “default” hemodynamic brain modes discriminate between schizophrenia and bipolar disorder , 2008, Human brain mapping.

[101]  Jonathan E. Taylor,et al.  Interpretable whole-brain prediction analysis with GraphNet , 2013, NeuroImage.

[102]  A. Vania Apkarian,et al.  A brain signature for acute pain , 2013, Trends in Cognitive Sciences.

[103]  Anders M. Dale,et al.  The Adolescent Brain Cognitive Development (ABCD) study: Imaging acquisition across 21 sites , 2018, Developmental Cognitive Neuroscience.

[104]  Patrick Dupont,et al.  Generalizable Representations of Pain, Cognitive Control, and Negative Emotion in Medial Frontal Cortex , 2017, Nature Neuroscience.

[105]  R. Buckner,et al.  The organization of the human striatum estimated by intrinsic functional connectivity. , 2012, Journal of neurophysiology.

[106]  Cynthia Breazeal,et al.  Machine behaviour , 2019, Nature.

[107]  T. Wager,et al.  Distinct Brain Systems Mediate the Effects of Nociceptive Input and Self-Regulation on Pain , 2015, PLoS biology.

[108]  Daniel S. Margulies,et al.  NeuroVault.org: a web-based repository for collecting and sharing unthresholded statistical maps of the human brain , 2014, bioRxiv.

[109]  Alessandro Rinaldo,et al.  Distribution-Free Predictive Inference for Regression , 2016, Journal of the American Statistical Association.

[110]  D. Hassabis,et al.  Neuroscience-Inspired Artificial Intelligence , 2017, Neuron.

[111]  Jieun Kim,et al.  Decoding Multiple Sound Categories in the Human Temporal Cortex Using High Resolution fMRI , 2015, PloS one.

[112]  Tomoyasu Horikawa,et al.  Generic decoding of seen and imagined objects using hierarchical visual features , 2015, Nature Communications.

[113]  Vincent Frouin,et al.  Structured Sparse Principal Components Analysis With the TV-Elastic Net Penalty , 2016, IEEE Transactions on Medical Imaging.

[114]  Anand D. Sarwate,et al.  Decentralized temporal independent component analysis: Leveraging fMRI data in collaborative settings , 2019, NeuroImage.

[115]  Martin A Lindquist,et al.  Quantifying cerebral contributions to pain beyond nociception , 2017, Nature Communications.

[116]  Russell A. Poldrack,et al.  Large-scale automated synthesis of human functional neuroimaging data , 2011, Nature Methods.

[117]  Patrick D. McDaniel,et al.  Deep k-Nearest Neighbors: Towards Confident, Interpretable and Robust Deep Learning , 2018, ArXiv.

[118]  Dustin Scheinost,et al.  Using connectome-based predictive modeling to predict individual behavior from brain connectivity , 2017, Nature Protocols.

[119]  Conor Liston,et al.  Functional and Optogenetic Approaches to Discovering Stable Subtype-Specific Circuit Mechanisms in Depression , 2018, bioRxiv.

[120]  Rainer Goebel,et al.  Combining multivariate voxel selection and support vector machines for mapping and classification of fMRI spatial patterns , 2008, NeuroImage.

[121]  Krzysztof J. Gorgolewski,et al.  OpenNeuro – a free online platform for sharing and analysis of neuroimaging data , 2017 .

[122]  Alison J. Wiggett,et al.  Patterns of fMRI Activity Dissociate Overlapping Functional Brain Areas that Respond to Biological Motion , 2006, Neuron.

[123]  Lawrence Carin,et al.  Brain-wide Electrical Spatiotemporal Dynamics Encode Depression Vulnerability , 2018, Cell.

[124]  Tom Michael Mitchell,et al.  Predicting Human Brain Activity Associated with the Meanings of Nouns , 2008, Science.

[125]  G. Box Science and Statistics , 1976 .

[126]  Hyoung F. Kim,et al.  Distinct Basal Ganglia Circuits Controlling Behaviors Guided by Flexible and Stable Values , 2013, Neuron.

[127]  Stephen José Hanson,et al.  Combinatorial codes in ventral temporal lobe for object recognition: Haxby (2001) revisited: is there a “face” area? , 2004, NeuroImage.

[128]  Karl J. Friston,et al.  The Dynamic Brain: From Spiking Neurons to Neural Masses and Cortical Fields , 2008, PLoS Comput. Biol..

[129]  Demis Hassabis,et al.  Mastering the game of Go without human knowledge , 2017, Nature.

[130]  F. Cabitza,et al.  Unintended Consequences of Machine Learning in Medicine , 2017, JAMA.

[131]  N. Kriegeskorte,et al.  Author ' s personal copy Representational geometry : integrating cognition , computation , and the brain , 2013 .

[132]  Jamil Zaki,et al.  The Anatomy of Suffering: Understanding the Relationship between Nociceptive and Empathic Pain , 2016, Trends in Cognitive Sciences.

[133]  Luca Baldassarre,et al.  Sparsity Is Better with Stability: Combining Accuracy and Stability for Model Selection in Brain Decoding , 2017, Front. Neurosci..

[134]  R. Goebel,et al.  Pattern classification of valence in depression☆ , 2013, NeuroImage: Clinical.