Bayesian model selection for group studies

Bayesian model selection (BMS) is a powerful method for determining the most likely among a set of competing hypotheses about the mechanisms that generated observed data. BMS has recently found widespread application in neuroimaging, particularly in the context of dynamic causal modelling (DCM). However, so far, combining BMS results from several subjects has relied on simple (fixed effects) metrics, e.g. the group Bayes factor (GBF), that do not account for group heterogeneity or outliers. In this paper, we compare the GBF with two random effects methods for BMS at the between-subject or group level. These methods provide inference on model-space using a classical and Bayesian perspective respectively. First, a classical (frequentist) approach uses the log model evidence as a subject-specific summary statistic. This enables one to use analysis of variance to test for differences in log-evidences over models, relative to inter-subject differences. We then consider the same problem in Bayesian terms and describe a novel hierarchical model, which is optimised to furnish a probability density on the models themselves. This new variational Bayes method rests on treating the model as a random variable and estimating the parameters of a Dirichlet distribution which describes the probabilities for all models considered. These probabilities then define a multinomial distribution over model space, allowing one to compute how likely it is that a specific model generated the data of a randomly chosen subject as well as the exceedance probability of one model being more likely than any other model. Using empirical and synthetic data, we show that optimising a conditional density of the model probabilities, given the log-evidences for each model over subjects, is more informative and appropriate than both the GBF and frequentist tests of the log-evidences. In particular, we found that the hierarchical Bayesian approach is considerably more robust than either of the other approaches in the presence of outliers. We expect that this new random effects method will prove useful for a wide range of group studies, not only in the context of DCM, but also for other modelling endeavours, e.g. comparing different source reconstruction methods for EEG/MEG or selecting among competing computational models of learning and decision-making.

[1]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[2]  Philip K. McGuire,et al.  Fronto-temporal Interactions during Overt Verbal Initiation and Suppression , 2008, Journal of Cognitive Neuroscience.

[3]  Michael I. Jordan Learning in Graphical Models , 1999, NATO ASI Series.

[4]  Olivier Faugeras,et al.  Using nonlinear models in fMRI data analysis: Model selection and activation detection , 2006, NeuroImage.

[5]  E. S. Pearson,et al.  On the Problem of the Most Efficient Tests of Statistical Hypotheses , 1933 .

[6]  H. Akaike A new look at the statistical model identification , 1974 .

[7]  Huaiyu Zhu On Information and Sufficiency , 1997 .

[8]  R. Dolan,et al.  Task and Content Modulate Amygdala-Hippocampal Connectivity in Emotional Retrieval , 2006, Neuron.

[9]  Karl J. Friston,et al.  Degeneracy and cognitive anatomy , 2002, Trends in Cognitive Sciences.

[10]  Karl J. Friston,et al.  Hierarchical Processing of Auditory Objects in Humans , 2007, PLoS Comput. Biol..

[11]  Karl J. Friston,et al.  Dynamic causal modelling of evoked potentials: A reproducibility study , 2007, NeuroImage.

[12]  David J. C. MacKay,et al.  Information Theory, Inference, and Learning Algorithms , 2004, IEEE Transactions on Information Theory.

[13]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[14]  Karl J. Friston,et al.  Interhemispheric Integration of Visual Processing during Task-Driven Lateralization , 2007, The Journal of Neuroscience.

[15]  Karl J. Friston,et al.  Nonlinear Dynamic Causal Models for Fmri Nonlinear Dynamic Causal Models for Fmri Nonlinear Dynamic Causal Models for Fmri , 2022 .

[16]  David L. Woodruff,et al.  General Purpose Metrics for Solution Variety , 2005 .

[17]  Karl J. Friston,et al.  Nonlinear Responses in fMRI: The Balloon Model, Volterra Kernels, and Other Hemodynamics , 2000, NeuroImage.

[18]  Karl J. Friston,et al.  The functional anatomy of the MMN: A DCM study of the roving paradigm , 2008, NeuroImage.

[19]  Mark W. Greenlee,et al.  Connectivity modulation of early visual processing areas during covert and overt tracking tasks , 2008, NeuroImage.

[20]  Douglas C. Noll,et al.  Accounting for nonlinear BOLD effects in fMRI: parameter estimates and a model for prediction in rapid event-related studies , 2005, NeuroImage.

[21]  Martin J. McKeown,et al.  Dynamic Bayesian network modeling of fMRI: A comparison of group-analysis methods , 2008, NeuroImage.

[22]  Geoffrey E. Hinton,et al.  A View of the Em Algorithm that Justifies Incremental, Sparse, and other Variants , 1998, Learning in Graphical Models.

[23]  C. Summerfield,et al.  A Neural Representation of Prior Information during Perceptual Inference , 2008, Neuron.

[24]  Karl J. Friston,et al.  Dynamic causal modelling , 2003, NeuroImage.

[25]  Karl J. Friston,et al.  The Cortical Dynamics of Intelligible Speech , 2008, The Journal of Neuroscience.

[26]  E. S. Pearson,et al.  On the Problem of the Most Efficient Tests of Statistical Hypotheses , 1933 .

[27]  Bahram Alidaee,et al.  Metaheuristic Optimization via Memory and Evolution: Tabu Search and Scatter Search (Operations Research/Computer Science Interfaces Series) , 2005 .

[28]  Karl J. Friston,et al.  Dynamic causal modeling , 2010, Scholarpedia.

[29]  Karl J. Friston,et al.  Population-level inferences for distributed MEG source localization under multiple constraints: Application to face-evoked fields , 2007, NeuroImage.

[30]  Karl J. Friston,et al.  Synaptic Plasticity and Dysconnection in Schizophrenia , 2006, Biological Psychiatry.

[31]  Karl J. Friston,et al.  Integrated Bayesian models of learning and decision making for saccadic eye movements☆ , 2008, Neural Networks.

[32]  Karl J. Friston,et al.  Dynamic causal models of neural system dynamics: current state and future extensions , 2007, Journal of Biosciences.

[33]  K. Amunts,et al.  Effective connectivity of the left BA 44, BA 45, and inferior temporal gyrus during lexical and phonological decisions identified with DCM , 2009, Human brain mapping.

[34]  Antígona Martínez,et al.  Nonlinear temporal dynamics of the cerebral blood flow response , 2001, Human brain mapping.

[35]  Karl J. Friston,et al.  Variational free energy and the Laplace approximation , 2007, NeuroImage.

[36]  Karl J. Friston,et al.  Electromagnetic source reconstruction for group studies , 2008, NeuroImage.

[37]  D. Noll,et al.  Nonlinear Aspects of the BOLD Response in Functional MRI , 1998, NeuroImage.

[38]  Karl J. Friston,et al.  Canonical Source Reconstruction for MEG , 2007, Comput. Intell. Neurosci..

[39]  Matthew C. Keller,et al.  Increased sensitivity in neuroimaging analyses using robust regression , 2005, NeuroImage.

[40]  Ivan Toni,et al.  Parieto-Frontal Connectivity during Visually Guided Grasping , 2007, The Journal of Neuroscience.

[41]  J. O'Doherty,et al.  The Role of the Ventromedial Prefrontal Cortex in Abstract State-Based Inference during Decision Making in Humans , 2006, The Journal of Neuroscience.

[42]  Karl J. Friston,et al.  Comparing dynamic causal models , 2004, NeuroImage.

[43]  I. J. Myung,et al.  When a good fit can be bad , 2002, Trends in Cognitive Sciences.

[44]  T. Ferguson A Bayesian Analysis of Some Nonparametric Problems , 1973 .

[45]  Karl J. Friston,et al.  Comparing hemodynamic models with DCM , 2007, NeuroImage.

[46]  Jörn Diedrichsen,et al.  Detecting and adjusting for artifacts in fMRI time series data , 2005, NeuroImage.