Smoothness without Smoothing: Why Gaussian Naive Bayes Is Not Naive for Multi-Subject Searchlight Studies

Spatial smoothness is helpful when averaging fMRI signals across multiple subjects, as it allows different subjects' corresponding brain areas to be pooled together even if they are slightly misaligned. However, smoothing is usually not applied when performing multivoxel pattern-based analyses (MVPA), as it runs the risk of blurring away the information that fine-grained spatial patterns contain. It would therefore be desirable, if possible, to carry out pattern-based analyses which take unsmoothed data as their input but which produce smooth images as output. We show here that the Gaussian Naive Bayes (GNB) classifier does precisely this, when it is used in “searchlight” pattern-based analyses. We explain why this occurs, and illustrate the effect in real fMRI data. Moreover, we show that analyses using GNBs produce results at the multi-subject level which are statistically robust, neurally plausible, and which replicate across two independent data sets. By contrast, SVM classifiers applied to the same data do not generate a replication, even if the SVM-derived searchlight maps have smoothing applied to them. An additional advantage of GNB classifiers for searchlight analyses is that they are orders of magnitude faster to compute than more complex alternatives such as SVMs. Collectively, these results suggest that Gaussian Naive Bayes classifiers may be a highly non-naive choice for multi-subject pattern-based fMRI studies.

[1]  Anil K. Jain,et al.  Small Sample Size Effects in Statistical Pattern Recognition: Recommendations for Practitioners , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Karl J. Friston,et al.  Statistical parametric maps in functional imaging: A general linear approach , 1994 .

[3]  David R. Musicant,et al.  Lagrangian Support Vector Machines , 2001, J. Mach. Learn. Res..

[4]  Michael I. Jordan,et al.  On Discriminative vs. Generative Classifiers: A comparison of logistic regression and naive Bayes , 2001, NIPS.

[5]  D. Hand,et al.  Idiot's Bayes—Not So Stupid After All? , 2001 .

[6]  R. Savoy Functional Magnetic Resonance Imaging (fMRI) , 2002 .

[7]  David D. Cox,et al.  Functional magnetic resonance imaging (fMRI) “brain reading”: detecting and classifying distributed patterns of fMRI activity in human visual cortex , 2003, NeuroImage.

[8]  Harry Zhang,et al.  The Optimality of Naive Bayes , 2004, FLAIRS.

[9]  Pedro M. Domingos,et al.  On the Optimality of the Simple Bayesian Classifier under Zero-One Loss , 1997, Machine Learning.

[10]  Tom M. Mitchell,et al.  Learning to Decode Cognitive States from Brain Images , 2004, Machine Learning.

[11]  P. Bickel,et al.  Some theory for Fisher''s linear discriminant function , 2004 .

[12]  Lei Zhang,et al.  Machine learning for clinical diagnosis from functional magnetic resonance imaging , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[13]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[14]  Rainer Goebel,et al.  Information-based functional brain mapping. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[15]  Jean-Baptiste Poline,et al.  Analysis of a large fMRI cohort: Statistical and methodological issues for group analyses , 2007, NeuroImage.

[16]  Rajeev D. S. Raizada,et al.  Selective Amplification of Stimulus Differences during Categorical Processing of Speech , 2007, Neuron.

[17]  Emily B. Myers Dissociable effects of phonetic competition and category typicality in a phonetic categorization task: An fMRI investigation , 2007, Neuropsychologia.

[18]  P. Hluštík,et al.  Effects of spatial smoothing on fMRI group inferences. , 2008, Magnetic resonance imaging.

[19]  D. M. Titterington,et al.  Comment on “On Discriminative vs. Generative Classifiers: A Comparison of Logistic Regression and Naive Bayes” , 2008, Neural Processing Letters.

[20]  Arthur Gretton,et al.  Comparison of Pattern Recognition Methods in Classifying High-resolution Bold Signals Obtained at High Magnetic Field in Monkeys , 2008 .

[21]  Stefan Bode,et al.  Decoding sequential stages of task preparation in the human brain , 2009, NeuroImage.

[22]  Dirk B. Walther,et al.  Natural Scene Categories Revealed in Distributed Patterns of Activity in the Human Brain , 2009, The Journal of Neuroscience.

[23]  Emily B. Myers,et al.  Inferior Frontal Regions Underlie the Perception of Phonetic Category Invariance , 2009, Psychological science.

[24]  Kenneth A. Norman,et al.  Recollection, Familiarity, and Cortical Reinstatement: A Multivoxel Pattern Analysis , 2009, Neuron.

[25]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[26]  Nikolaus Kriegeskorte,et al.  Comparison of multivariate classifiers and response normalizations for pattern-information fMRI , 2010, NeuroImage.

[27]  Geoffrey E. Hinton,et al.  Comparing Classification Methods for Longitudinal fMRI Studies , 2010, Neural Computation.

[28]  Soyoung Q. Park,et al.  The neural code of reward anticipation in human orbitofrontal cortex , 2010, Proceedings of the National Academy of Sciences.

[29]  Ashutosh Kumar Singh,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2010 .

[30]  Hans P. Op de Beeck,et al.  Against hyperacuity in brain reading: Spatial smoothing does not hurt multivariate fMRI analyses? , 2010, NeuroImage.

[31]  Lloyd T. Elliott,et al.  Cortical surface-based searchlight decoding , 2011, NeuroImage.

[32]  Jakob Heinzle,et al.  Decoding different roles for vmPFC and dlPFC in multi-attribute decision making , 2011, NeuroImage.

[33]  Tom M. Mitchell,et al.  Commonality of neural representations of words and pictures , 2011, NeuroImage.

[34]  Francisco Pereira,et al.  Information mapping with pattern classifiers: A comparative study , 2011, NeuroImage.

[35]  Richard Granger,et al.  Categorical Speech Processing in Broca's Area: An fMRI Study Using Multivariate Pattern-Based Analysis , 2012, The Journal of Neuroscience.

[36]  Dinggang Shen,et al.  Multiscale Adaptive Generalized Estimating Equations for Longitudinal Neuroimaging Data ☆ , 2022 .