Permutation distributions of fMRI classification do not behave in accord with central limit theorem

Applying Monte Carlo method on Fisher's exact test is a prominent choice in estimating an existent statistical effect in large data. When used to analyze classification results, the method, which is widely known as permutation testing, works by testing the null hypothesis after generating a permutation distribution (PD) of classification accuracies/errors that is centered around chance-level. In principle, these PDs should behave in accord with the central limit theorem (CLT) if the independence condition in the cross-validation classification error (test statistic) is fulfilled. Permutation testing has been widely used in pattern classification applied to neuroimaging studies to eradicate chance performance. In this work, we used Anderson-Darling test to evaluate the accordance level of PDs of classification accuracies to normality expected under CLT. An exhaustive simulation study was carried out using functional magnetic resonance imaging data that were collected while human subjects responded to visual stimulation paradigms and the following classifiers were considered: support vector machines, logistic regression, ridge logistic regression, Gaussian Naive Bayes, sparse multinomial logistic regression, and artificial neural networks. Our results showed that while the standard normal distribution does not adequately fit to PDs, it tends to fit well when the mean classification accuracy averaged over a set of independent classifiers is considered. We also found that across-run lk motion correction of the fMRI data weakens the accordance of PDs with CLT and this phenomenon could be due to the across runs dependence resulting from motion correction.

[1]  A. Ishai,et al.  Distributed and Overlapping Representations of Faces and Objects in Ventral Temporal Cortex , 2001, Science.

[2]  C. Keysers,et al.  An introduction to anatomical ROI-based fMRI classification analysis , 2009, Brain Research.

[3]  Thomas E. Nichols,et al.  Nonparametric permutation tests for functional neuroimaging: A primer with examples , 2002, Human brain mapping.

[4]  Sean M. Polyn,et al.  Beyond mind-reading: multi-voxel pattern analysis of fMRI data , 2006, Trends in Cognitive Sciences.

[5]  Mohammed Sadeq Al-Rawi,et al.  On using permutation tests to estimate the classification significance of functional magnetic resonance imaging data , 2012, Neurocomputing.

[6]  Thorsten Joachims,et al.  Making large-scale support vector machine learning practical , 1999 .

[7]  T. W. Anderson,et al.  Asymptotic Theory of Certain "Goodness of Fit" Criteria Based on Stochastic Processes , 1952 .

[8]  Helmut Schäfer,et al.  PERMORY: an LD-exploiting permutation test algorithm for powerful genome-wide association testing , 2010, Bioinform..

[9]  Polina Golland,et al.  Permutation Tests for Classification: Towards Statistical Significance in Image-Based Studies , 2003, IPMI.

[10]  M. Kendall Statistical Methods for Research Workers , 1937, Nature.

[11]  Tom M. Mitchell,et al.  Machine learning classifiers and fMRI: A tutorial overview , 2009, NeuroImage.

[12]  Karl J. Friston Sample size and the fallacies of classical inference , 2013, NeuroImage.

[13]  Gemma C. Garriga,et al.  Permutation Tests for Studying Classifier Performance , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[14]  M. Stephens EDF Statistics for Goodness of Fit and Some Comparisons , 1974 .

[15]  T. Carlson,et al.  Patterns of Activity in the Categorical Representations of Objects , 2003 .