Accelerating permutation testing in voxel-wise analysis through subspace tracking: A new plugin for SnPM

Abstract Permutation testing is a non‐parametric method for obtaining the max null distribution used to compute corrected p‐values that provide strong control of false positives. In neuroimaging, however, the computational burden of running such an algorithm can be significant. We find that by viewing the permutation testing procedure as the construction of a very large permutation testing matrix, Symbol, one can exploit structural properties derived from the data and the test statistics to reduce the runtime under certain conditions. In particular, we see that Symbol is low‐rank plus a low‐variance residual. This makes Symbol a good candidate for low‐rank matrix completion, where only a very small number of entries of Symbol (Symbol of all entries in our experiments) have to be computed to obtain a good estimate. Based on this observation, we present RapidPT, an algorithm that efficiently recovers the max null distribution commonly obtained through regular permutation testing in voxel‐wise analysis. We present an extensive validation on a synthetic dataset and four varying sized datasets against two baselines: Statistical NonParametric Mapping (SnPM13) and a standard permutation testing implementation (referred as NaivePT). We find that RapidPT achieves its best runtime performance on medium sized datasets (Symbol), with speedups of 1.5× ‐ 38× (vs. SnPM13) and 20x‐1000× (vs. NaivePT). For larger datasets (Symbol) RapidPT outperforms NaivePT (6× ‐ 200×) on all datasets, and provides large speedups over SnPM13 when more than 10000 permutations (2× ‐ 15×) are needed. The implementation is a standalone toolbox and also integrated within SnPM13, able to leverage multi‐core architectures when available. Symbol. No caption available. Symbol. No caption available. Symbol. No caption available. Symbol. No caption available. HighlightsA fast and robust permutation testing approach for multiple hypothesis testing is proposed.Our permutation testing approach is ˜20× faster then current methods.The proposed model is in the developing (soon to be released) version of SnPM.

[1]  James Bennett,et al.  The Netflix Prize , 2007 .

[2]  Thomas E. Nichols,et al.  Nonparametric permutation tests for functional neuroimaging: A primer with examples , 2002, Human brain mapping.

[3]  Stephen P. Boyd,et al.  Rank minimization and applications in system theory , 2004, Proceedings of the 2004 American Control Conference.

[4]  A. Tamhane,et al.  Multiple Comparison Procedures , 1989 .

[5]  Alexander J. Smola,et al.  Neural Information Processing Systems , 1997, NIPS 1997.

[6]  Robert D. Nowak,et al.  Online identification and tracking of subspaces from highly incomplete information , 2010, 2010 48th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[7]  K. Worsley,et al.  Random fields of multivariate test statistics, with applications to shape analysis , 2008, 0803.1708.

[8]  Zuowei Shen,et al.  Robust video denoising using low rank matrix completion , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[9]  Hans Knutsson,et al.  Fast Random Permutation Tests Enable Objective Evaluation of Methods for Single-Subject fMRI Analysis , 2011, Int. J. Biomed. Imaging.

[10]  J D Watson,et al.  Nonparametric Analysis of Statistic Images from Functional Mapping Experiments , 1996, Journal of cerebral blood flow and metabolism : official journal of the International Society of Cerebral Blood Flow and Metabolism.

[11]  Stephen M. Smith,et al.  Faster permutation inference in brain imaging , 2016, NeuroImage.

[12]  Eugene S. Edgington,et al.  Statistical inference : the distribution-free approach , 1969 .

[13]  Benjamin Recht,et al.  A Simpler Approach to Matrix Completion , 2009, J. Mach. Learn. Res..

[14]  Alan C. Evans,et al.  A Three-Dimensional Statistical Analysis for CBF Activation Studies in Human Brain , 1992, Journal of cerebral blood flow and metabolism : official journal of the International Society of Cerebral Blood Flow and Metabolism.

[15]  Thomas E. Nichols,et al.  Holmes and Watson on ‘Sherlock’ , 1998, Journal of cerebral blood flow and metabolism : official journal of the International Society of Cerebral Blood Flow and Metabolism.

[16]  Laura Balzano,et al.  Incremental gradient on the Grassmannian for online foreground and background separation in subsampled video , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  Stephen M. Smith,et al.  Permutation inference for the general linear model , 2014, NeuroImage.

[18]  N C Andreasen,et al.  Tests for Comparing Images Based on Randomization and Permutation Methods , 1996, Journal of cerebral blood flow and metabolism : official journal of the International Society of Cerebral Blood Flow and Metabolism.

[19]  H. Keselman,et al.  Multiple Comparison Procedures , 2005 .

[20]  Vikas Singh,et al.  Speeding up Permutation Testing in Neuroimaging , 2013, NIPS.

[21]  Emmanuel J. Candès,et al.  The Power of Convex Relaxation: Near-Optimal Matrix Completion , 2009, IEEE Transactions on Information Theory.

[22]  Bilwaj Gaonkar,et al.  Analytic estimation of statistical significance maps for support vector machine based multi-variate image analysis and classification , 2013, NeuroImage.

[23]  Karl J. Friston,et al.  A unified statistical approach for determining significant signals in images of cerebral activation , 1996, Human brain mapping.

[24]  Hans Knutsson,et al.  fMRI analysis on the GPU - Possibilities and challenges , 2012, Comput. Methods Programs Biomed..

[25]  Anders Eklund,et al.  Medical image processing on the GPU - Past, present and future , 2013, Medical Image Anal..

[26]  Eugene S. Edgington,et al.  Randomization Tests , 2011, International Encyclopedia of Statistical Science.

[27]  M. Newton,et al.  Multiple Hypothesis Testing by Clustering Treatment Effects , 2007 .

[28]  Marcel J. T. Reinders,et al.  Fewer permutations, more accurate P-values , 2009, Bioinform..

[29]  Pablo A. Parrilo,et al.  Rank-Sparsity Incoherence for Matrix Decomposition , 2009, SIAM J. Optim..

[30]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[31]  K Wienhard,et al.  Performance of a Randomization Test for Single-Subject 15O-Water PET Activation Studies , 1997, Journal of cerebral blood flow and metabolism : official journal of the International Society of Cerebral Blood Flow and Metabolism.

[32]  Pablo A. Parrilo,et al.  Guaranteed Minimum-Rank Solutions of Linear Matrix Equations via Nuclear Norm Minimization , 2007, SIAM Rev..

[33]  Emmanuel J. Candès,et al.  Exact Matrix Completion via Convex Optimization , 2008, Found. Comput. Math..

[34]  V. Marčenko,et al.  DISTRIBUTION OF EIGENVALUES FOR SOME SETS OF RANDOM MATRICES , 1967 .

[35]  Karl-Heinz Jöckel,et al.  Computational Aspects of Monte Carlo Tests , 1984 .

[36]  Thomas E. Nichols,et al.  Controlling the familywise error rate in functional neuroimaging: a comparative review , 2003, Statistical methods in medical research.

[37]  E. Edgington Approximate Randomization Tests , 1969 .

[38]  Emmanuel J. Candès,et al.  Exact Matrix Completion via Convex Optimization , 2009, Found. Comput. Math..

[39]  J. Li,et al.  Adjusting multiple testing in multilocus analyses using the eigenvalues of a correlation matrix , 2005, Heredity.

[40]  J. Cheverud,et al.  A simple correction for multiple comparisons in interval mapping genome scans , 2001, Heredity.

[41]  Karl J. Friston,et al.  Statistical parametric maps in functional imaging: A general linear approach , 1994 .

[42]  A. Edelman Eigenvalues and condition numbers of random matrices , 1988 .

[43]  R. Tibshirani,et al.  On testing the significance of sets of genes , 2006, math/0610667.

[44]  Raj Rao Nadakuditi,et al.  The eigenvalues and eigenvectors of finite, low rank perturbations of large random matrices , 2009, 0910.2120.