MULTIPLE TESTING VIA FDRL FOR LARGE SCALE IMAGING DATA

The multiple testing procedure plays an important role in detecting the presence of spatial signals for large scale imaging data. Typically, the spatial signals are sparse but clustered. This paper provides empirical evidence that for a range of commonly used control levels, the conventional FDR procedure can lack the ability to detect statistical significance, even if the p-values under the true null hypotheses are independent and uniformly distributed; more generally, ignoring the neighboring information of spatially structured data will tend to diminish the detection effectiveness of the FDR procedure. This paper first introduces a scalar quantity to characterize the extent to which the "lack of identification phenomenon" (LIP) of the FDR procedure occurs. Second, we propose a new multiple comparison procedure, called FDR(L), to accommodate the spatial information of neighboring p-values, via a local aggregation of p-values. Theoretical properties of the FDR(L) procedure are investigated under weak dependence of p-values. It is shown that the FDR(L) procedure alleviates the LIP of the FDR procedure, thus substantially facilitating the selection of more stringent control levels. Simulation evaluations indicate that the FDR(L) procedure improves the detection sensitivity of the FDR procedure with little loss in detection specificity. The computational simplicity and detection effectiveness of the FDR(L) procedure are illustrated through a real brain fMRI dataset.

[1]  G. Casella,et al.  Statistical Inference , 2003, Encyclopedia of Social Network Analysis and Mining.

[2]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[3]  R W Cox,et al.  AFNI: software for analysis and visualization of functional magnetic resonance neuroimages. , 1996, Computers and biomedical research, an international journal.

[4]  J. Marron,et al.  Edge-Preserving Smoothers for Image Processing , 1998 .

[5]  A. V. D. Vaart Asymptotic Statistics: Delta Method , 1998 .

[6]  Alan C. Evans,et al.  A general statistical analysis for fMRI data , 2000, NeuroImage.

[7]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[8]  Y. Benjamini,et al.  THE CONTROL OF THE FALSE DISCOVERY RATE IN MULTIPLE TESTING UNDER DEPENDENCY , 2001 .

[9]  Stephen M. Smith,et al.  Temporal Autocorrelation in Univariate Linear Modeling of FMRI Data , 2001, NeuroImage.

[10]  D. Le Bihan,et al.  Diffusion tensor imaging: Concepts and applications , 2001, Journal of magnetic resonance imaging : JMRI.

[11]  John D. Storey A direct approach to false discovery rates , 2002 .

[12]  L. Wasserman,et al.  Operating characteristics and extensions of the false discovery rate procedure , 2002 .

[13]  Thomas E. Nichols,et al.  Controlling the familywise error rate in functional neuroimaging: a comparative review , 2003, Statistical methods in medical research.

[14]  Joseph P. Romano,et al.  Generalizations of the familywise error rate , 2005, math/0507420.

[15]  S. Dudoit,et al.  Multiple Hypothesis Testing in Microarray Experiments , 2003 .

[16]  Mark W. Woolrich,et al.  Advances in functional and structural MR image analysis and implementation as FSL , 2004, NeuroImage.

[17]  John D. Storey,et al.  Strong control, conservative point estimation and simultaneous conservative consistency of false discovery rates: a unified approach , 2004 .

[18]  L. Wasserman,et al.  A stochastic process approach to false discovery control , 2004, math/0406519.

[19]  E. L. Lehmann,et al.  On optimality of stepdown and stepup multiple test procedures , 2005 .

[20]  A. Owen Variance of the number of false discoveries , 2005 .

[21]  S. Sarkar False discovery and false nondiscovery rates in single-step multiple testing procedures , 2006, math/0605607.

[22]  Jianqing Fan,et al.  To How Many Simultaneous Hypothesis Tests Can Normal, Student's t or Bootstrap Calibration Be Applied? , 2006, math/0701003.

[23]  L. Wasserman,et al.  False discovery control with p-value weighting , 2006 .

[24]  Y. Benjamini,et al.  False Discovery Rates for Spatial Signals , 2007 .

[25]  W. Wu,et al.  On false discovery control under dependence , 2008, 0803.1971.

[26]  Jeffrey T Leek,et al.  A general framework for multiple testing dependence , 2008, Proceedings of the National Academy of Sciences.

[27]  M. Newton Large-Scale Simultaneous Hypothesis Testing: The Choice of a Null Hypothesis , 2008 .

[28]  Tao Yu,et al.  Semiparametric detection of significant activation for brain fMRI , 2008, 0808.0989.