The p‐filter: multilayer false discovery rate control for grouped hypotheses

In many practical applications of multiple testing, there are natural ways to partition the hypotheses into groups by using the structural, spatial or temporal relatedness of the hypotheses, and this prior knowledge is not used in the classical Benjamini–Hochberg procedure for controlling the false discovery rate (FDR). When one can define (possibly several) such partitions, it may be desirable to control the group FDR simultaneously for all partitions (as special cases, the ‘finest’ partition divides the n hypotheses into n groups of one hypothesis each, and this corresponds to controlling the usual notion of FDR, whereas the ‘coarsest’ partition puts all n hypotheses into a single group, and this corresponds to testing the global null hypothesis). We introduce the p‐filter, which takes as input a list of n p‐values and M⩾1 partitions of hypotheses, and produces as output a list of n or fewer discoveries such that the group FDR is provably simultaneously controlled for all partitions. Importantly, since the partitions are arbitrary, our procedure can also handle multiple partitions which are non‐hierarchical. The p‐filter generalizes two classical procedures—when M=1, choosing the finest partition into n singletons, we exactly recover the Benjamini–Hochberg procedure, whereas, choosing instead the coarsest partition with a single group of size n, we exactly recover the Simes test for the global null hypothesis. We verify our findings with simulations that show how this technique can not only lead to the aforementioned multilayer FDR control but also lead to improved precision of rejected hypotheses. We present some illustrative results from an application to a neuroscience problem with functional magnetic resonance imaging data, where hypotheses are explicitly grouped according to predefined regions of interest in the brain, thus allowing the scientist to employ field‐specific prior knowledge explicitly and flexibly.

[1]  R. Simes,et al.  An improved Bonferroni procedure for multiple tests of significance , 1986 .

[2]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[3]  Y. Benjamini,et al.  THE CONTROL OF THE FALSE DISCOVERY RATE IN MULTIPLE TESTING UNDER DEPENDENCY , 2001 .

[4]  Thomas E. Nichols,et al.  Thresholding of Statistical Maps in Functional Neuroimaging Using the False Discovery Rate , 2002, NeuroImage.

[5]  N. Meinshausen Hierarchical testing of variable importance , 2008 .

[6]  D. Yekutieli Hierarchical False Discovery Rate–Controlling Methodology , 2008 .

[7]  William A. Cunningham,et al.  Type I and Type II error concerns in fMRI research: re-balancing the scale. , 2009, Social cognitive and affective neuroscience.

[8]  William A. Cunningham,et al.  Tools of the Trade Type I and Type II error concerns in fMRI research : rebalancing the scale , 2009 .

[9]  Harrison H. Zhou,et al.  False Discovery Rate Control With Groups , 2010, Journal of the American Statistical Association.

[10]  Brian Murphy,et al.  Simultaneously Uncovering the Patterns of Brain Regions Involved in Different Story Reading Subprocesses , 2014, PloS one.

[11]  Yoav Benjamini,et al.  Selective inference on multiple families of hypotheses , 2014 .

[12]  Jack L. Gallant,et al.  Pycortex: an interactive surface visualizer for fMRI , 2015, Front. Neuroinform..

[13]  Wenguang Sun,et al.  False discovery control in large‐scale spatial multiple testing , 2015, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[14]  Aaditya Ramdas,et al.  REGULARIZED BRAIN READING WITH SHRINKAGE AND SMOOTHING. , 2014, The annals of applied statistics.

[15]  Christine B. Peterson,et al.  Many Phenotypes Without Many False Discoveries: Error Controlling Strategies for Multitrait Association Studies , 2015, Genetic epidemiology.