THE CONTROL OF THE FALSE DISCOVERY RATE IN MULTIPLE TESTING UNDER DEPENDENCY

Benjamini and Hochberg suggest that the false discovery rate may be the appropriate error rate to control in many applied multiple testing problems. A simple procedure was given there as an FDR controlling procedure for independent test statistics and was shown to be much more powerful than comparable procedures which control the traditional familywise error rate. We prove that this same procedure also controls the false discovery rate when the test statistics have positive regression dependency on each of the test statistics corresponding to the true null hypotheses. This condition for positive dependency is general enough to cover many problems of practical interest, including the comparisons of many treatments with a single control, multivariate normal test statistics with positive correlation matrix and multivariate t. Furthermore, the test statistics may be discrete, and the tested hypotheses composite without posing special difficulties. For all other forms of dependency, a simple conservative modification of the procedure controls the false discovery rate. Thus the range of problems for which a procedure with proven FDR control can be offered is greatly increased. 1.1. Simultaneous hypotheses testing. The control of the increased type I error when testing simultaneously a family of hypotheses is a central issue in the area of multiple comparisons. Rarely are we interested only in whether all hypotheses are jointly true or not, which is the test of the intersection null hypothesis. In most applications, we infer about the individual hypotheses, realizing that some of the tested hypotheses are usually true—we hope not all—and some are not. We wish to decide which ones are not true, indicating (statistical) discoveries. An important such problem is that of multiple endpoints in a clinical trial: a new treatment is compared with an existing one in terms of a large number of potential benefits (endpoints).

[1]  E. Lehmann Some Concepts of Dependence , 1966 .

[2]  P. Seeger A Note on a Method for the Analysis of Significances en masse , 1968 .

[3]  A. Leviton,et al.  Deficits in psychologic and classroom performance of children with elevated dentine lead levels. , 1979, The New England journal of medicine.

[4]  S. Holm A Simple Sequentially Rejective Multiple Test Procedure , 1979 .

[5]  James H. Torrie,et al.  Principles and procedures of statistics: a biometrical approach (2nd ed) , 1980 .

[6]  S. Karlin,et al.  Classes of orderings of measures and related correlation inequalities. I. Multivariate totally positive distributions , 1980 .

[7]  Samuel Karlin,et al.  Total Positivity Properties of Absolute Value Multinormal Variables with Applications to Confidence Interval Estimates and Related Probabilistic Inequalities , 1981 .

[8]  P. Rosenbaum Testing the conditional independence and monotonicity assumptions of item response theory , 1984 .

[9]  P. Rosenbaum,et al.  Conditional Association and Unidimensionality in Monotone Latent Variable Models , 1985 .

[10]  R. Simes,et al.  An improved Bonferroni procedure for multiple tests of significance , 1986 .

[11]  Morris L. Eaton,et al.  Lectures on topics in probability inequalities , 1988 .

[12]  Y. Hochberg A sharper Bonferroni procedure for multiple tests of significance , 1988 .

[13]  G. Hommel A stagewise rejective multiple test procedure based on a modified Bonferroni test , 1988 .

[14]  E. Lander,et al.  Mapping mendelian factors underlying quantitative traits using RFLP linkage maps. , 1989, Genetics.

[15]  Gerhard Hommel,et al.  Multiple Hypotheses Testing , 1993 .

[16]  T. Powles,et al.  Double-blind controlled trial of oral clodronate in patients with bone metastases from breast cancer. , 1993, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[17]  J. Booth,et al.  Resampling-Based Multiple Testing. , 1994 .

[18]  M. Barinaga From fruit flies, rats, mice: evidence of genetic influence. , 1994, Science.

[19]  E. Lander,et al.  Genetic dissection of complex traits: guidelines for interpreting and reporting linkage results , 1995, Nature Genetics.

[20]  Yosef Hochberg,et al.  Extensions of multiple testing procedures based on Simes' test , 1995 .

[21]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[22]  Y. Benjamini,et al.  Adaptive thresholding of wavelet coefficients , 1996 .

[23]  S. Sarkar,et al.  The Simes Method for Multiple Hypothesis Testing with Positively Dependent Test Statistics , 1997 .

[24]  Y. Benjamini,et al.  Multiple Hypotheses Testing with Weights , 1997 .

[25]  J. Weller,et al.  A new approach to the problem of multiple comparisons in the genetic dissection of complex traits. , 1998, Genetics.

[26]  S. Sarkar Some probability inequalities for ordered $\rm MTP\sb 2$ random variables: a proof of the Simes conjecture , 1998 .

[27]  Y. Benjamini,et al.  A step-down multiple hypotheses testing procedure that controls the false discovery rate under independence , 1999 .

[28]  Meinhard Kieser,et al.  Procedures for testing multiple endpoints in clinical trials: An overview , 1999 .

[29]  Y. Benjamini,et al.  Resampling-based false discovery rate controlling multiple test procedures for correlated test statistics , 1999 .

[30]  P. Sen Some remarks on Simes-type multiple tests of significance , 1999 .

[31]  P. Sen Multiple comparisons in interim analysis , 1999 .

[32]  Charles W. Dunnett,et al.  Stepwise multiple test procedures with biometric applications , 1999 .

[33]  Y. Benjamini,et al.  On the Adaptive Control of the False Discovery Rate in Multiple Testing With Independent Statistics , 2000 .

[34]  J. Troendle,et al.  Stepwise normal theory multiple test procedures controlling the false discovery rate , 2000 .