Multiple Hypothesis Tests Controlling Generalized Error Rates for Sequential Data

The $\gamma$-FDP and $k$-FWER multiple testing error metrics, which are tail probabilities of the respective error statistics, have become popular recently as less-stringent alternatives to the FDR and FWER. We propose general and flexible stepup and stepdown procedures for testing multiple hypotheses about sequential (or streaming) data that simultaneously control both the type I and II versions of $\gamma$-FDP, or $k$-FWER. The error control holds regardless of the dependence between data streams, which may be of arbitrary size and shape. All that is needed is a test statistic for each data stream that controls the conventional type I and II error probabilities, and no information or assumptions are required about the joint distribution of the statistics or data streams. The procedures can be used with sequential, group sequential, truncated, or other sampling schemes. We give recommendations for the procedures' implementation including closed-form expressions for the needed critical values in some commonly-encountered testing situations. The proposed sequential procedures are compared with each other and with comparable fixed sample size procedures in the context of strongly positively correlated Gaussian data streams. For this setting we conclude that both the stepup and stepdown sequential procedures provide substantial savings over the fixed sample procedures in terms of expected sample size, and the stepup procedure performs slightly but consistently better than the stepdown for $\gamma$-FDP control, with the relationship reversed for $k$-FWER control.

[1]  T. Lai,et al.  Statistical Models and Methods for Financial Markets , 2008 .

[2]  P. Visscher,et al.  Estimating missing heritability for disease from genome-wide association studies. , 2011, American journal of human genetics.

[3]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[4]  Wenge Guo,et al.  Further results on controlling the false discovery proportion , 2014, 1406.0266.

[5]  Michael Baron,et al.  Sequential Bonferroni Methods for Multiple Hypothesis Testing with Strong Control of Family-Wise Error Rates I and II , 2012 .

[6]  D. Siegmund Sequential Analysis: Tests and Confidence Intervals , 1985 .

[7]  J. Salzman,et al.  Statistical properties of an early stopping rule for resampling-based multiple testing. , 2012, Biometrika.

[8]  Shyamal K. De,et al.  Step-up and step-down methods for testing multiple hypotheses in sequential experiments , 2012 .

[9]  Y. Benjamini,et al.  THE CONTROL OF THE FALSE DISCOVERY RATE IN MULTIPLE TESTING UNDER DEPENDENCY , 2001 .

[10]  Azeem M. Shaikh,et al.  On stepdown control of the false discovery proportion , 2006 .

[11]  Kenneth Rice,et al.  FDR and Bayesian Multiple Comparisons Rules , 2006 .

[12]  H. Chernoff Sequential Analysis and Optimal Design , 1987 .

[13]  Mandy Berg,et al.  Introduction To Statistical Theory , 2016 .

[14]  Y. Mei Efficient scalable schemes for monitoring a large number of data streams , 2010 .

[15]  B. Turnbull,et al.  Group Sequential Methods with Applications to Clinical Trials , 1999 .

[16]  Sanat K. Sarkar,et al.  Generalizing Simes' test and Hochberg's stepup procedure , 2008, 0803.1961.

[17]  Jay Bartroff,et al.  Sequential Experimentation in Clinical Trials: Design and Analysis , 2012 .

[18]  Joseph P. Romano,et al.  Stepup procedures for control of generalizations of the familywise error rate , 2006, math/0611266.

[19]  X. Rong Li,et al.  Sequential detection of targets in multichannel systems , 2003, IEEE Trans. Inf. Theory.

[20]  Jay Bartroff,et al.  Multistage Tests of Multiple Hypotheses , 2010, 1107.1919.

[21]  David H. Baillie,et al.  Multivariate Acceptance Sampling—Some Applications to Defence Procurement , 1987 .

[22]  Nicolle Clements,et al.  APPLYING MULTIPLE TESTING PROCEDURES TO DETECT CHANGE IN EAST AFRICAN VEGETATION , 2014, 1405.0785.

[23]  J. Bartroff,et al.  Sequential Tests of Multiple Hypotheses Controlling Type I and II Familywise Error Rates. , 2013, Journal of statistical planning and inference.

[24]  S. Holm A Simple Sequentially Rejective Multiple Test Procedure , 1979 .

[25]  Johanna S. Hardin,et al.  A method for generating realistic correlation matrices , 2011, 1106.5834.

[26]  J. Bartroff,et al.  Sequential tests of multiple hypotheses controlling false discovery and nondiscovery rates , 2013, Sequential analysis.

[27]  S. Sarkar STEPUP PROCEDURES CONTROLLING GENERALIZED FWER AND GENERALIZED FDR , 2007, 0803.2934.

[28]  Joseph P. Romano,et al.  Generalizations of the familywise error rate , 2005, math/0507420.

[29]  Hui Jiang,et al.  Statistical Modeling of RNA-Seq Data. , 2011, Statistical science : a review journal of the Institute of Mathematical Statistics.