Simultaneous Confidence Regions Corresponding to Holm's Step‐Down Procedure and Other Closed‐Testing Procedures

Holm's (1979) step-down multiple-testing procedure (MTP) is appealing for its flexibility, transparency, and general validity, but the derivation of corresponding simultaneous confidence regions has remained an unsolved problem. This article provides such confidence regions. In fact, simultanenous confidence regions are provided for any MTP in the class of short-cut consonant closed-testing procedures based on marginal p -values and weighted Bonferroni tests for intersection hypotheses considered by Hommel, Bretz and Maurer (2007). In addition to Holm's MTP, this class includes the fixed-sequence MTP, recently proposed gatekeeping MTPs, and the fallback MTP. The simultaneous confidence regions are generally valid if underlying marginal p -values and corresponding marginal confidence regions (assumed to be available) are valid. The marginal confidence regions and estimated quantities are not assumed to be of any particular kinds/dimensions. Compared to the rejections made by the MTP for the family of null hypotheses H under consideration, the proposed confidence regions provide extra free information. In particular, with Holm's MTP, such extra information is provided: for all nonrejected H s, in case not all H s are rejected; or for certain (possibly all) H s, in case all H s are rejected. In case not all H s are rejected, no extra information is provided for rejected H s. This drawback seems however difficult to overcome. Illustrations concerning clinical studies are given.

[1]  J. Hsu Multiple Comparisons: Theory and Methods , 1996 .

[2]  Iris Pigeot-Kübler Multiple Comparisons Theory and Methods: Hsu, Jason C. (1996): London: Chapman & Hall, pp. 277, ISBN 0-412-98281-1, [pound sign] 35.00 , 1997 .

[3]  Ľ.,et al.  Bioequivalence Trials , Intersection-Union Tests , andEquivalence Con dence , 2006 .

[4]  Jason C. Hsu,et al.  On Confidence Sets in Multiple Comparisons , 1988 .

[5]  E. Lehmann,et al.  Nonparametrics: Statistical Methods Based on Ranks , 1976 .

[6]  Anthony J. Hayter,et al.  On the Relationship between Stepwise Decision Procedures and Confidence Sets , 1994 .

[7]  A Agresti,et al.  On Small‐Sample Confidence Intervals for Parameters in Discrete Distributions , 2001, Biometrics.

[8]  Joseph P. Romano,et al.  Generalizations of the familywise error rate , 2005, math/0507420.

[9]  G. Hommel,et al.  Powerful short‐cuts for multiple testing procedures with special reference to gatekeeping strategies , 2007, Statistics in medicine.

[10]  A. Tamhane,et al.  Multiple Comparison Procedures , 2009 .

[11]  S. S. Young,et al.  Resampling-Based Multiple Testing: Examples and Methods for p-Value Adjustment , 1993 .

[12]  Martin Posch,et al.  Multiple Testing for Identifying Effective and Safe Treatments , 2001 .

[13]  S. Holm A Simple Sequentially Rejective Multiple Test Procedure , 1979 .

[14]  Eve Bofinger Expanded confidence intervals, one-sided tests, and equivalence testing. , 1992, Journal of biopharmaceutical statistics.

[15]  H. Toutenburg,et al.  Lehmann, E. L., Nonparametrics: Statistical Methods Based on Ranks, San Francisco. Holden‐Day, Inc., 1975. 480 S., $ 22.95 . , 1977 .

[16]  K. Strassburger,et al.  The partitioning principle: a powerful tool in multiple decision theory , 2002 .

[17]  A. Tamhane,et al.  Multiple Comparison Procedures , 1989 .

[18]  Christoph Gerlinger,et al.  On Testing Simultaneously Non‐inferiority in Two Multiple Primary Endpoints and Superiority in at Least One of Them , 2006, Biometrical journal. Biometrische Zeitschrift.

[19]  J. Aitchison Confidence‐Region Tests , 1964 .

[20]  Yogendra P. Chaubey Resampling-Based Multiple Testing: Examples and Methods for p-Value Adjustment , 1993 .

[21]  Eve Bofinger Step Down Procedures for Comparison With a Control , 1987 .

[22]  R. Berger,et al.  Bioequivalence trials, intersection-union tests and equivalence confidence sets , 1996 .

[23]  L. Hothorn,et al.  Testing strategies in multi-dose experiments including active control. , 1998, Statistics in medicine.