Online Control of the False Coverage Rate and False Sign Rate

The false coverage rate (FCR) is the expected ratio of number of constructed confidence intervals (CIs) that fail to cover their respective parameters to the total number of constructed CIs. Procedures for FCR control exist in the offline setting, but none so far have been designed with the online setting in mind. In the online setting, there is an infinite sequence of fixed unknown parameters $\theta_t$ ordered by time. At each step, we see independent data that is informative about $\theta_t$, and must immediately make a decision whether to report a CI for $\theta_t$ or not. If $\theta_t$ is selected for coverage, the task is to determine how to construct a CI for $\theta_t$ such that $\text{FCR} \leq \alpha$ for any $T\in \mathbb{N}$. A straightforward solution is to construct at each step a $(1-\alpha)$ level conditional CI. In this paper, we present a novel solution to the problem inspired by online false discovery rate (FDR) algorithms, which only requires the statistician to be able to construct a marginal CI at any given level. Apart from the fact that marginal CIs are usually simpler to construct than conditional ones, the marginal procedure has an important qualitative advantage over the conditional solution, namely, it allows selection to be determined by the candidate CI itself. We take advantage of this to offer solutions to some online problems which have not been addressed before. For example, we show that our general CI procedure can be used to devise online sign-classification procedures that control the false sign rate (FSR). In terms of power and length of the constructed CIs, we demonstrate that the two approaches have complementary strengths and weaknesses using simulations. Last, all of our methodology applies equally well to online FCR control for prediction intervals, having particular implications for assumption-free selective conformal inference.

[1]  R. Bohrer Multiple Three-Decision Rules for Parametric Signs , 1979 .

[2]  R Bohrer,et al.  An optimal multiple decision rule for signs of parameters. , 1980, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Y. Hochberg Multiple classification rules for signs of parameters , 1986 .

[4]  J. Tukey The Philosophy of Multiple Comparisons , 1991 .

[5]  P. Stark,et al.  Confidence Intervals with More Power to Determine the Sign: Two Ends Constrain the Means , 1998 .

[6]  Francis Tuerlinckx,et al.  Type S error rates for classical and Bayesian single and multiple comparison procedures , 2000, Comput. Stat..

[7]  John D. Storey The positive false discovery rate: a Bayesian interpretation and the q-value , 2003 .

[8]  G. Shafer,et al.  Algorithmic Learning in a Random World , 2005 .

[9]  Y. Benjamini,et al.  False Discovery Rate–Adjusted Multiple Confidence Intervals for Selected Parameters , 2005 .

[10]  W. Gasarch,et al.  The Book Review Column 1 Coverage Untyped Systems Simple Types Recursive Types Higher-order Systems General Impression 3 Organization, and Contents of the Book , 2022 .

[11]  R. Prentice,et al.  Bias-reduced estimators and confidence intervals for odds ratios in genome-wide association studies. , 2008, Biostatistics.

[12]  Dean P. Foster,et al.  α‐investing: a procedure for sequential control of expected false discoveries , 2008 .

[13]  Andrew Gelman,et al.  Why We (Usually) Don't Have to Worry About Multiple Comparisons , 2009, 0907.2478.

[14]  Vladimir Vovk,et al.  A tutorial on conformal prediction , 2007, J. Mach. Learn. Res..

[15]  J. Goeman,et al.  Three‐sided hypothesis testing: Simultaneous testing of superiority, equivalence and inferiority , 2010, Statistics in medicine.

[16]  Y. Benjamini,et al.  Selection Adjusted Confidence Intervals With More Power to Determine the Sign , 2013 .

[17]  Vladimir Vovk,et al.  Conditional validity of inductive conformal predictors , 2012, Machine Learning.

[18]  Larry Wasserman,et al.  Distribution‐free prediction bands for non‐parametric regression , 2014 .

[19]  S. Rosset,et al.  Generalized α‐investing: definitions, optimality results and application to public databases , 2014 .

[20]  D. Yekutieli,et al.  Selective Sign-Determining Multiple Confidence Intervals with FCR Control , 2014, Statistica Sinica.

[21]  Adel Javanmard,et al.  Online Rules for Control of False Discovery Rate and False Discovery Exceedance , 2016, ArXiv.

[22]  Matthew Stephens,et al.  False discovery rates: a new deal , 2016, bioRxiv.

[23]  Martin J. Wainwright,et al.  Online control of the false discovery rate with decaying memory , 2017, NIPS.

[24]  Aaditya Ramdas,et al.  Towards "simultaneous selective inference": post-hoc bounds on the false discovery proportion , 2018, 1803.06790.

[25]  Martin J. Wainwright,et al.  SAFFRON: an adaptive algorithm for online control of the false discovery rate , 2018, ICML.

[26]  Aaditya Ramdas,et al.  ADDIS: an adaptive discarding algorithm for online FDR control with conservative nulls , 2019, NeurIPS.

[27]  The limits of distribution-free conditional predictive inference , 2019, Information and Inference: A Journal of the IMA.

[28]  Michael I. Jordan,et al.  Asynchronous Online Testing of Multiple Hypotheses , 2018, J. Mach. Learn. Res..