High-confidence nonparametric fixed-width uncertainty intervals and applications to projected high-dimensional data and common mean estimation

Abstract Nonparametric two-stage procedures to construct fixed-width confidence intervals are studied to quantify uncertainty. It is shown that the validity of the random central limit theorem (RCLT) accompanied by a consistent and asymptotically unbiased estimator of the asymptotic variance already guarantees consistency and first-order as well as second-order efficiency of the two-stage procedures. This holds under the common asymptotics where the length of the confidence interval tends toward 0 as well as under the novel proposed high-confidence asymptotics where the confidence level tends toward 1. The approach is motivated by and applicable to data analysis from distributed big data with nonnegligible costs of data queries. The following problems are discussed: Fixed-width intervals for the mean, for a projection when observing high-dimensional data, and for the common mean when using nonlinear common mean estimators under order constraints. The procedures are investigated by simulations and illustrated by a real data analysis.

[1]  J. Shao,et al.  A General Theory for Jackknife Variance Estimation , 1989 .

[2]  Jun Shao,et al.  Differentiability of Statistical Functionals and Consistency of the Jackknife , 1993 .

[3]  R. Durrett Probability: Theory and Examples , 1993 .

[4]  A. Steland Vertically Weighted Averages in Hilbert Spaces and Applications to Imaging: Fixed-Sample Asymptotics and Efficient Sequential Two-Stage Estimation , 2015 .

[5]  N. Mukhopadhyay,et al.  A consistent and asymptotically efficient two-stage procedure to construct fixed width confidence intervals for the mean , 1980 .

[6]  H. Robbins,et al.  ON THE ASYMPTOTIC THEORY OF FIXED-WIDTH SEQUENTIAL CONFIDENCE INTERVALS FOR THE MEAN. , 1965 .

[7]  A. Gut The weak law of large numbers for arrays , 1992 .

[8]  I. Ibragimov,et al.  On Sequential Estimation , 1975 .

[9]  Nitis Mukhopadhyay,et al.  Sequential Methods and Their Applications , 2002 .

[10]  N. Mukhopadhyay,et al.  On a Two-Stage Procedure Having Second-Order Properties with Applications , 1999 .

[11]  K. Nair An estimator of the common mean of two normal populations , 1982 .

[12]  C. Stein A Two-Sample Test for a Linear Hypothesis Whose Power is Independent of the Variance , 1945 .

[13]  Makoto Aoshima,et al.  Two-Stage Procedures for High-Dimensional Data , 2011 .

[14]  F. Graybill,et al.  Combining Unbiased Estimators , 1959 .

[15]  A. Steland,et al.  Jackknife variance estimation for general two-sample statistics and applications to common mean estimators under ordered variances , 2019, Japanese Journal of Statistics and Data Science.

[16]  Yuan-Tsung Chang,et al.  Improved estimators for the common mean and ordered means of two normal distributions with ordered variances , 2012 .

[17]  Nobuo Shinozaki,et al.  Estimation of two ordered normal means under modified Pitman nearness criterion , 2015 .

[18]  A. Steland On the accuracy of fixed sample and fixed width confidence intervals based on the vertically weighted average , 2016, 1607.05184.