Confidence intervals for a population proportion based on a ranked set sample

Abstract This article examines several approximate methods to formulate confidence intervals for a single population proportion based on a ranked set sample (RSS). All of the intervals correspond to certain test statistics. That is, the confidence intervals are obtained by inverting the Wald, Wilson, score, and likelihood ratio tests. The Wald and Wilson intervals are based on the asymptotic distributions of two point estimators; the method of moments (MM) estimator and the maximum likelihood (ML) estimator. Continuity corrected versions of these intervals are also discussed. The R statistical software program is used to both calculate and evaluate the proposed intervals. For instance, an actual data set is analyzed for the sake of illustration. Furthermore, a simulation study which compares the intervals via expected widths and coverage probabilities is presented. The study indicates that the confidence intervals derived from the ML methodology generally outperform those based on MM procedures. Additionally, the Wilson and score intervals do not yield the same results under RSS as they do under simple random sampling. Lastly, the ML-based Wilson interval (without continuity correction) is recommended for use in practice.

[1]  Charles Taillie,et al.  5 Ranked set sampling , 1994, Environmental Statistics.

[2]  G. P. Patil,et al.  Ranked Set Sampling: a Bibliography , 2004, Environmental and Ecological Statistics.

[3]  G. P. Patil,et al.  Ranked set sampling: an annotated bibliography , 1995, Environmental and Ecological Statistics.

[4]  Douglas A. Wolfe,et al.  Unbalanced Ranked Set Sampling for Estimating A Population Proportion Under Imperfect Rankings , 2009 .

[5]  G. McIntyre,et al.  A method for unbiased selective sampling, using ranked sets , 1952 .

[6]  Ganapati P. Patil,et al.  Ranked set sampling , 2006 .

[7]  Paul H. Kvam,et al.  Ranked set sampling based on binary water quality data with covariates , 2003 .

[8]  J. T. Terpstra On Estimating a Population Proportion via Ranked Set Sampling , 2004 .

[9]  G. Casella,et al.  Statistical Inference , 2003, Encyclopedia of Social Network Analysis and Mining.

[10]  E. B. Wilson Probable Inference, the Law of Succession, and Statistical Inference , 1927 .

[11]  Haiying Chen,et al.  Ranked set sampling for efficient estimation of a population proportion. , 2005, Statistics in medicine.

[12]  R. Newcombe Two-sided confidence intervals for the single proportion: comparison of seven methods. , 1998, Statistics in medicine.

[13]  Robert V. Hogg,et al.  Introduction to Mathematical Statistics. , 1966 .

[14]  Roger W. Johnson Fitting Percentage of Body Fat to Simple Body Measurements: College Women , 1996, Journal of Statistics and Data Science Education.

[15]  Lora L. Bohn A review of nonparametric ranked-set sampling methodology , 1996 .

[16]  Omer Ozturk,et al.  Ranked Set Sampling: Theory and Applications , 2005 .

[17]  Ralph A. Bradley,et al.  The asymptotic properties of ML estimators when sampling from associated populations , 1962 .

[18]  Jeff Terpstra,et al.  Optimal rank set sampling estimates for a population proportion , 2005 .

[19]  Douglas A Wolfe,et al.  Unbalanced ranked set sampling for estimating a population proportion. , 2006, Biometrics.

[20]  A. Wald Tests of statistical hypotheses concerning several parameters when the number of observations is large , 1943 .

[21]  Ross Ihaka,et al.  Gentleman R: R: A language for data analysis and graphics , 1996 .

[22]  Z. Bai,et al.  Ranked set sampling , 2004 .

[23]  Jeff T Terpstra,et al.  Concomitant-based rank set sampling proportion estimates. , 2004, Statistics in medicine.