Confidence intervals for the mean and a percentile based on zero-inflated lognormal data

ABSTRACT The problems of estimating the mean and an upper percentile of a lognormal population with nonnegative values are considered. For estimating the mean of such a population based on data that include zeros, a simple confidence interval (CI), obtained by modifying Tian's [Inferences on the mean of zero-inflated lognormal data: the generalized variable approach. Stat Med. 2005;24:3223-3232] generalized CI, is proposed. A fiducial upper confidence limit (UCL) and a closed-form approximate UCL for an upper percentile are developed. Our simulation studies indicate that the proposed methods are very satisfactory in terms of coverage probability and precision, and better than existing methods at maintaining balanced tail error rates. The proposed CI and UCL are simple and easy to calculate. All the methods considered are illustrated using samples of data involving airborne chlorine concentrations and data on diagnostic test costs.

[1] J. Kesterson, et al. Association of Symptoms of Depression with Diagnostic Test Charges among Older Adults, 1997, Annals of Internal Medicine.

[2] A. Donner, et al. Construction of confidence limits about effect measures: A general approach, 2008, Statistics in Medicine.

[3] Bradley Efron, et al. R. A. Fisher in the 21st century (Invited paper presented at the 1996 R. A. Fisher Lecture), 1998.

[4] A. P. Dawid, et al. The Functional-Model Basis of Fiducial Inference, 1982.

[5] E. S. Pearson, et al. The Use of Confidence or Fiducial Limits Illustrated in the Case of the Binomial, 1934.

[6] K. Krishnamoorthy, et al. One-Sided Tolerance Limits in Balanced and Unbalanced One-Way Random Models Based on Generalized Confidence Intervals, 2004, Technometrics.

[7] K. Krishnamoorthy, et al. Closed-form fiducial confidence intervals for some functions of independent binomial parameters with comparisons, 2017, Statistical Methods in Medical Research.

[8] J. Aitchison. On the Distribution of a Positive Random Variable Having a Discrete Probability Mass at the Origin, 1955.

[9] Todd Iverson, et al. Generalized fiducial inference, 2014.

[10] Thomas Mathew, et al. Models and Confidence Intervals for True Values in Interlaboratory Trials, 2004.

[11] Kam-Wah Tsui, et al. Generalized p-Values in Significance Testing of Hypotheses in the Presence of Nuisance Parameters, 1989.

[12] Guang Yong Zou, et al. Confidence interval estimation for lognormal data with application to health economics, 2009, Comput. Stat. Data Anal.

[13] T. Lai, et al. Least Squares Estimates in Stochastic Regression Models with Applications to Identification and Control of Dynamic Systems, 1982.

[14] K. Krishnamoorthy, et al. Inference for functions of parameters in discrete distributions based on fiducial approach: Binomial and Poisson cases, 2010.

[15] Samaradasa Weerahandi, et al. Generalized Confidence Intervals, 1993.

[16] Thomas Mathew, et al. Tests for individual and population bioequivalence based on generalized p-values, 2003, Statistics in Medicine.

[17] Franklin A. Graybill, et al. 'Exact' Two-Sided Confidence Intervals on Nonnegative Linear Combinations of Variances, 1980.

[18] Lili Tian. Inferences on the mean of zero-inflated lognormal data: the generalized variable approach, 2005, Statistics in Medicine.

[19] Thomas Mathew, et al. Model-based imputation approach for data analysis in the presence of non-detects, 2009, The Annals of Occupational Hygiene.

[20] W. J. Owen, et al. Estimation of the Mean for Lognormal Data Containing Zeroes and Left-Censored Values, with Applications to the Measurement of Worker Exposure to Air Contaminants, 1980.

[21] W. Stevens, et al. Fiducial limits of the parameter of a discontinuous distribution, 1950, Biometrika.

[22] K. Krishnamoorthy, et al. Inferences on correlation coefficients: One-sample, independent and correlated cases, 2007.

[23] Lili Tian, et al. Interval estimation for the mean of lognormal data with excess zeros, 2013.

[24] X. H. Zhou, et al. Confidence Intervals for the Mean of Diagnostic Test Charge Data Containing Zeros, 2000, Biometrics.

[25] Cindy Y. Huo, et al. Simple confidence intervals for lognormal means and their differences with environmental applications, 2009.

[26] T. Mathew, et al. Inferences on the means of lognormal distributions using generalized p-values and generalized confidence intervals, 2003.

[27] L. Brown, et al. Interval Estimation for a Binomial Proportion, 2001.