Estimating overall exposure effects for zero-inflated regression models with application to dental caries

Zero-inflated (ZI) models, which may be derived as a mixture involving a degenerate distribution at value zero and a distribution such as negative binomial (ZINB), have proved useful in dental and other areas of research by accommodating ‘extra’ zeroes in the data. Used in conjunction with generalised linear models, they allow covariate-adjusted inference of an exposure effect on the mixing probability and on the mean for the non-degenerate distribution. However, these models do not directly provide covariate-adjusted inference for the overall exposure effect. Focusing on the ZINB and ZI beta binomial models, we propose an approach that uses model-predicted values for each person under each exposure state. This ‘average predicted value’ method allows covariate-adjusted estimation of flexible functions of exposure group means such as the difference or ratio. A second approach considers a log link for both components of the ZINB to allow a direct approach to estimation. We apply these new methods to a study of dental caries in very low birth weight adolescents. Simulation studies show good bias and robustness properties for both approaches under various scenarios. Robustness diminishes when there is exposure group imbalance for a covariate with a large effect.

[1]  Frank C Curriero,et al.  Mixture models for quantitative HIV RNA data , 2002, Statistical methods in medical research.

[2]  G. Zou Assessment of risks by predicting counterfactuals , 2009, Statistics in medicine.

[3]  J. Hinde,et al.  A Score Test for Testing a Zero‐Inflated Poisson Regression Model Against Zero‐Inflated Negative Binomial Alternatives , 2001, Biometrics.

[4]  Emmanuel Lesaffre,et al.  The zero-inflated negative binomial regression model with correction for misclassification: an example in caries research , 2008, Statistical methods in medical research.

[5]  Robert Tibshirani,et al.  An Introduction to the Bootstrap , 1994 .

[6]  J. Mullahy Specification and testing of some modified count data models , 1986 .

[7]  Diane Lambert,et al.  Zero-inflacted Poisson regression, with an application to defects in manufacturing , 1992 .

[8]  Alan Agresti,et al.  Random effect models for repeated measures of zero-inflated count data , 2005 .

[9]  M. Gail,et al.  Biased estimates of treatment effect in randomized experiments with nonlinear regressions and omitted covariates , 1984 .

[10]  W W Hauck,et al.  Should we adjust for covariates in nonlinear regression analyses of randomized trials? , 1998, Controlled clinical trials.

[11]  Sander Greenland,et al.  Model-based estimation of relative risks and other epidemiologic measures in studies of common outcomes and in case-control studies. , 2004, American journal of epidemiology.

[12]  Y. Cheung,et al.  Zero‐inflated models for regression analysis of count data: a study of growth and development , 2002, Statistics in medicine.

[13]  K. Wannemuehler,et al.  On the Use of Zero-Inflated and Hurdle Models for Modeling Vaccine Adverse Event Count Data , 2006, Journal of biopharmaceutical statistics.

[14]  Liming Xiang,et al.  A Score Test for Overdispersion in Zero-inflated Poisson Mixed Regression Model , 2022 .

[15]  H. Kirchner,et al.  Dental Caries and Enamel Defects in Very Low Birth Weight Adolescents , 2010, Caries Research.

[16]  Irene Lena Hudson,et al.  Finite Mixture, Zero-inflated Poisson and Hurdle models with application to SIDS , 2003, Comput. Stat. Data Anal..

[17]  Zhao Yang,et al.  Score tests for overdispersion in zero-inflated Poisson mixed models , 2010, Comput. Stat. Data Anal..

[18]  D. Hall Zero‐Inflated Poisson and Binomial Regression with Random Effects: A Case Study , 2000, Biometrics.

[19]  L. Lilien,et al.  A longitudinal study of developmental outcome of infants with bronchopulmonary dysplasia and very low birth weight. , 1997, Pediatrics.

[20]  Hongqi Xue,et al.  Semiparametric Analysis of Zero‐Inflated Count Data , 2006, Biometrics.

[21]  Peter C Austin,et al.  Absolute risk reductions, relative risks, relative risk reductions, and numbers needed to treat can be obtained from a logistic regression model. , 2010, Journal of clinical epidemiology.

[22]  Andy H. Lee,et al.  Zero‐inflated Poisson regression with random effects to evaluate an occupational injury prevention programme , 2001, Statistics in medicine.

[23]  Andy H. Lee,et al.  Multi-level zero-inflated Poisson regression modelling of correlated count data with excess zeros , 2006, Statistical methods in medical research.

[24]  J. Lewsey,et al.  The utility of the zero-inflated Poisson and zero-inflated negative binomial models: a case study of cross-sectional and longitudinal DMF data examining the effect of socio-economic status. , 2004, Community dentistry and oral epidemiology.

[25]  John Hinde,et al.  Score tests for zero-inflated Poisson models , 2002 .

[26]  Dankmar Böhning,et al.  The zero‐inflated Poisson model and the decayed, missing and filled teeth index in dental epidemiology , 1999 .

[27]  Monica Chiogna,et al.  Semiparametric zero-inflated Poisson models with application to animal abundance studies , 2007 .

[28]  J. Robins,et al.  Estimating causal effects from epidemiological data , 2006, Journal of Epidemiology and Community Health.

[29]  R. Bender,et al.  Estimating adjusted NNT measures in logistic regression analysis , 2007, Statistics in medicine.

[30]  Y. Cheung Growth and cognitive function of Indonesian children: zero‐inflated proportion models , 2006, Statistics in medicine.