Testing overdispersion in the zero-inflated Poisson model

Abstract: The zero-inflated negative binomial (ZINB) model is used to account for the overdispersion that is commonly detected in data initially analyzed under the zero-inflated Poisson (ZIP) model. Tests for overdispersion in ZIP regression models (the Wald test, the likelihood ratio test [LRT], and the score test) have been developed based on the ZINB model. Because of its similarity to the ZINB model, we consider the zero-inflated generalized Poisson (ZIGP) model as an alternative model for overdispersed zero-inflated count data. The score test has an advantage over the LRT and the Wald test: it requires parameter estimates only under the null hypothesis, that is, under the ZIP model. This paper proposes score tests for overdispersion based on the ZIGP model and shows that the derived score statistics are exactly the same as the score statistics under the ZINB model. A simulation study indicates that the proposed score statistics are preferable to the other tests because they attain higher empirical power. In practice, either the ZINB or the ZIGP model can be considered, depending on the approximate mean–variance relationship in the data, and a formal score test based on the asymptotic standard normal distribution can be used to assess overdispersion in the ZIP model. We provide an example to illustrate the data analysis procedures.
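The mean–variance relationship mentioned above can be made concrete with a small sketch. It is illustrative only and is not taken from the paper: it assumes the common NB2 form of the negative binomial (variance mu + alpha * mu^2) for the count component, and the function names `zip_moments` and `zinb_moments` and the parameter names `pi` and `alpha` are hypothetical. Under these assumptions the ZINB variance exceeds the ZIP variance whenever alpha > 0, and the two coincide at alpha = 0, which is the null hypothesis of the overdispersion test.

```python
def zip_moments(lam, pi):
    """Mean and variance of a zero-inflated Poisson variable.

    lam: Poisson rate of the count component (assumed name)
    pi:  probability of a structural zero (assumed name)
    """
    mean = (1.0 - pi) * lam
    var = mean * (1.0 + pi * lam)
    return mean, var


def zinb_moments(mu, pi, alpha):
    """Mean and variance of a zero-inflated negative binomial variable.

    Assumes the NB2 parameterization, where the count component has
    mean mu and variance mu + alpha * mu**2. Setting alpha = 0
    recovers the ZIP moments.
    """
    mean = (1.0 - pi) * mu
    var = mean * (1.0 + (pi + alpha) * mu)
    return mean, var


if __name__ == "__main__":
    # Same mean and zero-inflation probability, increasing dispersion.
    for alpha in (0.0, 0.5, 1.0):
        mean, var = zinb_moments(mu=2.0, pi=0.3, alpha=alpha)
        print(f"alpha={alpha:.1f}  mean={mean:.3f}  variance={var:.3f}")
```

Comparing sample variances against the ZIP variance implied by fitted values of `lam` and `pi` gives only an informal indication of overdispersion; the formal assessment remains the score test described in the abstract.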
