Modeling Clustered Count Data with Excess Zeros in Health Care Outcomes Research

In health research, count outcomes are fairly common and often these counts have a large number of zeros. In order to adjust for these extra zero counts, various modifications of the Poisson regression model have been proposed. Lambert (Lambert, D., Technometrics 34, 1–14, 1992) described a zero-inflated Poisson (ZIP) model that is based on a mixture of a binary distribution (πi) degenerated at zero with a Poisson distribution (λi). Depending on the relationship between πi and λi, she described two variants: a ZIP and a ZIP (τ) model. In this paper, we extend these models for the case of clustered data (e.g., patients observed within hospitals) and describe random-effects ZIP and ZIP (τ) models. These models are appropriate for the analysis of clustered extra-zero Poisson count data. The distribution of the random effects is assumed to be normal and a maximum marginal likelihood estimation method is used to estimate the model parameters. We applied these models to data from patients who underwent colon operations from 123 Veterans Affairs Medical Centers in the National VA Surgical Quality Improvement Program.

[1]  H. Preisler Analysis of a Toxicological Experiment Using a Generalized Linear Model with , 1989 .

[2]  Larry Lee,et al.  Mean and Variance of Partially-Truncated Distributions , 1980 .

[3]  David C. Heilbron,et al.  Zero-Altered and other Regression Models for Count Data with Added Zeros , 1994 .

[4]  J. S. Long,et al.  Regression Models for Categorical and Limited Dependent Variables , 1997 .

[5]  J. A. Calvin Regression Models for Categorical and Limited Dependent Variables , 1998 .

[6]  W. Greene,et al.  Accounting for Excess Zeros and Sample Selection in Poisson and Negative Binomial Regression Models , 1994 .

[7]  Bernard Rosner,et al.  Multivariate Methods for Clustered Binary Data with More than One Level of Nesting , 1989 .

[8]  A. Cohen,et al.  Estimation of the Poisson Parameter from Truncated Samples and from Censored Samples , 1954 .

[9]  D. Hedeker,et al.  Application of random-effects probit regression models. , 1994, Journal of consulting and clinical psychology.

[10]  Dorothy D. Dunlop,et al.  Regression for Longitudinal Data: A Bridge from Least Squares Regression , 1994 .

[11]  Anthony S. Bryk,et al.  Hierarchical Linear Models: Applications and Data Analysis Methods , 1992 .

[12]  A. Stroud,et al.  Gaussian quadrature formulas , 1966 .

[13]  N. Breslow Extra‐Poisson Variation in Log‐Linear Models , 1984 .

[14]  Andy H. Lee,et al.  Zero‐inflated Poisson regression with random effects to evaluate an occupational injury prevention programme , 2001, Statistics in medicine.

[15]  Jim Albert,et al.  A Bayesian Analysis of a Poisson Random Effects Model for Home Run Hitters , 1992 .

[16]  J. Nelder,et al.  Double hierarchical generalized linear models (with discussion) , 2006 .

[17]  K. Hammermeister,et al.  Risk adjustment of the postoperative morbidity rate for the comparative assessment of the quality of surgical care: results of the National Veterans Affairs Surgical Risk Study. , 1998, Journal of the American College of Surgeons.

[18]  N. L. Johnson,et al.  Distributions in Statistics: Discrete Distributions. , 1970 .

[19]  D. Hedeker,et al.  Random effects probit and logistic regression models for three-level data. , 1997, Biometrics.

[20]  D. Hedeker,et al.  A random-effects ordinal regression model for multilevel analysis. , 1994, Biometrics.

[21]  J. Lawless Negative binomial and mixed Poisson regression , 1987 .

[22]  R. Prentice,et al.  Correlated binary regression with covariates specific to each binary observation. , 1988, Biometrics.

[23]  Nicholas P. Jewell,et al.  Some Comments on Rosner's Multiple Logistic Model for Clustered Data , 1990 .

[24]  J. R. Landis,et al.  Population-averaged and cluster-specific models for clustered ordinal response data. , 1996, Statistics in medicine.

[25]  J. Mullahy Specification and testing of some modified count data models , 1986 .

[26]  Pushpa L. Gupta,et al.  Analysis of zero-adjusted count data , 1996 .

[27]  R. Darrell Bock,et al.  Multilevel analysis of educational data , 1989 .

[28]  F B Hu,et al.  Random-effects regression analysis of correlated grouped-time survival data , 2000, Statistical methods in medical research.

[29]  J. Kalbfleisch,et al.  A Comparison of Cluster-Specific and Population-Averaged Approaches for Analyzing Correlated Binary Data , 1991 .

[30]  Diane Lambert,et al.  Zero-inflacted Poisson regression, with an application to defects in manufacturing , 1992 .

[31]  B. McNeil,et al.  Using admission characteristics to predict short-term mortality from myocardial infarction in elderly patients. Results from the Cooperative Cardiovascular Project. , 1996, JAMA.

[32]  D. Hall Zero‐Inflated Poisson and Binomial Regression with Random Effects: A Case Study , 2000, Biometrics.

[33]  F. Grover,et al.  The Department of Veterans Affairs' NSQIP: the first national, validated, outcome-based, risk-adjusted, and peer-controlled program for the measurement and enhancement of the quality of surgical care. National VA Surgical Quality Improvement Program. , 1998, Annals of surgery.

[34]  P F Thall,et al.  Mixed Poisson likelihood regression models for longitudinal interval count data. , 1988, Biometrics.

[35]  Bertram M. Gross,et al.  Event Count Models for International Relations: Generalizations and Applications , 2005 .

[36]  F. Grover,et al.  Risk adjustment of the postoperative mortality rate for the comparative assessment of the quality of surgical care: results of the National Veterans Affairs Surgical Risk Study. , 1997, Journal of the American College of Surgeons.

[37]  A. Cameron,et al.  Econometric models based on count data. Comparisons and applications of some estimators and tests , 1986 .

[38]  Q. Vuong Likelihood Ratio Tests for Model Selection and Non-Nested Hypotheses , 1989 .

[39]  Maria Blettner,et al.  Missing Data in Epidemiologic Studies , 2005 .

[40]  Bronwyn H Hall,et al.  Estimation and Inference in Nonlinear Structural Models , 1974 .

[41]  J. Nelder,et al.  Hierarchical Generalized Linear Models , 1996 .

[42]  Roel Bosker,et al.  Multilevel analysis : an introduction to basic and advanced multilevel modeling , 1999 .