Pattern‐Mixture Zero‐Inflated Mixed Models for Longitudinal Unbalanced Count Data with Excessive Zeros

Analysis of longitudinal data with excessive zeros has gained increasing attention in recent years; however, current approaches to the analysis of longitudinal data with excessive zeros have primarily focused on balanced data. Dropouts are common in longitudinal studies; therefore, the analysis of the resulting unbalanced data is complicated by the missing mechanism. Our study is motivated by the analysis of longitudinal skin cancer count data presented by Greenberg, Baron, Stukel, Stevens, Mandel, Spencer, Elias, Lowe, Nierenberg, Bayrd, Vance, Freeman, Clendenning, Kwan, and the Skin Cancer Prevention Study Group[New England Journal of Medicine 323, 789-795]. The data consist of a large number of zero responses (83% of the observations) as well as a substantial amount of dropout (about 52% of the observations). To account for both excessive zeros and dropout patterns, we propose a pattern-mixture zero-inflated model with compound Poisson random effects for the unbalanced longitudinal skin cancer data. We also incorporate an autoregressive of order 1 correlation structure in the model to capture longitudinal correlation of the count responses. A quasi-likelihood approach has been developed in the estimation of our model. We illustrated the method with analysis of the longitudinal skin cancer data.

[1]  Jay M. Ver Hoef,et al.  Space—time zero‐inflated count models of Harbor seals , 2007 .

[2]  Roderick J. A. Little,et al.  A Class of Pattern-Mixture Models for Normal Incomplete Data , 1994 .

[3]  M D Schluchter,et al.  Methods for the analysis of informatively censored longitudinal data. , 1992, Statistics in medicine.

[4]  M. Kenward,et al.  Informative Drop‐Out in Longitudinal Data Analysis , 1994 .

[5]  T. Stukel,et al.  A clinical trial of beta carotene to prevent basal-cell and squamous-cell cancers of the skin. The Skin Cancer Prevention Study Group. , 1990, The New England journal of medicine.

[6]  Donald Hedeker,et al.  Application of random-efiects pattern-mixture models for miss-ing data in longitudinal studies , 1997 .

[7]  Chris J. Skinner,et al.  The Muscatine children's obesity data reanalysed using pattern mixture models , 2008 .

[8]  James M. Robins,et al.  Semiparametric Regression for Repeated Outcomes With Nonignorable Nonresponse , 1998 .

[9]  Alan Agresti,et al.  Random effect models for repeated measures of zero-inflated count data , 2005 .

[10]  V. De Gruttola,et al.  Modelling progression of CD4-lymphocyte count and its relationship to survival time. , 1994, Biometrics.

[11]  Andy H. Lee,et al.  Zero‐inflated Poisson regression with random effects to evaluate an occupational injury prevention programme , 2001, Statistics in medicine.

[12]  G Molenberghs,et al.  Parametric models for incomplete continuous and categorical longitudinal data , 1999, Statistical methods in medical research.

[13]  Roderick J. A. Little,et al.  Modeling the Drop-Out Mechanism in Repeated-Measures Studies , 1995 .

[14]  Diane Lambert,et al.  Zero-inflacted Poisson regression, with an application to defects in manufacturing , 1992 .

[15]  R. Little Pattern-Mixture Models for Multivariate Incomplete Data , 1993 .

[16]  B. Jørgensen Exponential Dispersion Models , 1987 .

[17]  N M Laird,et al.  Mixture models for the joint distribution of repeated measures and event times. , 1997, Statistics in medicine.

[18]  S G Baker,et al.  Marginal regression for repeated binary data with outcome subject to non-ignorable non-response. , 1995, Biometrics.

[19]  D. Hall Zero‐Inflated Poisson and Binomial Regression with Random Effects: A Case Study , 2000, Biometrics.

[20]  Daniel Krewski,et al.  Random effects Cox models: A Poisson modelling approach , 2003 .

[21]  Alan Agresti,et al.  Modeling Nonnegative Data with Clumping at Zero: A Survey , 2002 .

[22]  Hongqi Xue,et al.  Semiparametric Analysis of Zero‐Inflated Count Data , 2006, Biometrics.

[23]  N M Laird,et al.  Model-based approaches to analysing incomplete longitudinal and failure time data. , 1997, Statistics in medicine.

[24]  Shou-En Lu,et al.  Analyzing Excessive No Changes in Clinical Trials with Clustered Data , 2004, Biometrics.

[25]  M. Kenward,et al.  The analysis of longitudinal ordinal data with nonrandom drop-out , 1997 .

[26]  Raymond J. Carroll,et al.  Estimation and comparison of changes in the presence of informative right censoring by modeling the censoring process , 1988 .

[27]  R. Ma,et al.  Modelling heterogeneity in clustered count data with extra zeros using compound Poisson random effect , 2009, Statistics in medicine.

[28]  Gary Sneddon,et al.  Zero-Inflated Poisson Regression for Longitudinal Data , 2009, Commun. Stat. Simul. Comput..

[29]  Wensheng Guo,et al.  A Random Pattern-Mixture Model for Longitudinal Data With Dropouts , 2004 .

[30]  B. Jørgensen,et al.  Nested generalized linear mixed models: an orthodox best linear unbiased predictor approach , 2007 .

[31]  On Generalized Quasilikelihood Inference in Longitudinal Mixed Model for Count Data , 2007 .

[32]  A marginalized pattern-mixture model for longitudinal binary data when nonresponse depends on unobserved responses. , 2007, Biostatistics.

[33]  Gordon K. Smyth,et al.  Evaluation of Tweedie exponential dispersion model densities by Fourier inversion , 2008, Stat. Comput..

[34]  N M Laird,et al.  Generalized linear mixture models for handling nonignorable dropouts in longitudinal studies. , 2000, Biostatistics.