Marginal correlation from an extended random-effects model for repeated and overdispersed counts

Vangeneugden et al. [15] derived approximate correlation functions for longitudinal sequences of general data type, Gaussian and non-Gaussian, based on generalized linear mixed-effects models (GLMM). Their focus was on binary sequences, as well as on a combination of binary and Gaussian sequences. Here, we focus on the specific case of repeated count data, important in two respects. First, we employ the model proposed by Molenberghs et al. [13], which generalizes at the same time the Poisson-normal GLMM and the conventional overdispersion models, in particular the negative-binomial model. The model flexibly accommodates data hierarchies, intra-sequence correlation, and overdispersion. Second, means, variances, and joint probabilities can be expressed in closed form, allowing for exact intra-sequence correlation expressions. Next to the general situation, some important special cases such as exchangeable clustered outcomes are considered, producing insightful expressions. The closed-form expressions are contrasted with the generic approximate expressions of Vangeneugden et al. [15]. Data from an epileptic-seizures trial are analyzed and correlation functions derived. It is shown that the proposed extension strongly outperforms the classical GLMM.

[1]  Eric R. Ziegel,et al.  Generalized Linear Models , 2002, Technometrics.

[2]  Geert Molenberghs,et al.  Applying Concepts of Generalizability Theory on Clinical Trial Data to Investigate Sources of Variation and Their Impact on Reliability , 2005, Biometrics.

[3]  N. Breslow,et al.  Approximate inference in generalized linear mixed models , 1993 .

[4]  Ana Ivelisse Avilés,et al.  Linear Mixed Models for Longitudinal Data , 2001, Technometrics.

[5]  G. Pledger,et al.  Topiramate placebo-controlled dose-ranging trial in refractory partial epilepsy using 200-, 400-, and 600-mg daily dosages , 1996, Neurology.

[6]  P. McCullagh,et al.  Generalized Linear Models , 1992 .

[7]  Andrea Rotnitzky,et al.  Estimation of regression models for the mean of repeated outcomes under nonignorable nonmonotone nonresponse. , 2007, Biometrika.

[8]  J. Lawless Negative binomial and mixed Poisson regression , 1987 .

[9]  L. Hunt,et al.  Missing Data in Clinical Studies , 2007 .

[10]  G. Molenberghs,et al.  An extended random-effects approach to modeling repeated, overdispersed count data , 2007, Lifetime data analysis.

[11]  John Hinde,et al.  Overdispersion: models and estimation , 1998 .

[12]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[13]  Geert Molenberghs,et al.  A combined overdispersed and marginalized multilevel model , 2012, Comput. Stat. Data Anal..

[14]  G. Pledger,et al.  Topiramate placebo-controlled dose-ranging trial in refractory partial epilepsy using 600-, 800-, and 1,000-mg daily dosages , 1996, Neurology.

[15]  N. Breslow Extra‐Poisson Variation in Log‐Linear Models , 1984 .

[16]  Geert Molenberghs,et al.  Marginal Correlation in Longitudinal Binary Data Based on Generalized Linear Mixed Models , 2010 .

[17]  G. Molenberghs,et al.  Models for Discrete Longitudinal Data , 2005 .

[18]  Herwig Friedl,et al.  Negative binomial loglinear mixed models , 2003 .

[19]  Nicole A. Lazar,et al.  Statistical Analysis With Missing Data , 2003, Technometrics.