Sample size and power calculations based on generalized linear mixed models with correlated binary outcomes

The generalized linear mixed model (GLIMMIX) provides a powerful technique to model correlated outcomes with different types of distributions. The model can now be easily implemented with SAS PROC GLIMMIX in version 9.1. For binary outcomes, linearization methods of penalized quasi-likelihood (PQL) or marginal quasi-likelihood (MQL) provide relatively accurate variance estimates for fixed effects. Using GLIMMIX based on these linearization methods, we derived formulas for power and sample size calculations for longitudinal designs with attrition over time. We found that the power and sample size estimates depend on the within-subject correlation and the size of random effects. In this article, we present tables of minimum sample sizes commonly used to test hypotheses for longitudinal studies. A simulation study was used to compare the results. We also provide a Web link to the SAS macro that we developed to compute power and sample sizes for correlated binary outcomes.

[1]  G. Molenberghs,et al.  Models for Discrete Longitudinal Data , 2005 .

[2]  Roel Bosker,et al.  Standard Errors and Sample Sizes for Two-Level Research , 1993 .

[3]  K Y Liang,et al.  Sample size calculations for studies with correlated observations. , 1997, Biometrics.

[4]  Victor M. Gastañaga,et al.  Power calculations for generalized linear models in observational longitudinal studies: A simulation approach in SAS , 2006, Comput. Methods Programs Biomed..

[5]  D. Hedeker,et al.  Sample Size Estimation for Longitudinal Designs with Attrition: Comparing Time-Related Contrasts Between Two Groups , 1999 .

[6]  J Rochon,et al.  Application of GEE procedures for sample size calculations in repeated measures experiments. , 1997, Statistics in medicine.

[7]  W Pan,et al.  Sample size and power calculations with correlated binary data. , 2001, Controlled clinical trials.

[8]  Noreen Goldman,et al.  An assessment of estimation procedures for multilevel models with binary responses , 1995 .

[9]  R. Moineddin,et al.  A simulation study of sample size for multilevel logistic regression models , 2007, BMC medical research methodology.

[10]  James Rochon,et al.  Application of GEE procedures for sample size calculations in repeated measures experiments , 1998 .

[11]  W. Shih,et al.  Sample Size and Power Calculations for Periodontal and Other Studies with Clustered Samples Using the Method of Generalized Estimating Equations , 1997 .

[12]  R. Wolfinger,et al.  Generalized linear mixed models a pseudo-likelihood approach , 1993 .