Performance of a Mixed Effects Logistic Regression Model for Binary Outcomes With Unequal Cluster Size

ABSTRACT When a clustered randomized controlled trial is considered at a design stage of a clinical trial, it is useful to consider the consequences of unequal cluster size (i.e., sample size per cluster). Furthermore, the assumption of independence of observations within cluster does not hold, of course, because the subjects share the same cluster. Moreover, when the clustered outcomes are binary, a mixed effect logistic regression model is applicable. This article compares the performance of a maximum likelihood estimation of the mixed effects logistic regression model with equal and unequal cluster sizes. This was evaluated in terms of type I error rate, power, bias, and standard error through computer simulations that varied treatment effect, number of clusters, and intracluster correlation coefficients. The results show that the performance of the mixed effects logistic regression model is very similar, regardless of inequality in cluster size. This is illustrated using data from the Prevention Of Suicide in Primary care Elderly: Collaborative Trial (PROSPECT) study.

[1]  O. Ukoumunne A comparison of confidence interval methods for the intraclass correlation coefficient in cluster randomized trials , 2002, Statistics in medicine.

[2]  D. Hedeker,et al.  A random-effects ordinal regression model for multilevel analysis. , 1994, Biometrics.

[3]  Moonseong Heo,et al.  Comparison of statistical methods for analysis of clustered binary observations , 2005, Statistics in medicine.

[4]  D. Bates,et al.  Approximations to the Log-Likelihood Function in the Nonlinear Mixed-Effects Model , 1995 .

[5]  E. Crouch,et al.  The Evaluation of Integrals of the form ∫+∞ −∞ f(t)exp(−t 2) dt: Application to Logistic-Normal Models , 1990 .

[6]  Geert Molenberghs,et al.  Linear Mixed Models in Practice , 1997 .

[7]  Donald Hedeker,et al.  Application of random-efiects pattern-mixture models for miss-ing data in longitudinal studies , 1997 .

[8]  Roderick J. A. Little,et al.  Statistical Analysis with Missing Data: Little/Statistical Analysis with Missing Data , 2002 .

[9]  M. Heo,et al.  Re-engineering systems for the treatment of depression in primary care: cluster randomised controlled trial , 2004, BMJ : British Medical Journal.

[10]  M. Kenward,et al.  Informative Drop‐Out in Longitudinal Data Analysis , 1994 .

[11]  N M Laird,et al.  Missing data in longitudinal studies. , 1988, Statistics in medicine.

[12]  Gene H. Golub,et al.  Calculation of Gauss quadrature rules , 1967, Milestones in Matrix Computation.

[13]  A. Leon Sample-Size Requirements for Comparisons of Two Groups on Repeated Observations of a Binary Outcome , 2004, Evaluation & the health professions.

[14]  Thomas R Ten Have,et al.  Reducing suicidal ideation and depressive symptoms in depressed older primary care patients: a randomized controlled trial. , 2004, JAMA.

[15]  Scott L. Zeger,et al.  Generalized linear models with random e ects: a Gibbs sampling approach , 1991 .

[16]  D. Hedeker,et al.  MIXOR: a computer program for mixed-effects ordinal regression analysis. , 1996, Computer methods and programs in biomedicine.

[17]  Nicole A. Lazar,et al.  Statistical Analysis With Missing Data , 2003, Technometrics.

[18]  S G Thompson,et al.  Analysis of cluster randomized trials with repeated cross-sectional binary measurements. , 2001, Statistics in medicine.

[19]  Richard Birtwhistle,et al.  Pragmatic controlled clinical trials in primary care: the struggle between external and internal validity. , 2003, BMC medical research methodology.

[20]  G Piaggio,et al.  Methodological considerations on the design and analysis of an equivalence stratified cluster randomization trial. , 2001, Statistics in medicine.

[21]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[22]  P. Diggle Analysis of Longitudinal Data , 1995 .

[23]  M. Kenward,et al.  Informative dropout in longitudinal data analysis (with discussion) , 1994 .

[24]  D. Bates,et al.  Mixed-Effects Models in S and S-PLUS , 2001 .

[25]  M. Hamilton A RATING SCALE FOR DEPRESSION , 1960, Journal of neurology, neurosurgery, and psychiatry.

[26]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[27]  Allan Donner,et al.  Design and Analysis of Cluster Randomization Trials in Health Research , 2001 .

[28]  G. E. Gray Evidence-Based Medicine: An Introduction for Psychiatrists , 2002, Journal of Psychiatric Practice.

[29]  A. Davison,et al.  Non‐parametric bootstrap confidence intervals for the intraclass correlation coefficient , 2003, Statistics in medicine.

[30]  Allan Donner,et al.  Some aspects of the design and analysis of cluster randomization trials , 2002 .

[31]  Deborah Ashby,et al.  Lessons for cluster randomized trials in the twenty-first century: a systematic review of trials in primary care , 2004, Clinical trials.