Bayesian methods of analysis for cluster randomized trials with binary outcome data.

We explore the potential of Bayesian hierarchical modelling for the analysis of cluster randomized trials with binary outcome data, and apply the methods to a trial randomized by general practice. An approximate relationship is derived between the intracluster correlation coefficient (ICC) and the between-cluster variance used in a hierarchical logistic regression model. By constructing an informative prior for the ICC on the basis of available information, we are thus able implicitly to specify an informative prior for the between-cluster variance. The approach also provides us with a credible interval for the ICC for binary outcome data. Several approaches to constructing informative priors from empirical ICC values are described. We investigate the sensitivity of results to the prior specified and find that the estimate of intervention effect changes very little in this data set, while its interval estimate is more sensitive. The Bayesian approach allows us to assume distributions other than normality for the random effects used to model the clustering. This enables us to gain insight into the robustness of our parameter estimates to the classical normality assumption. In a model with a more complex variance structure, Bayesian methods can provide credible intervals for a difference between two variance components, in order for example to investigate whether the effect of intervention varies across clusters. We compare our results with those obtained from classical estimation, discuss the relative merits of the Bayesian framework, and conclude that the flexibility of the Bayesian approach offers some substantial advantages, although selection of prior distributions is not straightforward.

[1]  P. Diggle,et al.  Analysis of Longitudinal Data , 2003 .

[2]  S. Chinn,et al.  Components of variance and intraclass correlations for the design of community-based surveys and intervention studies: data from the Health Survey for England 1994. , 1999, American journal of epidemiology.

[3]  Paul Aveyard,et al.  Cluster randomised controlled trial of expert system based on the transtheoretical (“stages of change”) model for smoking prevention and cessation in schools , 1999, BMJ.

[4]  J. Sterne,et al.  Clustered randomised trial of an intervention to improve the management of asthma: Greenwich asthma study , 1999, BMJ.

[5]  T. Lewis,et al.  Outliers in multilevel data , 1998 .

[6]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[7]  P Marsh,et al.  Preventing injuries in children: cluster randomised controlled trial in primary care , 1999, BMJ.

[8]  J. Robson,et al.  Improving uptake of breast screening in multiethnic populations: a randomised controlled trial using practice reception staff to contact non-attenders , 1997, BMJ.

[9]  C. Roberts,et al.  Randomising groups of patients , 1998, BMJ.

[10]  T R Ten Have,et al.  A comparison of mixed effects logistic regression models for binary response data with two nested levels of clustering. , 1999, Statistics in medicine.

[11]  J K Lindsey,et al.  On the appropriateness of marginal models for repeated measurements in clinical trials. , 1998, Statistics in medicine.

[12]  S. Thompson Letter to the Editor: The merits of matching in community intervention trials: a cautionary tale by N. Klar and A. Donner, Statistics in Medicine, 16, 1753–1764 (1997) , 1998 .

[13]  N. Breslow,et al.  Approximate inference in generalized linear mixed models , 1993 .

[14]  David R. Jones,et al.  An introduction to bayesian methods in health technology assessment , 1999, BMJ.

[15]  J. Kalbfleisch,et al.  A Comparison of Cluster-Specific and Population-Averaged Approaches for Analyzing Correlated Binary Data , 1991 .

[16]  R. Tibshirani,et al.  An introduction to the bootstrap , 1993 .

[17]  D. Firth,et al.  Estimating Intraclass Correlation for Binary Data , 1999, Biometrics.

[18]  A Donner,et al.  Adjustments to the Mantel-Haenszel chi-square statistic and odds ratio variance estimator when the data are clustered. , 1987, Statistics in medicine.

[19]  Y. Ohashi,et al.  A Bayesian hierarchical survival model for the institutional effects in a multi-centre cancer clinical trial. , 1998, Statistics in medicine.

[20]  Elena Losina,et al.  An introduction to hierarchical linear modelling , 1999 .

[21]  Marie Davidian,et al.  The Nonlinear Mixed Effects Model with a Smooth Random Effects Density , 1993 .

[22]  A. Raftery,et al.  How Many Iterations in the Gibbs Sampler , 1991 .

[23]  R Z Omar,et al.  Analysis of a cluster randomized trial with binary outcome data using a multi-level model. , 2000, Statistics in medicine.

[24]  L. Wasserman,et al.  The Selection of Prior Distributions by Formal Rules , 1996 .

[25]  Murray Aitkin,et al.  A general maximum likelihood analysis of overdispersion in generalized linear models , 1996, Stat. Comput..

[26]  A Donner,et al.  Methods for comparing event rates in intervention studies when the unit of allocation is a cluster. , 1994, American journal of epidemiology.

[27]  R. Kass,et al.  Reference Bayesian Methods for Generalized Linear Mixed Models , 2000 .

[28]  R Z Omar,et al.  Analysing repeated measurements data: a practical comparison of methods. , 1999, Statistics in medicine.

[29]  D. A. Williams,et al.  Extra‐Binomial Variation in Logistic Linear Models , 1982 .

[30]  J. Kalbfleisch,et al.  The effects of mixture distribution misspecification when fitting mixed-effects logistic models , 1992 .

[31]  J. Atchison,et al.  Logistic-normal distributions:Some properties and uses , 1980 .

[32]  A. Agresti Distribution-free fitting of logit models with random effects for repeated categorical responses. , 1993, Statistics in medicine.

[33]  J. Sterne,et al.  Methods for evaluating area-wide and organisation-based interventions in health and health care: a systematic review. , 1999, Health technology assessment.

[34]  D. Commenges,et al.  The intraclass correlation coefficient: distribution-free definition and test. , 1994, Biometrics.

[35]  D. Spiegelhalter,et al.  Bayesian Analysis of Realistically Complex Models , 1996 .

[36]  S G Thompson,et al.  The design and analysis of paired cluster randomized trials: an application of meta-analysis techniques. , 1997, Statistics in medicine.

[37]  Harvey Goldstein,et al.  Improved Approximations for Multilevel Models with Binary Responses , 1996 .

[38]  Kelvyn Jones Review of HLM 4 for Windows , 1996 .