A maximum likelihood approach to power calculations for stepped wedge designs of binary outcomes.

In stepped wedge designs (SWD), clusters are randomized to the time period during which new patients will begin to receive the intervention under study, in a sequential rollout over time. By the study's end, patients at all clusters receive the intervention, eliminating ethical concerns related to withholding potentially efficacious treatments. This is a practical option in many large-scale public health implementation settings. Little statistical theory exists for these designs when the outcome is binary. To address this, we used a maximum likelihood approach and developed numerical methods to determine the asymptotic power of the SWD for binary outcomes. We studied how the power of an SWD for detecting risk differences varies as a function of the number of clusters, cluster size, the baseline risk, the intervention effect, the intra-cluster correlation coefficient, and the time effect. We studied the robustness of power to the assumed form of the distribution of the cluster random effects, as well as how power is affected by variable cluster size. SWD power is sensitive to neither, in contrast to the parallel cluster randomized design, which is highly sensitive to variable cluster size. We also found that the approximate weighted least squares approach of Hussey and Hughes (2007, Design and analysis of stepped wedge cluster randomized trials. Contemporary Clinical Trials 28, 182-191) for binary outcomes underestimates the power in some regions of the parameter space and overestimates it in others. The new method was applied to the design of a large-scale intervention program on postpartum intrauterine device insertion services for preventing unintended pregnancy in the first 1.5 years following childbirth in Tanzania, where it was found that the previously available method underestimated the power.
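
For orientation, the comparison method named above can be sketched in a few lines. The following is a minimal illustration of the Hussey and Hughes (2007) closed-form variance for a linear mixed model with fixed period effects and a random cluster intercept, applied to a risk difference. It is not the maximum likelihood method developed in this paper. The function names `swd_design` and `hh_power`, the equal-allocation layout with one baseline period, the use of the average risk to set the within-cluster variance, and the conversion from the intra-cluster correlation to the between-cluster variance are all assumptions made for the example.

```python
from statistics import NormalDist


def swd_design(n_clusters, n_steps):
    """Treatment indicators X[i][j] for a standard stepped wedge layout (assumed):
    one baseline period, then equal-sized groups of clusters cross over,
    one group per step."""
    if n_clusters % n_steps != 0:
        raise ValueError("n_clusters must be a multiple of n_steps")
    per_step = n_clusters // n_steps
    n_periods = n_steps + 1
    return [[1 if j > i // per_step else 0 for j in range(n_periods)]
            for i in range(n_clusters)]


def hh_power(X, n_per_cell, baseline_risk, risk_diff, icc, alpha=0.05):
    """Approximate two-sided power for a risk difference using the
    Hussey and Hughes closed-form variance of the treatment effect."""
    I, T = len(X), len(X[0])
    # Assumption: within-cluster variance is evaluated at the average risk.
    p_bar = baseline_risk + risk_diff / 2
    sigma_e2 = p_bar * (1 - p_bar)
    sigma2 = sigma_e2 / n_per_cell          # variance of a cluster-period mean
    # Assumption: icc = tau2 / (tau2 + sigma_e2) on the risk scale.
    tau2 = icc * sigma_e2 / (1 - icc)
    U = sum(sum(row) for row in X)                                   # treated cells
    W = sum(sum(X[i][j] for i in range(I)) ** 2 for j in range(T))   # squared column sums
    V = sum(sum(row) ** 2 for row in X)                              # squared row sums
    var_theta = I * sigma2 * (sigma2 + T * tau2) / (
        (I * U - W) * sigma2 + (U ** 2 + I * T * U - T * W - I * V) * tau2
    )
    z = NormalDist()
    return z.cdf(abs(risk_diff) / var_theta ** 0.5 - z.inv_cdf(1 - alpha / 2))


# Example call with illustrative (made-up) inputs: 12 clusters, 4 steps,
# 50 patients per cluster-period, baseline risk 0.30, risk difference 0.10.
example_power = hh_power(swd_design(12, 4), 50, 0.30, 0.10, icc=0.05)
```

Because this approximation treats cluster-period proportions as normally distributed with a common variance, it is the kind of calculation the abstract reports as underestimating power in some regions of the parameter space and overestimating it in others.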

[1]  Steven Teerenstra,et al.  Sample size calculation for stepped wedge and other longitudinal cluster randomised trials , 2016, Statistics in medicine.

[2]  M. Taljaard,et al.  Systematic review finds major deficiencies in sample size methodology and reporting for stepped-wedge cluster randomised trials , 2016, BMJ Open.

[3]  Xiaomei Liao,et al.  "Cross-sectional" stepped wedge designs always reduce the required sample size when there is no time effect. , 2017, Journal of clinical epidemiology.

[4]  R. Lilford,et al.  The stepped wedge trial design: a systematic review , 2022, BMC Medical Research Methodology.

[5]  Richard J. Hayes,et al.  Cluster randomised trials , 2009 .

[6]  P. Heagerty,et al.  Misspecified maximum likelihood estimates and generalised linear mixed models , 2001 .

[7]  J. Hughes,et al.  Design and analysis of stepped wedge cluster randomized trials. , 2007, Contemporary clinical trials.

[8]  Math J J M Candel,et al.  Repairing the efficiency loss due to varying cluster sizes in two‐level two‐armed randomized trials with heterogeneous clustering , 2016, Statistics in medicine.

[9]  Gerard J P van Breukelen,et al.  Relative efficiency of unequal versus equal cluster sizes in cluster randomized and multicentre trials , 2007, Statistics in medicine.

[10]  R J Lilford,et al.  The stepped wedge cluster randomised trial: rationale, design, analysis, and reporting , 2015, BMJ : British Medical Journal.

[11]  D. Spiegelman,et al.  Institutionalizing postpartum intrauterine device (IUD) services in Sri Lanka, Tanzania, and Nepal: study protocol for a cluster-randomized stepped-wedge trial , 2016, BMC Pregnancy and Childbirth.

[12]  N. Breslow,et al.  Approximate inference in generalized linear mixed models , 1993 .

[13]  P. Heagerty,et al.  Current issues in the design and analysis of stepped wedge trials. , 2015, Contemporary clinical trials.

[14]  G. Molenberghs,et al.  Type I and Type II Error Under Random‐Effects Misspecification in Generalized Linear Mixed Models , 2007, Biometrics.

[15]  M. Taljaard,et al.  Sample size calculations for stepped wedge and cluster randomised trials: a unified approach , 2016, Journal of clinical epidemiology.

[16]  Allan Donner,et al.  Design and Analysis of Cluster Randomization Trials in Health Research , 2001 .