A GEE approach to determine sample size for pre- and post-intervention experiments with dropout

Pre- and post-intervention experiments are widely used in medical and social behavioral studies, where each subject is supposed to contribute a pair of observations. In this paper we investigate sample size requirement for a scenario frequently encountered by practitioners: All enrolled subjects participate in the pre-intervention phase of study, but some of them will drop out due to various reasons, thus resulting in missing values in the post-intervention measurements. Traditional sample size calculation based on the McNemar's test could not accommodate missing data. Through the GEE approach, we derive a closed-form sample size formula that properly accounts for the impact of partial observations. We demonstrate that when there is no missing data, the proposed sample size estimate under the GEE approach is very close to that under the McNemar's test. When there is missing data, the proposed method can lead to substantial saving in sample size. Simulation studies and an example are presented.

[1]  M. Dignan,et al.  An Increase in HPV-Related Knowledge and Vaccination Intent Among Parental and Non-parental Caregivers of Adolescent Girls, Age 9–17 Years, in Appalachian Pennsylvania , 2011, Journal of Cancer Education.

[2]  Sin-Ho Jung,et al.  Sample size for a two‐group comparison of repeated binary measurements using GEE , 2005, Statistics in medicine.

[3]  H. Kunins,et al.  Hospitalizations and Skilled Nursing Facility Admissions Before and After the Implementation of a Home‐Based Primary Care Program , 2010, Journal of the American Geriatrics Society.

[4]  C. Angelini,et al.  Retrospective study on PET–SPECT imaging in a large cohort of myotonic dystrophy type 1 patients , 2010, Neurological Sciences.

[5]  P. Oster,et al.  Improving care for patients with dementia hospitalized for acute somatic illness in a specialized care unit: a feasibility study , 2009, International Psychogeriatrics.

[6]  J M Lachin,et al.  Power and sample size evaluation for the McNemar test with application to matched case-control studies. , 1992, Statistics in medicine.

[7]  Giacomo Vespasiani,et al.  An interactive diary for diet management (DAI): a new telemedicine system able to promote body weight reduction, nutritional education, and consumption of fresh local produce. , 2010, Diabetes technology & therapeutics.

[8]  V. Skaggs,et al.  The effect of digital cervical examination on group B streptococcal culture. , 2010, American journal of obstetrics and gynecology.

[9]  S. Ennett,et al.  Analysis of prevention program effectiveness with clustered data using generalized estimating equations. , 1996, Journal of consulting and clinical psychology.

[10]  O S Miettinen,et al.  The matched pairs design in the case of all-or-none responses. , 1968, Biometrics.

[11]  J A Bean,et al.  On the sample size for one-sided equivalence of sensitivities based upon McNemar's test. , 1995, Statistics in medicine.

[12]  M. Gonen Sample size and power for McNemar's test with clustered data. , 2004 .

[13]  P. Albert,et al.  Models for longitudinal data: a generalized estimating equation approach. , 1988, Biometrics.

[14]  W. G. Cochran The comparison of percentages in matched samples. , 1950, Biometrika.

[15]  APPROXIMATING POWER OF THE UNCONDITIONAL TEST FOR CORRELATED BINARY PAIRS. , 1998, Communications in statistics: Simulation and computation.

[16]  D. Altman,et al.  Estimating sample sizes for continuous, binary, and ordinal outcomes in paired comparisons: practical hints. , 1999, Journal of biopharmaceutical statistics.

[17]  M. Lawson,et al.  Prevalence of nocturnal hypoglycemia in pediatric type 1 diabetes: a pilot study using continuous glucose monitoring. , 2011, The Journal of pediatrics.

[18]  Q. Mcnemar Note on the sampling error of the difference between correlated proportions or percentages , 1947, Psychometrika.

[19]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[20]  P Royston,et al.  Exact conditional and unconditional sample size for pair-matched studies with binary outcome: a practical guide. , 1993, Statistics in medicine.

[21]  James Rochon,et al.  Application of GEE procedures for sample size calculations in repeated measures experiments , 1998 .

[22]  K Y Liang,et al.  Sample size calculations for studies with correlated observations. , 1997, Biometrics.

[23]  R. Connor Sample size for testing differences in proportions for the paired-sample design. , 1987, Biometrics.

[24]  Number of observations required for the comparison of two correlated proportions , 1980 .