Semiparametric Estimation of Treatment Effect in a Pretest-Posttest Study with Missing Data.

The pretest-posttest study is commonplace in numerous applications. Typically, subjects are randomized to two treatments, and response is measured at baseline, prior to intervention with the randomized treatment (pretest), and at a prespecified follow-up time (posttest). Interest focuses on the effect of treatment on the change in mean response from baseline to follow-up. Missing posttest response for some subjects is routine, and disregarding missing cases can lead to invalid inference. Despite the popularity of this design, no consensus exists on an appropriate analysis even when no data are missing, let alone when follow-up responses are missing. Under a semiparametric perspective on the pretest-posttest model, in which limited distributional assumptions on pretest or posttest response are made, we show how the theory of Robins, Rotnitzky and Zhao may be used to characterize a class of consistent treatment effect estimators and to identify the efficient estimator in the class. We then describe how the theoretical results translate into practice. The development not only shows how a unified framework for inference in this setting emerges from the Robins, Rotnitzky and Zhao theory, but also provides a review and demonstration of the key aspects of this theory in a familiar context. The results are also relevant to the problem of comparing two treatment means with adjustment for baseline covariates.
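To make the missing-data concern concrete, the following is a minimal simulation sketch, not the estimator developed in the paper. It assumes a hypothetical data-generating model (the coefficients, the logistic missingness model, and the function names `simulate`, `complete_case_effect`, and `ipw_effect` are all illustrative choices), and it assumes the probability of observing the posttest is known and depends only on treatment and pretest response. Because treatment is randomized, mean baseline response is the same in both arms, so the effect on mean change equals the difference in mean posttest response. The sketch compares a complete-case difference in means with a simple normalized inverse-probability-weighted difference.

```python
import numpy as np


def expit(x):
    """Logistic function."""
    return 1.0 / (1.0 + np.exp(-x))


def simulate(n, seed=0):
    """Hypothetical pretest-posttest trial with a true treatment effect of 1.0."""
    rng = np.random.default_rng(seed)
    z = rng.integers(0, 2, size=n)                 # randomized treatment indicator
    y1 = rng.normal(size=n)                        # pretest (baseline) response
    y2 = 0.5 * y1 + 1.0 * z + rng.normal(size=n)   # posttest response
    # Assumed missingness model: in the treated arm, subjects with low
    # baseline response are more likely to be missing at follow-up.
    p_obs = expit(0.5 + 1.5 * y1 * z)
    r = rng.random(n) < p_obs                      # True if posttest is observed
    return z, y1, y2, r, p_obs


def complete_case_effect(z, y2, r):
    """Difference in mean posttest response using observed cases only."""
    return y2[r & (z == 1)].mean() - y2[r & (z == 0)].mean()


def ipw_effect(z, y2, r, p_obs):
    """Normalized inverse-probability-weighted difference in posttest means."""
    w = r / p_obs                                  # zero weight for missing cases
    y2_filled = np.where(r, y2, 0.0)               # guard against NaN in missing y2
    mean1 = np.sum(w * y2_filled * (z == 1)) / np.sum(w * (z == 1))
    mean0 = np.sum(w * y2_filled * (z == 0)) / np.sum(w * (z == 0))
    return mean1 - mean0


z, y1, y2, r, p_obs = simulate(200_000)
cc = complete_case_effect(z, y2, r)
ipw = ipw_effect(z, y2, r, p_obs)
print("complete-case estimate:", cc)
print("IPW estimate:", ipw)
```

In this assumed setup the complete-case estimate is pulled away from the true effect because the observed treated subjects are not representative of their arm, while weighting each observed subject by the inverse of the observation probability restores consistency. In practice the observation probabilities would have to be modeled and estimated.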
