A multiple imputation strategy for sequential multiple assignment randomized trials

Sequential multiple assignment randomized trials (SMARTs) are increasingly being used to inform clinical and intervention science. In a SMART, each patient is repeatedly randomized over time. Each randomization occurs at a critical decision point in the treatment course. These critical decision points often correspond to milestones in the disease process or other changes in a patient's health status. Thus, the timing and number of randomizations may vary across patients and depend on evolving patient-specific information. This presents unique challenges when analyzing data from a SMART in the presence of missing data. This paper presents the first comprehensive discussion of missing data issues typical of SMART studies: we describe five specific challenges and propose a flexible imputation strategy to facilitate valid statistical estimation and inference using incomplete data from a SMART. To illustrate these contributions, we consider data from the Clinical Antipsychotic Trial of Intervention and Effectiveness, one of the most well-known SMARTs to date.

[1]  David S. Siscovick,et al.  A multiple-imputation analysis of a case-control study of the risk of primary cardiac arrest among pharmacologicallytreated hypertensives , 1996 .

[2]  Jürgen Unützer,et al.  A comparison of imputation methods in a longitudinal randomized clinical trial , 2005, Statistics in medicine.

[3]  S. Kay,et al.  The positive and negative syndrome scale (PANSS) for schizophrenia. , 1987, Schizophrenia bulletin.

[4]  John M. Davis,et al.  Head-to-head comparisons of metabolic side effects of second generation antipsychotics in the treatment of schizophrenia: A systematic review and meta-analysis , 2010, Schizophrenia Research.

[5]  James R Carpenter,et al.  Sensitivity analysis after multiple imputation under missing at random: a weighting approach , 2007, Statistical methods in medical research.

[6]  A. Zaslavsky,et al.  Multiple imputation in a large-scale complex survey: a practical guide , 2010, Statistical methods in medical research.

[7]  J. Lieberman,et al.  Effectiveness of olanzapine, quetiapine, risperidone, and ziprasidone in patients with chronic schizophrenia following discontinuation of a previous atypical antipsychotic. , 2006, The American journal of psychiatry.

[8]  J. Robins,et al.  Analysis of semiparametric regression models for repeated outcomes in the presence of missing data , 1995 .

[9]  S. Murphy,et al.  Experimental design and primary data analysis methods for comparing adaptive interventions. , 2012, Psychological methods.

[10]  C. Adams Schizophrenia trials: past, present and future , 2002, Epidemiologia e Psichiatria Sociale.

[11]  D. Kupfer,et al.  Background and rationale for the sequenced treatment alternatives to relieve depression (STAR*D) study. , 2003, The Psychiatric clinics of North America.

[12]  D. Kupfer,et al.  Strengthening clinical effectiveness trials: Equipoise-stratified randomization , 2001, Biological Psychiatry.

[13]  Geert Verbeke,et al.  Multiple Imputation for Model Checking: Completed‐Data Plots with Missing and Latent Data , 2005, Biometrics.

[14]  J. Schafer,et al.  Computational Strategies for Multivariate Linear Mixed-Effects Models With Missing Values , 2002 .

[15]  B. Arnold,et al.  Conditionally Specified Distributions: An Introduction (with comments and a rejoinder by the authors) , 2001 .

[16]  Edward J. Lee,et al.  A randomized controlled trial of filgrastim during remission induction and consolidation chemotherapy for adults with acute lymphoblastic leukemia: CALGB study 9111. , 1998, Blood.

[17]  William A Ghali,et al.  Multiple imputation versus data enhancement for dealing with missing data in observational health care outcome analyses. , 2002, Journal of clinical epidemiology.

[18]  Philip W. Lavori,et al.  A design for testing clinical strategies: biased adaptive within‐subject randomization , 2000 .

[19]  Ken P Kleinman,et al.  Much Ado About Nothing , 2007, The American statistician.

[20]  H. Boshuizen,et al.  Multiple imputation of missing blood pressure covariates in survival analysis. , 1999, Statistics in medicine.

[21]  Roger A. Sugden,et al.  Multiple Imputation for Nonresponse in Surveys , 1988 .

[22]  J. Robins,et al.  Estimation and extrapolation of optimal treatment and testing strategies , 2008, Statistics in medicine.

[23]  S. van Buuren Multiple imputation of discrete and continuous data by fully conditional specification , 2007, Statistical methods in medical research.

[24]  Elizabeth A Stuart,et al.  American Journal of Epidemiology Practice of Epidemiology Multiple Imputation with Large Data Sets: a Case Study of the Children's Mental Health Initiative , 2022 .

[25]  D. Rubin,et al.  Statistical Analysis with Missing Data , 1988 .

[26]  Joseph L Schafer,et al.  Analysis of Incomplete Multivariate Data , 1997 .

[27]  Jaakko Nevalainen,et al.  Missing values in longitudinal dietary data: A multiple imputation approach based on a fully conditional specification , 2009, Statistics in medicine.

[28]  T. Speed,et al.  Characterizing a joint probability distribution by conditionals , 1993 .

[29]  Paula Diehr,et al.  Imputation of missing longitudinal data: a comparison of methods. , 2003, Journal of clinical epidemiology.

[30]  James M. Robins,et al.  Optimal Structural Nested Models for Optimal Sequential Decisions , 2004 .

[31]  Inbal Nahum-Shani,et al.  Q-learning: a data analysis method for constructing adaptive interventions. , 2012, Psychological methods.

[32]  S. Murphy,et al.  A "SMART" design for building individualized treatment sequences. , 2012, Annual review of clinical psychology.

[33]  J. Robins A new approach to causal inference in mortality studies with a sustained exposure period—application to control of the healthy worker survivor effect , 1986 .

[34]  Steven G. Heeringa Multivariate imputation of coarsened survey data on household wealth. , 2000 .

[35]  B. Arnold,et al.  Conditionally specified distributions: an introduction , 2001 .

[36]  S. Murphy,et al.  An experimental design for the development of adaptive treatment strategies , 2005, Statistics in medicine.

[37]  D. Kupfer,et al.  Sequenced treatment alternatives to relieve depression (STAR*D): rationale and design. , 2004, Controlled clinical trials.

[38]  J. Robins,et al.  The International Journal of Biostatistics CAUSAL INFERENCE Dynamic Regime Marginal Structural Mean Models for Estimation of Optimal Dynamic Treatment Regimes , Part I : Main Content , 2011 .

[39]  T. Stroup,et al.  Assessing Clinical and Functional Outcomes in the Clinical Antipsychotic Trials of Intervention Effectiveness (catie) Schizophrenia Trial Send Reprint Requests to Clinical Outcome Measures: Primary Outcome Clinical and Functional Outcomes Table 1. Catie Schizophrenia Trial Centers' Clinical and Func , 2022 .

[40]  Erica E M Moodie,et al.  Demystifying Optimal Dynamic Treatment Regimes , 2007, Biometrics.

[41]  D. Rubin,et al.  Fully conditional specification in multivariate imputation , 2006 .

[42]  A. Tsiatis,et al.  Optimal Estimator for the Survival Distribution and Related Quantities for Treatment Policies in Two‐Stage Randomization Designs in Clinical Trials , 2004, Biometrics.

[43]  P. Lavori,et al.  Placebo‐free designs for evaluating new mental health treatments: the use of adaptive treatment strategies , 2004, Statistics in medicine.

[44]  J. Lieberman,et al.  The National Institute of Mental Health Clinical Antipsychotic Trials of Intervention Effectiveness (CATIE) project: schizophrenia trial design and protocol development. , 2003, Schizophrenia bulletin.

[45]  James M. Robins,et al.  Causal Inference from Complex Longitudinal Data , 1997 .

[46]  M P Becker,et al.  A multiple imputation strategy for incomplete longitudinal data , 2001, Statistics in medicine.

[47]  Stef van Buuren,et al.  MICE: Multivariate Imputation by Chained Equations in R , 2011 .

[48]  J. Lieberman,et al.  Effectiveness of antipsychotic drugs in patients with chronic schizophrenia. , 2005, The New England journal of medicine.

[49]  David B. Dunson,et al.  Bayesian Data Analysis , 2010 .

[50]  Ian R White,et al.  Comparison of imputation and modelling methods in the analysis of a physical activity trial with missing outcomes. , 2004, International journal of epidemiology.

[51]  K. Usher,et al.  The metabolic syndrome and schizophrenia: the latest evidence and nursing guidelines for management. , 2006, Journal of psychiatric and mental health nursing.

[52]  John Van Hoewyk,et al.  A multivariate technique for multiply imputing missing values using a sequence of regression models , 2001 .

[53]  Donald B. Rubin,et al.  Nested multiple imputation of NMES via partially incompatible MCMC , 2003 .

[54]  J M Robins,et al.  Marginal Mean Models for Dynamic Regimes , 2001, Journal of the American Statistical Association.

[55]  Susan A. Murphy,et al.  A Conceptual Framework for Adaptive Preventive Interventions , 2004, Prevention Science.

[56]  D. Rubin,et al.  Ignorability and Coarse Data , 1991 .

[57]  Trevillore E. Raghunathan,et al.  IVEware: Imputation and Variance Estimation Software User Guide , 2002 .

[58]  Ree Dawson,et al.  Dynamic treatment regimes: practical design considerations , 2004, Clinical trials.

[59]  Andrew Gelman,et al.  Diagnostics for multivariate imputations , 2007 .

[60]  H. Sung,et al.  Evaluating multiple treatment courses in clinical trials. , 2000, Statistics in medicine.

[61]  S. Murphy,et al.  Optimal dynamic treatment regimes , 2003 .

[62]  J. Lieberman,et al.  Effectiveness of clozapine versus olanzapine, quetiapine, and risperidone in patients with chronic schizophrenia who did not respond to prior atypical antipsychotic treatment. , 2006, The American journal of psychiatry.

[63]  D. Rubin INFERENCE AND MISSING DATA , 1975 .