Using latent outcome trajectory classes in causal inference.

In longitudinal studies, outcome trajectories can provide important information about substantively and clinically meaningful underlying subpopulations who may also respond differently to treatments or interventions. Growth mixture analysis is an efficient way of identifying heterogeneous trajectory classes. However, given its exploratory nature, it is unclear how involvement of latent classes should be handled in the analysis when estimating causal treatment effects. In this paper, we propose a 2-step approach, where formulation of trajectory strata and identification of causal effects are separated. In Step 1, we stratify individuals in one of the assignment conditions (reference condition) into trajectory strata on the basis of growth mixture analysis. In Step 2, we estimate treatment effects for different trajectory strata, treating the stratum membership as partly known (known for individuals assigned to the reference condition and missing for the rest). The results can be interpreted as how subpopulations that differ in terms of outcome prognosis under one treatment condition would change their prognosis differently when exposed to another treatment condition. Causal effect estimation in Step 2 is consistent with that in the principal stratification approach (Frangakis and Rubin, 2002) in the sense that clarified identifying assumptions can be employed and therefore systematic sensitivity analyses are possible. Longitudinal development of attention deficit among children from the Johns Hopkins School Intervention Trial (Ialongo et al., 1999) will be presented as an example.

[1]  B. Muthén,et al.  Estimating drug effects in the presence of placebo response: Causal inference using growth mixture modeling , 2009, Statistics in medicine.

[2]  Michael R Elliott,et al.  Longitudinal Nested Compliance Class Model in the Presence of Time-Varying Noncompliance , 2008 .

[3]  B. Jo,et al.  Bias Mechanisms in Intention-to-Treat Analysis With Data Subject to Treatment Noncompliance and Missing Outcomes , 2008, Journal of educational and behavioral statistics : a quarterly publication sponsored by the American Educational Research Association and the American Statistical Association.

[4]  Joseph W Hogan,et al.  Principal stratification with predictors of compliance for randomized trials with 2 active treatments. , 2008, Biostatistics.

[5]  Tihomir Asparouhov,et al.  Cluster randomized trials with treatment noncompliance. , 2008, Psychological methods.

[6]  Donald B Rubin,et al.  Principal Stratification for Causal Inference With Extended Partial Compliance , 2008 .

[7]  Michael E. Sobel,et al.  What Do Randomized Studies of Housing Mobility Demonstrate? , 2006 .

[8]  Dylan S. Small,et al.  Defining and Estimating Intervention Effects for Groups that will Develop an Auxiliary Outcome , 2006, math/0609457.

[9]  Karen Bandeen-Roche,et al.  Residual Diagnostics for Growth Mixture Models , 2005 .

[10]  G. Imbens,et al.  Analyzing a randomized trial on breast self-examination with noncompliance and missing outcomes. , 2004, Biostatistics.

[11]  Nicole A. Lazar,et al.  Statistical Analysis With Missing Data , 2003, Technometrics.

[12]  Booil Jo,et al.  Estimation of Intervention Effects with Noncompliance: Alternative Model Specifications , 2002 .

[13]  Xiao-Hua Zhou,et al.  Clustered encouragement designs with individual noncompliance: bayesian inference with randomization, and application to advance directive forms. , 2002, Biostatistics.

[14]  D. Rubin,et al.  Principal Stratification in Causal Inference , 2002, Biometrics.

[15]  Katherine E. Masyn,et al.  General growth mixture modeling for randomized preventive interventions. , 2001, Biostatistics.

[16]  K. Roeder,et al.  A SAS Procedure Based on Mixture Models for Estimating Developmental Trajectories , 2001 .

[17]  N. Ialongo,et al.  Proximal Impact of Two First-Grade Preventive Interventions on the Early Risk Behaviors for Later Substance Abuse, Depression, and Antisocial Behavior , 1999, American journal of community psychology.

[18]  Daniel S. Nagin,et al.  Analyzing developmental trajectories: A semiparametric, group-based approach , 1999 .

[19]  B. Muthén,et al.  Finite Mixture Modeling with Mixture Outcomes Using the EM Algorithm , 1999, Biometrics.

[20]  D. Rubin,et al.  Addressing complications of intention-to-treat analysis in the combined presence of all-or-none treatment-noncompliance and subsequent missing outcomes , 1999 .

[21]  M. A. Tanner,et al.  Tools for Statistical Inference: Methods for the Exploration of Posterior Distributions and Likelihood Functions, 3rd Edition , 1998 .

[22]  B. Everitt,et al.  Analysis of longitudinal data , 1998, British Journal of Psychiatry.

[23]  Scott L. Zeger,et al.  Latent Variable Regression for Multiple Discrete Outcomes , 1997 .

[24]  G. McLachlan,et al.  The EM algorithm and extensions , 1996 .

[25]  Joshua D. Angrist,et al.  Identification of Causal Effects Using Instrumental Variables , 1993 .

[26]  Anthony S. Bryk,et al.  Hierarchical Linear Models: Applications and Data Analysis Methods , 1992 .

[27]  S. Kellam,et al.  Effect of first-grade classroom environment on shy behavior, aggressive behavior, and concentration problems , 1991, American journal of community psychology.

[28]  D. Rubin [On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9.] Comment: Neyman (1923) and Causal Inference in Experiments and Observational Studies , 1990 .

[29]  T. Speed,et al.  On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9 , 1990 .

[30]  William Meredith,et al.  Latent curve analysis , 1990 .

[31]  P. Holland Statistics and Causal Inference , 1985 .

[32]  J. Ware,et al.  Random-effects models for longitudinal data. , 1982, Biometrics.

[33]  D. Basu Randomization Analysis of Experimental Data: The Fisher Randomization Test , 1980 .

[34]  D. Basu,et al.  Randomization Analysis of Experimental Data: The Fisher Randomization Test Rejoinder , 1980 .

[35]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[36]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[37]  D. Rubin Estimating causal effects of treatments in randomized and nonrandomized studies. , 1974 .

[38]  Donald B. Rubin,et al.  Comment : Neyman ( 1923 ) and Causal Inference in Experiments and Observational Studies , 2007 .

[39]  Bengt Muthén,et al.  Latent Variable Analysis: Growth Mixture Modeling and Related Techniques for Longitudinal Data , 2004 .

[40]  David Kaplan,et al.  The Sage handbook of quantitative methodology for the social sciences , 2004 .

[41]  Donald B. Rubin,et al.  Bayesian Inference for Causal Effects: The Role of Randomization , 1978 .