Propensity score techniques and the assessment of measured covariate balance to test causal associations in psychological research.

There is considerable interest in using propensity score (PS) statistical techniques to address questions of causal inference in psychological research. Many PS techniques exist, yet few guidelines are available to aid applied researchers in their understanding, use, and evaluation. In this study, the authors give an overview of available techniques for PS estimation and PS application. They also provide a way to help compare PS techniques, using the resulting measured covariate balance as the criterion for selecting between techniques. The empirical example for this study involves the potential causal relationship linking early-onset cannabis problems and subsequent negative mental health outcomes and uses data from a prospective cohort study. PS techniques are described and evaluated on the basis of their ability to balance the distributions of measured potentially confounding covariates for individuals with and without early-onset cannabis problems. This article identifies the PS techniques that yield good statistical balance of the chosen measured covariates within the context of this particular research question and cohort.

[1]  Richard E. Tremblay,et al.  Preventing antisocial behavior: Interventions from birth through adolescence. , 1993 .

[2]  Sanford Weisberg,et al.  Computing science and statistics : proceedings of the 30th Symposium on the Interface, Minneapolis, Minnesota, May 13-16, 1998 : dimension reduction, computational complexity and information , 1998 .

[3]  Frank E. Harrell,et al.  Regression Modeling Strategies: With Applications to Linear Models, Logistic Regression, and Survival Analysis , 2001 .

[4]  P. Rosenbaum,et al.  Sensitivity analysis for matched case-control studies. , 1991, Biometrics.

[5]  George W. Rebok,et al.  Building developmental and etiological theory through epidemiologically based preventive intervention trials. , 1992 .

[6]  Dc Washington Diagnostic and Statistical Manual of Mental Disorders, 4th Ed. , 1994 .

[7]  D B Rubin,et al.  Matching using estimated propensity scores: relating theory to practice. , 1996, Biometrics.

[8]  J. Anthony,et al.  Developmental epidemiologically based preventive trials: Baseline modeling of early target behaviors and depressive symptoms , 1991, American journal of community psychology.

[9]  D. Rubin For objective causal inference, design trumps analysis , 2008, 0811.1640.

[10]  J. Graham,et al.  Missing data analysis: making it work in the real world. , 2009, Annual review of psychology.

[11]  Gary King,et al.  Misunderstandings between experimentalists and observationalists about causal inference , 2008 .

[12]  E. Stuart,et al.  Using full matching to estimate causal effects in nonexperimental studies: examining the relationship between adolescent marijuana use and adult outcomes. , 2008, Developmental psychology.

[13]  Elizabeth A Stuart,et al.  Matching methods for causal inference: A review and a look forward. , 2010, Statistical science : a review journal of the Institute of Mathematical Statistics.

[14]  Petra E. Todd,et al.  Matching As An Econometric Evaluation Estimator: Evidence from Evaluating a Job Training Programme , 1997 .

[15]  D. Rubin,et al.  Estimating and Using Propensity Scores with Partially Missing Data , 2000 .

[16]  D. McCaffrey,et al.  Propensity score estimation with boosted regression for evaluating causal effects in observational studies. , 2004, Psychological methods.

[17]  J. Robins,et al.  Results of multivariable logistic regression, propensity matching, propensity adjustment, and propensity-based weighting under conditions of nonuniform effect. , 2006, American journal of epidemiology.

[18]  Herbert L. Smith 6. Matching with Multiple Controls to Estimate Treatment Effects in Observational Studies , 1997 .

[19]  B. Hansen Full Matching in an Observational Study of Coaching for the SAT , 2004 .

[20]  Christopher Winship,et al.  Counterfactuals and Causal Inference: Methods and Principles for Social Research , 2007 .

[21]  D. Rubin Using Propensity Scores to Help Design Observational Studies: Application to the Tobacco Litigation , 2001, Health Services and Outcomes Research Methodology.

[22]  D. Rubin,et al.  Constructing a Control Group Using Multivariate Matched Sampling Methods That Incorporate the Propensity Score , 1985 .

[23]  J. Lunceford,et al.  Stratification and weighting via the propensity score in estimation of causal treatment effects: a comparative study , 2004, Statistics in medicine.

[24]  D. Rubin,et al.  Reducing Bias in Observational Studies Using Subclassification on the Propensity Score , 1984 .

[25]  Donald B Rubin,et al.  On principles for modeling propensity scores in medical research , 2004, Pharmacoepidemiology and drug safety.

[26]  Kosuke Imai,et al.  Causal Inference With General Treatment Regimes , 2004 .

[27]  I. Lipkovich,et al.  Propensity score estimation with missing values using a multiple imputation missingness pattern (MIMP) approach , 2009, Statistics in medicine.

[28]  D. Rubin,et al.  Assessing Sensitivity to an Unobserved Binary Covariate in an Observational Study with Binary Outcome , 1983 .

[29]  Gary King,et al.  Matching as Nonparametric Preprocessing for Reducing Model Dependence in Parametric Causal Inference , 2007, Political Analysis.

[30]  D. Berry,et al.  Statistical models in epidemiology, the environment, and clinical trials , 2000 .

[31]  D S Cordray,et al.  Evaluation methods for social intervention. , 2000, Annual review of psychology.

[32]  M. Aldenderfer,et al.  Cluster Analysis. Sage University Paper Series On Quantitative Applications in the Social Sciences 07-044 , 1984 .

[33]  James M. Robins,et al.  Marginal Structural Models versus Structural nested Models as Tools for Causal inference , 2000 .

[34]  Wei Lang,et al.  Examining the Impact of Missing Data on Propensity Score Estimation in Determining the Effectiveness of Self-Monitoring of Blood Glucose (SMBG) , 2001, Health Services and Outcomes Research Methodology.

[35]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[36]  J. Robins,et al.  Marginal Structural Models and Causal Inference in Epidemiology , 2000, Epidemiology.

[37]  Peter M. Steiner,et al.  Can Nonrandomized Experiments Yield Accurate Answers? A Randomized Experiment Comparing Random and Nonrandom Assignments , 2008 .

[38]  D. Rubin,et al.  Combining Propensity Score Matching with Additional Adjustments for Prognostic Covariates , 2000 .

[39]  J. Schafer,et al.  Average causal effects from nonrandomized studies: a practical guide and simulated example. , 2008, Psychological methods.

[40]  W. G. Cochran,et al.  Controlling Bias in Observational Studies: A Review. , 1974 .

[41]  David Madigan,et al.  Discussion of "Least angle regression" by Efron et al , 2004 .

[42]  Petra E. Todd,et al.  Matching As An Econometric Evaluation Estimator , 1998 .

[43]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[44]  P. Rosenbaum Design of Observational Studies , 2009, Springer Series in Statistics.

[45]  J. Anthony,et al.  Incidence of drug problems in young adults exposed to trauma and posttraumatic stress disorder: do early life experiences and predispositions matter? , 2007, Archives of general psychiatry.

[46]  Jake Bowers,et al.  Covariate balance in simple stratified and clustered comparative studies , 2008, 0808.3857.

[47]  J. Anthony,et al.  Targeting early antecedents to prevent tobacco smoking: findings from an epidemiologically based randomized field trial. , 1998, American journal of public health.

[48]  Paul R Rosenbaum,et al.  Combining propensity score matching and group-based trajectory analysis in an observational study. , 2007, Psychological methods.

[49]  G. Imbens,et al.  Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score , 2000 .

[50]  Paul R. Rosenbaum,et al.  Comparison of Multivariate Matching Methods: Structures, Distances, and Algorithms , 1993 .

[51]  J. Anthony,et al.  Drug dependence enviromics: job strain in the work environment and risk of becoming drug-dependent. , 2006, American journal of epidemiology.

[52]  Juwon Song,et al.  Handling Baseline Differences and Missing Items in a Longitudinal Study of HIV Risk Among Runaway Youths , 2001, Health Services and Outcomes Research Methodology.

[53]  C. Klaassen,et al.  Discussion to "Inference for semiparametric models: some questions and an answer" by Peter J. Bickel and Jaimyoung Kwon , 2001 .

[54]  W. G. Cochran The effectiveness of adjustment by subclassification in removing bias in observational studies. , 1968, Biometrics.

[55]  Donald B. Rubin,et al.  BEST PRACTICES IN QUASI- EXPERIMENTAL DESIGNS Matching Methods for Causal Inference , 2007 .

[56]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[57]  G. Casella,et al.  Statistical Inference , 2003, Encyclopedia of Social Network Analysis and Mining.

[58]  Michael Rosenblum,et al.  Marginal Structural Models , 2011 .

[59]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .

[60]  G. W. Snedecor Statistical Methods , 1964 .