Propensity Scores

Propensity score analysis is a relatively recent statistical innovation that is useful in the analysis of data from quasi-experiments. The goal of propensity score analysis is to balance two non-equivalent groups on observed covariates to get more accurate estimates of the effects of a treatment on which the two groups differ. This article presents a general introduction to propensity score analysis, provides an example using data from a quasi-experiment compared to a benchmark randomized experiment, offers practical advice about how to do such analyses, and discusses some limitations of the approach. It also presents the first detailed instructions to appear in the literature on how to use classification tree analysis and bagging for classification trees in the construction of propensity scores. The latter two examples serve as an introduction for researchers interested in computing propensity scores using more complex classification algorithms known as ensemble methods.

[1]  B. Sianesi An Evaluation of the Swedish System of Active Labor Market Programs in the 1990s , 2004, Review of Economics and Statistics.

[2]  C. Drake Effects of misspecification of the propensity score on estimators of treatment effect , 1993 .

[3]  Richard A. Berk,et al.  Does arrest really deter wife battery? An effort to replicate the findings of the Minneapolis Spouse Abuse Experiment. , 1985 .

[4]  Francesca Dominici,et al.  Application of a propensity score approach for risk adjustment in profiling multiple physician groups on asthma care. , 2005, Health services research.

[5]  Rajeev Dehejia,et al.  Propensity Score-Matching Methods for Nonexperimental Causal Studies , 2002, Review of Economics and Statistics.

[6]  Paul R. Rosenbaum,et al.  Constructing Matched Sets and Strata , 1995 .

[7]  J. Friedman Special Invited Paper-Additive logistic regression: A statistical view of boosting , 2000 .

[8]  Roberto Agodini,et al.  Are Experiments the Only Option? A Look at Dropout Prevention Programs , 2004, Review of Economics and Statistics.

[9]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .

[10]  Zhong Zhao,et al.  Using Matching to Estimate Treatment Effects: Data Requirements, Matching Metrics, and Monte Carlo Evidence , 2004, Review of Economics and Statistics.

[11]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[12]  W. G. Cochran The effectiveness of adjustment by subclassification in removing bias in observational studies. , 1968, Biometrics.

[13]  T. Shakespeare,et al.  Observational Studies , 2003 .

[14]  Howard S. Bloom,et al.  Can Propensity-Score Methods Match the Findings from a Random Assignment Evaluation of Mandatory Welfare-to-Work Programs? , 2004, Review of Economics and Statistics.

[15]  D. Rubin,et al.  Reducing Bias in Observational Studies Using Subclassification on the Propensity Score , 1984 .

[16]  Donald B. Rubin,et al.  Affinely Invariant Matching Methods with Ellipsoidal Distributions , 1992 .

[17]  G. Imbens The Role of the Propensity Score in Estimating Dose-Response Functions , 1999 .

[18]  William A. Knaus,et al.  The effectiveness of right heart catheterization in the initial care of critically ill patients. SUPPORT Investigators. , 1996, Journal of the American Medical Association (JAMA).

[19]  Mark W. Lipsey,et al.  Evaluation: A Systematic Approach , 1979 .

[20]  D. Ruppert The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .

[21]  Leona S. Aiken,et al.  Comparison of a Randomized and Two Quasi-Experimental Designs in a Single Outcome Evaluation , 1998 .

[22]  Richard A. Berk,et al.  Thinking about program evaluation , 1990 .

[23]  Steven Glazerman,et al.  Nonexperimental Versus Experimental Estimates of Earnings Impacts , 2003 .

[24]  G. Imbens,et al.  Estimation of Causal Effects using Propensity Score Weighting: An Application to Data on Right Heart Catheterization , 2001, Health Services and Outcomes Research Methodology.

[25]  Herbert L. Smith 6. Matching with Multiple Controls to Estimate Treatment Effects in Observational Studies , 1997 .

[26]  Petra E. Todd,et al.  Evaluating Preschool Programs When Length of Exposure to the Program Varies: A Nonparametric Approach , 2000, Review of Economics and Statistics.

[27]  M. Fine,et al.  Propensity score adjustment for pretreatment differences between hospitalized and ambulatory patients with community-acquired pneumonia. Pneumonia Patient Outcomes Research Team (PORT) Investigators. , 1995, Medical care.

[28]  P. Rosenbaum,et al.  Invited commentary: propensity scores. , 1999, American journal of epidemiology.

[29]  Markus Frlich,et al.  Finite-Sample Properties of Propensity-Score Matching and Weighting Estimators , 2004, Review of Economics and Statistics.

[30]  Elaine L. Zanutto,et al.  Matching With Doses in an Observational Study of a Media Campaign Against Drug Abuse , 2001, Journal of the American Statistical Association.

[31]  W. Shadish,et al.  Experimental and Quasi-Experimental Designs for Generalized Causal Inference , 2001 .

[32]  Paul R. Rosenbaum,et al.  Comparison of Multivariate Matching Methods: Structures, Distances, and Algorithms , 1993 .

[33]  Paul R. Rosenbaum,et al.  Overt Bias in Observational Studies , 2002 .

[34]  Robert E. Schapire,et al.  A Brief Introduction to Boosting , 1999, IJCAI.

[35]  P. Rosenbaum Discussing hidden bias in observational studies. , 1991, Annals of internal medicine.

[36]  Peter H. Rossi,et al.  Examining Ongoing Programs: A Chronological Perspective , 1999 .

[37]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[38]  Allan R. Wilks,et al.  The new S language: a programming environment for data analysis and graphics , 1988 .

[39]  G. Imbens Nonparametric Estimation of Average Treatment Effects Under Exogeneity: A Review , 2004 .

[40]  Roderick J. A. Little,et al.  Projecting From Advance Data Using Propensity Modeling: An Application to Income and Tax Statistics , 1992 .

[41]  Petra E. Todd,et al.  Matching As An Econometric Evaluation Estimator: Evidence from Evaluating a Job Training Programme , 1997 .

[42]  P. Rosenbaum,et al.  Sensitivity analysis for matched case-control studies. , 1991, Biometrics.

[43]  D. Rubin,et al.  Constructing a Control Group Using Multivariate Matched Sampling Methods That Incorporate the Propensity Score , 1985 .

[44]  Alison Snow Jones,et al.  Assessing the Effect of Batterer Program Completion on Reassault Using Propensity Scores , 2004, Journal of interpersonal violence.

[45]  P. Rosenbaum Dropping out of High School in the United States: An Observational Study , 1986 .

[46]  Paul R. Rosenbaum,et al.  Sensitivity to Hidden Bias , 2002 .

[47]  M. Lechner Program Heterogeneity and Propensity Score Matching: An Application to the Evaluation of Active Labor Market Policies , 2002, Review of Economics and Statistics.

[48]  Jinyong Hahn,et al.  When to Control for Covariates? Panel Asymptotics for Estimates of Treatment Effects , 2004, Review of Economics and Statistics.