Balancing Covariates via Propensity Score Weighting

ABSTRACT Covariate balance is crucial for unconfounded descriptive or causal comparisons. However, lack of balance is common in observational studies. This article considers weighting strategies for balancing covariates. We define a general class of weights—the balancing weights—that balance the weighted distributions of the covariates between treatment groups. These weights incorporate the propensity score to weight each group to an analyst-selected target population. This class unifies existing weighting methods, including commonly used weights such as inverse-probability weights as special cases. General large-sample results on nonparametric estimation based on these weights are derived. We further propose a new weighting scheme, the overlap weights, in which each unit’s weight is proportional to the probability of that unit being assigned to the opposite group. The overlap weights are bounded, and minimize the asymptotic variance of the weighted average treatment effect among the class of balancing weights. The overlap weights also possess a desirable small-sample exact balance property, based on which we propose a new method that achieves exact balance for means of any selected set of covariates. Two applications illustrate these methods and compare them with other approaches.

[1]  D. Horvitz,et al.  A Generalization of Sampling Without Replacement from a Finite Universe , 1952 .

[2]  D. Rubin Estimating causal effects of treatments in randomized and nonrandomized studies. , 1974 .

[3]  Donald B. Rubin,et al.  Bayesian Inference for Causal Effects: The Role of Randomization , 1978 .

[4]  D. Rubin,et al.  Using Multivariate Matched Sampling and Regression Adjustment to Control Bias in Observational Studies , 1978 .

[5]  D. Rubin Randomization Analysis of Experimental Data: The Fisher Randomization Test Comment , 1980 .

[6]  D. Basu Randomization Analysis of Experimental Data: The Fisher Randomization Test , 1980 .

[7]  D. Basu,et al.  Randomization Analysis of Experimental Data: The Fisher Randomization Test Rejoinder , 1980 .

[8]  D. Hosmer,et al.  Goodness of fit tests for the multiple logistic regression model , 1980 .

[9]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .

[10]  D. Rubin Statistics and Causal Inference: Comment: Which Ifs Have Causal Answers , 1986 .

[11]  P. Rosenbaum Model-Based Direct Adjustment , 1987 .

[12]  C. Manski Nonparametric Bounds on Treatment Effects , 1989 .

[13]  W. Knaus,et al.  Background for SUPPORT. , 1990, Journal of clinical epidemiology.

[14]  SUPPORT: Study to understand prognoses and preferences for outcomes and risks of treatments. Study design. , 1990, Journal of clinical epidemiology.

[15]  J. Robins,et al.  Semiparametric Efficiency in Multivariate Regression Models with Missing Data , 1995 .

[16]  J. Robins,et al.  Analysis of semiparametric regression models for repeated outcomes in the presence of missing data , 1995 .

[17]  William A. Knaus,et al.  The effectiveness of right heart catheterization in the initial care of critically ill patients. SUPPORT Investigators. , 1996, Journal of the American Medical Association (JAMA).

[18]  J. Hahn On the Role of the Propensity Score in Efficient Semiparametric Estimation of Average Treatment Effects , 1998 .

[19]  Kosuke Imai,et al.  Survey Sampling , 1998, Nov/Dec 2017.

[20]  G. Imbens The Role of the Propensity Score in Estimating Dose-Response Functions , 1999 .

[21]  J. Robins,et al.  Marginal Structural Models and Causal Inference in Epidemiology , 2000, Epidemiology.

[22]  G. Imbens,et al.  Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score , 2000 .

[23]  A M Zaslavsky,et al.  Racial disparity in influenza vaccination: does managed care narrow the gap between African Americans and whites? , 2001, JAMA.

[24]  G. Imbens,et al.  Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score , 2002 .

[25]  Guido W. Imbens,et al.  EFFICIENT ESTIMATION OF AVERAGE TREATMENT EFFECTS , 2003 .

[26]  R. Little,et al.  Inference for the Population Total from Probability-Proportional-to-Size Samples Based on Predictions from a Penalized Spline Nonparametric Model , 2003 .

[27]  G. Imbens,et al.  Estimation of Causal Effects using Propensity Score Weighting: An Application to Data on Right Heart Catheterization , 2001, Health Services and Outcomes Research Methodology.

[28]  Kosuke Imai,et al.  Causal Inference With General Treatment Regimes , 2004 .

[29]  Richard K. Crump,et al.  Moving the Goalposts: Addressing Limited Overlap in Estimation of Average Treatment Effects by Changing the Estimand , 2006, SSRN Electronic Journal.

[30]  Andrew Gelman,et al.  Struggles with survey weighting and regression modeling , 2007, 0710.5005.

[31]  B. Graham,et al.  Inverse Probability Tilting for Moment Condition Models with Missing Data , 2008 .

[32]  D. Rubin For objective causal inference, design trumps analysis , 2008, 0811.1640.

[33]  N. Keating,et al.  Financial Status, Employment, and Insurance Among Older Cancer Survivors , 2009, Journal of General Internal Medicine.

[34]  Richard K. Crump,et al.  Dealing with limited overlap in estimation of average treatment effects , 2009 .

[35]  A. Zaslavsky,et al.  Comparing methods of racial and ethnic disparities measurement across different settings of mental health care. , 2010, Health services research.

[36]  Dylan S. Small,et al.  Defining the Study Population for an Observational Study to Ensure Sufficient Overlap: A Tree Approach , 2011 .

[37]  Paul R. Rosenbaum,et al.  Optimal Matching of an Optimally Chosen Subset in Observational Studies , 2012 .

[38]  Jens Hainmueller,et al.  Entropy Balancing for Causal Effects: A Multivariate Reweighting Method to Produce Balanced Samples in Observational Studies , 2012, Political Analysis.

[39]  Michael E Chernew,et al.  Changes in health care spending and quality for Medicare beneficiaries associated with a commercial ACO contract. , 2013, JAMA.

[40]  Fan Li,et al.  Propensity score weighting with multilevel data , 2013, Statistics in medicine.

[41]  Liang Li,et al.  A Weighting Analogue to Pair Matching in Propensity Score Analysis , 2013, The international journal of biostatistics.

[42]  K. Imai,et al.  Covariate balancing propensity score , 2014 .

[43]  Matías Busso,et al.  New Evidence on the Finite Sample Properties of Propensity Score Reweighting and Matching Estimators , 2014, Review of Economics and Statistics.

[44]  Megan S. Schuler,et al.  Generalizing observational study results: applying propensity score methods to complex surveys. , 2014, Health services research.

[45]  C. Blumberg Causal Inference for Statistics, Social, and Biomedical Sciences: An Introduction , 2016 .

[46]  Elaine L. Zanutto A Comparison of Propensity Score and Linear Regression Analysis of Complex Survey Data , 2021, Journal of Data Science.