Matching using estimated propensity scores: relating theory to practice.

Matched sampling is a standard technique in the evaluation of treatments in observational studies. Matching on estimated propensity scores comprises an important class of procedures when there are numerous matching variables. Recent theoretical work (Rubin, D. B. and Thomas, N., 1992, The Annals of Statistics 20, 1079-1093) on affinely invariant matching methods with ellipsoidal distributions provides a general framework for evaluating the operating characteristics of such methods. Moreover, Rubin and Thomas (1992, Biometrika 79, 797-809) uses this framework to derive several analytic approximations under normality for the distribution of the first two moments of the matching variables in samples obtained by matching on estimated linear propensity scores. Here we provide a bridge between these theoretical approximations and actual practice. First, we complete and refine the nomal-based analytic approximations, thereby making it possible to apply these results to practice. Second, we perform Monte Carlo evaluations of the analytic results under normal and nonnormal ellipsoidal distributions, which confirm the accuracy of the analytic approximations, and demonstrate the predictable ways in which the approximations deviate from simulation results when normal assumptions are violated within the ellipsoidal family. Third, we apply the analytic approximations to real data with clearly nonellipsoidal distributions, and show that the theoretical expressions, although derived under artificial distributional conditions, produce useful guidance for practice. Our results delineate the wide range of settings in which matching on estimated linear propensity scores performs well, thereby providing useful information for the design of matching studies. When matching with a particular data set, our theoretical approximations provide benchmarks for expected performance under favorable conditions, thereby identifying matching variables requiring special treatment. After matching is complete and data analysis is at hand, our results provide the variances required to compute valid standard errors for common estimators.

[1]  Donald B. Rubin,et al.  Affinely Invariant Matching Methods with Ellipsoidal Distributions , 1992 .

[2]  Donald B. Rubin,et al.  MULTIVARIATE MATCHING METHODS THAT ARE EQUAL PERCENT BIAS REDUCING, I: SOME EXAMPLES , 1974 .

[3]  B. Efron The Efficiency of Logistic Regression Compared to Normal Discriminant Analysis , 1975 .

[4]  Ingram Olkin,et al.  Multivariate Correlation Models with Mixed Discrete and Continuous Variables , 1961 .

[5]  Donald B. Rubin,et al.  Multivariate matching methods that are equal percent bias reducing , 1974 .

[6]  D. Rubin,et al.  Constructing a Control Group Using Multivariate Matched Sampling Methods That Incorporate the Propensity Score , 1985 .

[7]  P. Rosenbaum Dropping out of High School in the United States: An Observational Study , 1986 .

[8]  Donald B. Rubin,et al.  In Utero Exposure to Phenobarbitol and Intelligence Deficits in Adult Men , 1996 .

[9]  D. Freedman,et al.  Asymptotics of Graphical Projection Pursuit , 1984 .

[10]  P. C. Chang,et al.  Robustness of Fisher's Linear Discriminant Function under Two-Component Mixed Normal Models , 1981 .

[11]  Paul R. Rosenbaum,et al.  Comparison of Multivariate Matching Methods: Structures, Distances, and Algorithms , 1993 .

[12]  Donald B. Rubin,et al.  Characterizing the effect of matching using linear propensity score methods with normal distributions , 1992 .

[13]  D. Rubin Matched Sampling for Causal Effects: The Use of Matched Sampling and Regression Adjustment to Remove Bias in Observational Studies , 1973 .

[14]  D. Rubin Bias Reduction Using Mahalanobis-Metric Matching , 1980 .

[15]  D. Rubin,et al.  The Bias Due to Incomplete Matching , 1985 .

[16]  G. Fisher,et al.  Skills acquisition among matched samples of institutionalized and community-based persons with mental retardation. , 1988, American journal of mental retardation : AJMR.

[17]  S. Kotz,et al.  Symmetric Multivariate and Related Distributions , 1989 .

[18]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .

[19]  D. Rubin,et al.  Using Multivariate Matched Sampling and Regression Adjustment to Control Bias in Observational Studies , 1978 .

[20]  W. G. Cochran,et al.  Controlling Bias in Observational Studies: A Review. , 1974 .

[21]  C. Drake Effects of misspecification of the propensity score on estimators of treatment effect , 1993 .

[22]  R L Kane,et al.  Improving Primary Care in Nursing Homes , 1991, Journal of the American Geriatrics Society.

[23]  Ker-Chau Li,et al.  Sliced Inverse Regression for Dimension Reduction , 1991 .