Imposing Minimax and Quantile Constraints on Optimal Matching in Observational Studies

ABSTRACT Modern methods construct a matched sample by minimizing the total cost of a flow in a network, finding a pairing of treated and control individuals that minimizes the sum of within-pair covariate distances subject to constraints that ensure distributions of covariates are balanced. In aggregate, these methods work well; however, they can exhibit a lack of interest in a small number of pairs with large covariate distances. Here, a new method is proposed for imposing a minimax constraint on a minimum total distance matching. Such a match minimizes the total within-pair distance subject to various constraints including the constraint that the maximum pair difference is as small as possible. In an example with 1391 matched pairs, this constraint eliminates dozens of pairs with moderately large differences in age, but otherwise exhibits the same excellent covariate balance found without this additional constraint. A minimax constraint eliminates edges in the network, and can improve the worst-case time bound for the performance of the minimum cost flow algorithm, that is, a better match from a practical perspective may take less time to construct. The technique adapts ideas for a different problem, the bottleneck assignment problem, whose sole objective is to minimize the maximum within-pair difference; however, here, that objective becomes a constraint on the minimum cost flow problem. The method generalizes. Rather than constrain the maximum distance, it can constrain an order statistic. Alternatively, the method can minimize the maximum difference in propensity scores, and subject to doing that, minimize the maximum robust Mahalanobis distance. An example from labor economics is used to illustrate. Supplementary materials for this article are available online.

[1]  N. Weller,et al.  Finding Pathways: Mixed-Method Research for Studying Causal Mechanisms , 2014 .

[2]  P. Rosenbaum A Characterization of Optimal Designs for Observational Studies , 1991 .

[3]  Samuel D. Pimentel,et al.  Large, Sparse Optimal Matching With Refined Covariate Balance in an Observational Study of the Health Outcomes Produced by New Surgeons , 2015, Journal of the American Statistical Association.

[4]  Elizabeth A Stuart,et al.  Matching methods for causal inference: A review and a look forward. , 2010, Statistical science : a review journal of the Institute of Mathematical Statistics.

[5]  Paul R. Rosenbaum,et al.  Matching for Balance, Pairing for Heterogeneity in an Observational Study of the Effectiveness of For-Profit and Not-For-Profit High Schools in Chile , 2014, 1404.3584.

[6]  Dylan S Small,et al.  Optimal Matching with Minimal Deviation from Fine Balance in a Study of Obesity and Surgical Outcomes , 2012, Biometrics.

[7]  Xinyi Xu,et al.  Optimal Nonbipartite Matching and Its Statistical Applications , 2011, The American statistician.

[8]  Dylan S. Small,et al.  Strong Control of the Familywise Error Rate in Observational Studies that Discover Effect Modification by Exploratory Methods , 2015 .

[9]  P. Rosenbaum,et al.  Characteristics AssociatedWith Differences in Survival Among Black andWhiteWomenWith Breast Cancer , 2013 .

[10]  P. Rosenbaum,et al.  Matching and thick description in an observational study of mortality after surgery. , 2001, Biostatistics.

[11]  D. Bertsekas,et al.  The relax codes for linear minimum cost network flow problems , 1988 .

[12]  Paul R. Rosenbaum,et al.  Optimal Matching for Observational Studies , 1989 .

[13]  Robert S. Garfinkel,et al.  Technical Note - An Improved Algorithm for the Bottleneck Assignment Problem , 1971, Oper. Res..

[14]  Paul R. Rosenbaum,et al.  Heterogeneity and Causality , 2005 .

[15]  Wei Wang,et al.  Template matching for auditing hospital cost and quality. , 2014, Health services research.

[16]  Frank Yoon,et al.  Variable-ratio matching with fine balance in a study of the Peer Health Exchange. , 2015, Statistics in medicine.

[17]  Elizabeth,et al.  Matching Methods for Causal Inference , 2007 .

[18]  Jeffrey H Silber,et al.  Anesthesia technique, mortality, and length of stay after hip fracture surgery. , 2014, JAMA.

[19]  P. Rosenbaum Design of Observational Studies , 2009, Springer Series in Statistics.

[20]  Jan C. van Ours,et al.  How Changes in Financial Incentives Affect the Duration of Unemployment , 2004, SSRN Electronic Journal.

[21]  Mauro Dell'Amico,et al.  Assignment Problems , 1998, IFIP Congress: Fundamentals - Foundations of Computer Science.

[22]  J. Zubizarreta Journal of the American Statistical Association Using Mixed Integer Programming for Matching in an Observational Study of Kidney Failure after Surgery Using Mixed Integer Programming for Matching in an Observational Study of Kidney Failure after Surgery , 2022 .

[23]  B. Hansen,et al.  Optimal Full Matching and Related Designs via Network Flows , 2006 .

[24]  Ming-Yang Kao,et al.  A Decomposition Theorem for Maximum Weight Bipartite Matchings , 2000, SIAM J. Comput..

[25]  Jeffrey H Silber,et al.  Characteristics associated with differences in survival among black and white women with breast cancer. , 2013, JAMA.

[26]  Gary King,et al.  Matching Methods for Causal Inference , 2011 .

[27]  Dylan S. Small,et al.  Effect Modification and Design Sensitivity in Observational Studies , 2013 .