Assessing lack of common support in causal inference using bayesian nonparametrics: Implications for evaluating the effect of breastfeeding on children's cognitive outcomes

Causal inference in observational studies typically requires making comparisons between groups that are dissimilar. For instance, researchers investigating the role of a prolonged duration of breastfeeding on child outcomes may be forced to make comparisons between women with substantially different characteristics on average. In the extreme there may exist neighborhoods of the covariate space where there are not sufficient numbers of both groups of women (those who breastfed for prolonged periods and those who did not) to make inferences about those women. This is referred to as lack of common support. Problems can arise when we try to estimate causal effects for units that lack common support, thus we may want to avoid inference for such units. If ignorability is satisfied with respect to a set of potential confounders, then identifying whether, or for which units, the common support assumption holds is an empirical question. However, in the high-dimensional covariate space often required to satisfy ignorability such identification may not be trivial. Existing methods used to address this problem often require reliance on parametric assumptions and most, if not all, ignore the information embedded in the response variable. We distinguish between the concepts of “common support” and “common causal support.” We propose a new approach for identifying common causal support that addresses some of the shortcomings of existing methods. We motivate and illustrate the approach using data from the National Longitudinal Survey of Youth to estimate the effect of breastfeeding at least nine months on reading and math achievement scores at age five or six. We also evaluate the comparative performance of this method in hypothetical examples and simulations where the true treatment effect is known.

[1]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[2]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .

[3]  P. Rosenbaum The Consequences of Adjustment for a Concomitant Variable that Has Been Affected by the Treatment , 1984 .

[4]  Deborah A. Phillips,et al.  Children of the National Longitudinal Survey of Youth: A Unique Research Opportunity. , 1991 .

[5]  Petra E. Todd,et al.  Matching As An Econometric Evaluation Estimator: Evidence from Evaluating a Job Training Programme , 1997 .

[6]  Petra E. Todd,et al.  Matching As An Econometric Evaluation Estimator , 1998 .

[7]  James W. Anderson,et al.  Breast-feeding and cognitive development: a meta-analysis. , 1999, The American journal of clinical nutrition.

[8]  Denise L. Drane,et al.  A critical evaluation of the evidence on the association between type of infant feeding and cognitive development. , 2000, Paediatric and perinatal epidemiology.

[9]  J. Concato,et al.  How good is the evidence linking breastfeeding and intelligence? , 2002, Pediatrics.

[10]  K. Michaelsen,et al.  The association between duration of breastfeeding and adult intelligence. , 2002, JAMA.

[11]  B. Sianesi,et al.  PSMATCH2: Stata module to perform full Mahalanobis and propensity score matching, common support graphing, and covariate imbalance testing , 2003 .

[12]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[13]  Markus Frlich,et al.  Finite-Sample Properties of Propensity-Score Matching and Weighting Estimators , 2004, Review of Economics and Statistics.

[14]  D. Rubin Using Propensity Scores to Help Design Observational Studies: Application to the Tobacco Litigation , 2001, Health Services and Outcomes Research Methodology.

[15]  G. Imbens Nonparametric Estimation of Average Treatment Effects Under Exogeneity: A Review , 2004 .

[16]  D. McCaffrey,et al.  Propensity score estimation with boosted regression for evaluating causal effects in observational studies. , 2004, Psychological methods.

[17]  J. Robins,et al.  Results of multivariable logistic regression, propensity matching, propensity adjustment, and propensity-based weighting under conditions of nonuniform effect. , 2006, American journal of epidemiology.

[18]  H. Chipman,et al.  Bayesian Additive Regression Trees , 2006 .

[19]  S. Morgan,et al.  Matching Estimators of Causal Effects , 2006 .

[20]  J. Avorn,et al.  Variable selection for propensity score models. , 2006, American journal of epidemiology.

[21]  Edward I. George,et al.  Bayesian Ensemble Learning , 2006, NIPS.

[22]  Debbie A Lawlor,et al.  Early life predictors of childhood intelligence: findings from the Mater-University study of pregnancy and its outcomes. , 2006, Paediatric and perinatal epidemiology.

[23]  I. Deary,et al.  Effect of breast feeding on intelligence in children: prospective study, sibling pairs analysis, and meta-analysis , 2006, BMJ : British Medical Journal.

[24]  Effect of breast feeding on intelligence in children: prospective study, sibling pairs analysis, and meta-analysis , 2008 .

[25]  B. Hansen The prognostic analogue of the propensity score , 2008 .

[26]  Robert W Platt,et al.  Breastfeeding and child cognitive development: new evidence from a large randomized trial. , 2008, Archives of general psychiatry.

[27]  Jerome P. Reiter,et al.  Estimation of propensity scores using generalized additive models , 2008, Statistics in medicine.

[28]  Richard K. Crump,et al.  Dealing with limited overlap in estimation of average treatment effects , 2009 .

[29]  G. Gökçay Breastfeeding and child cognitive development. , 2010, Child: care, health and development.

[30]  H. Chipman,et al.  BART: Bayesian Additive Regression Trees , 2008, 0806.3286.

[31]  B. Strandvik,et al.  Early behaviour and development in breast-fed premature infants are influenced by omega-6 and omega-3 fatty acid status. , 2010, Early human development.

[32]  Jennifer L. Hill,et al.  Bayesian Nonparametric Modeling for Causal Inference , 2011 .

[33]  Gary King,et al.  MatchIt: Nonparametric Preprocessing for Parametric Causal Inference , 2011 .

[34]  Christopher Weiss,et al.  Challenges With Propensity Score Strategies in a High-Dimensional Setting and a Potential Alternative , 2011, Multivariate behavioral research.

[35]  Ben Kelcey Covariate Selection in Propensity Scores Using Outcome Proxies , 2011, Multivariate behavioral research.

[36]  D. Green,et al.  Modeling Heterogeneous Treatment Effects in Survey Experiments with Bayesian Additive Regression Trees , 2012 .

[37]  Greg Ridgeway,et al.  Toolkit for Weighting and Analysis of Nonequivalent Groups , 2014 .

[38]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[39]  J. Haukoos,et al.  The Propensity Score. , 2015, JAMA.

[40]  Evaluation of training programs , 2015 .