An Automatic Finite-Sample Robustness Metric: When Can Dropping a Little Data Make a Big Difference?

Study samples often differ from the target populations of inference and policy decisions in non-random ways. Researchers typically believe that such departures from random sampling -- due to changes in the population over time and space, or difficulties in sampling truly randomly -- are small, and their corresponding impact on the inference should be small as well. We might therefore be concerned if the conclusions of our studies are excessively sensitive to a very small proportion of our sample data. We propose a method to assess the sensitivity of applied econometric conclusions to the removal of a small fraction of the sample. Manually checking the influence of all possible small subsets is computationally infeasible, so we use an approximation to find the most influential subset. Our metric, the"Approximate Maximum Influence Perturbation,"is based on the classical influence function, and is automatically computable for common methods including (but not limited to) OLS, IV, MLE, GMM, and variational Bayes. We provide finite-sample error bounds on approximation performance. At minimal extra cost, we provide an exact finite-sample lower bound on sensitivity. We find that sensitivity is driven by a signal-to-noise ratio in the inference problem, is not reflected in standard errors, does not disappear asymptotically, and is not due to misspecification. While some empirical applications are robust, results of several influential economics papers can be overturned by removing less than 1% of the sample.

[1]  Empirical processes and weak convergence , 2020, Probability and Stochastic Processes.

[2]  Lester Mackey,et al.  Approximate Cross-validation: Guarantees for Model Assessment and Selection , 2020, AISTATS.

[3]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[4]  Michael I. Jordan,et al.  A Higher-Order Swiss Army Infinitesimal Jackknife , 2019, ArXiv.

[5]  Alwyn Young Consistency without Inference: Instrumental Variables in Practical Application , 2019, European Economic Review.

[6]  Michael I. Jordan,et al.  Evaluating Sensitivity to the Stick-Breaking Prior in Bayesian Nonparametrics , 2018, Bayesian Analysis.

[7]  Michael I. Jordan,et al.  A Swiss Army Infinitesimal Jackknife , 2018, AISTATS.

[8]  Michael I. Jordan,et al.  Covariances, Robustness, and Variational Bayes , 2017, J. Mach. Learn. Res..

[9]  Rachael Meager Aggregating Distributional Treatment Effects: A Bayesian Hierarchical Analysis of the Microcredit Literature , 2017, American Economic Review.

[10]  Alexandre Poirier,et al.  Inference on Breakdown Frontiers , 2017, Quantitative Economics.

[11]  Percy Liang,et al.  Understanding Black-box Predictions via Influence Functions , 2017, ICML.

[12]  Jiqiang Guo,et al.  Stan: A Probabilistic Programming Language. , 2017, Journal of statistical software.

[13]  Dustin Tran,et al.  Automatic Differentiation Variational Inference , 2016, J. Mach. Learn. Res..

[14]  Jun Lu,et al.  Case deletion diagnostics for GMM estimation , 2016, Comput. Stat. Data Anal..

[15]  David M. Blei,et al.  Variational Inference: A Review for Statisticians , 2016, ArXiv.

[16]  Barak A. Pearlmutter,et al.  Automatic differentiation in machine learning: a survey , 2015, J. Mach. Learn. Res..

[17]  Orazio Attanasio,et al.  The Impacts of Microfinance: Evidence from Joint-Liability Lending in Mongolia , 2015 .

[18]  Esther Duflo,et al.  Estimating the Impact of Microcredit on Those Who Take it Up: Evidence from a Randomized Experiment in Morocco , 2014 .

[19]  Jonathan Zinman,et al.  Microcredit Impacts: Evidence from a Randomized Microcredit Program Placement Experiment by Compartamos Banco , 2014 .

[20]  A. Banerjee,et al.  The Miracle of Microfinance? Evidence from a Randomized Evaluation , 2013 .

[21]  C. Meghir,et al.  The Impacts of Microcredit: Evidence from Bosnia and Herzegovina , 2012 .

[22]  Xiaohong Chen,et al.  Sensitivity Analysis in Semiparametric Likelihood Models , 2011 .

[23]  Dean S. Karlan,et al.  Microcredit in Theory and Practice: Using Randomized Credit Scoring for Impact Evaluation , 2011, Science.

[24]  S. Hattori,et al.  Approximate subject-deletion influence diagnostics for Inverse Probability of Censoring Weighted (IPCW) method , 2009 .

[25]  M. Angelucci,et al.  Indirect Effects of an Aid Program: How Do Cash Transfers Affect Ineligibles' Consumption? , 2009 .

[26]  L. Stefanski,et al.  The Calculus of M-Estimation , 2002 .

[27]  I. Sobola,et al.  Global sensitivity indices for nonlinear mathematical models and their Monte Carlo estimates , 2001 .

[28]  R. Dennis Cook,et al.  Detection of Influential Observation in Linear Regression , 2000, Technometrics.

[29]  L. Wasserman,et al.  Infinitesimal sensitivity of posterior distributions , 1993 .

[30]  L. W. Taylor A unified approach to the derivation of influential data diagnostics , 1993 .

[31]  B. Carlin An Expected Utility Approach to Influence Diagnostics , 1991 .

[32]  Xuming He TAIL BEHAVIOR OF REGRESSION ESTIMATORS AND THEIR BREAKDOWN POINTS , 1990 .

[33]  R. Dennis Cook,et al.  Assessing influence on regression coefficients in generalized linear models , 1989 .

[34]  John Law,et al.  Robust Statistics—The Approach Based on Influence Functions , 1986 .

[35]  S. Chatterjee,et al.  Influential Observations, High Leverage Points, and Outliers in Linear Regression , 1986 .

[36]  P. Kempthorne Decision-theoretic Measures of Influence in Regression , 1986 .

[37]  D. Freedman,et al.  On the consistency of Bayes estimates , 1986 .

[38]  Edward E. Leamer,et al.  Global Sensitivity Results for Generalized Least Squares Estimates , 1984 .

[39]  N. Lange,et al.  Approximate case influence for the proportional hazards regression model with censored data. , 1984, Biometrics.

[40]  L. Fernholz von Mises Calculus For Statistical Functionals , 1983 .

[41]  S. Geisser,et al.  A Predictive View of the Detection and Characterization of Influential Observations in Regression Analysis , 1983 .

[42]  S. Weisberg,et al.  Residuals and Influence in Regression , 1982 .

[43]  D. Pregibon Logistic Regression Diagnostics , 1981 .

[44]  David A. Belsley,et al.  Regression Diagnostics: Identifying Influential Data and Sources of Collinearity , 1980 .

[45]  John W. Tukey,et al.  Data Analysis and Regression: A Second Course in Statistics , 1977 .

[46]  F. Hampel The Influence Curve and Its Role in Robust Estimation , 1974 .

[47]  R. V. Mises On the Asymptotic Distribution of Differentiable Statistical Functions , 1947 .

[48]  Carroll Morgan,et al.  Robustness , 2020, Encyclopedia of the UN Sustainable Development Goals.

[49]  Rachael Meager,et al.  Understanding the Average Impact of Microcredit Expansions: A Bayesian Hierarchical Analysis of Seven Randomized Experiments , 2019, American Economic Journal: Applied Economics.

[50]  Alessandro Tarozzi,et al.  The Impacts of Microcredit : Evidence from Ethiopia , 2014 .

[51]  Stefano Tarantola,et al.  Global Sensitivity Analysis: An Introduction , 2005 .

[52]  B. Ripley,et al.  Robust Statistics , 2018, Encyclopedia of Mathematical Geosciences.

[53]  Paul Gustafson,et al.  Local Robustness in Bayesian Analysis , 2000 .

[54]  S. R. Jammalamadaka,et al.  Local Posterior Robustness with Parametric Priors: Maximum and Average Sensitivity , 1996 .

[55]  R. Cook Assessment of Local Influence , 1986 .

[56]  Edward E. Leamer,et al.  Sensitivity Analyses Would Help , 1985 .

[57]  Ing Rj Ser Approximation Theorems of Mathematical Statistics , 1980 .

[58]  Frederick R. Forst,et al.  On robust estimation of the location parameter , 1980 .

[59]  P. J. Huber The behavior of maximum likelihood estimates under nonstandard conditions , 1967 .

[60]  J. A. Díaz-García,et al.  SENSITIVITY ANALYSIS IN LINEAR REGRESSION , 2022 .