The Transfer Performance of Economic Models

Economists often estimate models using data from a particular domain, e.g. estimating risk preferences in a particular subject pool or for a specific class of lotteries. Whether a model's predictions extrapolate well across domains depends on whether the estimated model has captured generalizable structure. We provide a tractable formulation for this"out-of-domain"prediction problem and define the transfer error of a model based on how well it performs on data from a new domain. We derive finite-sample forecast intervals that are guaranteed to cover realized transfer errors with a user-selected probability when domains are iid, and use these intervals to compare the transferability of economic models and black box algorithms for predicting certainty equivalents. We find that in this application, the black box algorithms we consider outperform standard economic models when estimated and tested on data from the same domain, but the economic models generalize across domains better than the black-box algorithms do.

[1]  Sylvain Chassang,et al.  Designing Randomized Controlled Trials with External Validity in Mind , 2022, Social Science Research Network.

[2]  Stefan Wager,et al.  Learning from a Biased Sample , 2022, ArXiv.

[3]  Anastasios Nikolas Angelopoulos,et al.  Conformal Risk Control , 2022, ArXiv.

[4]  Xinkun Nie,et al.  Covariate Balancing Sensitivity Analysis for Extrapolating Randomized Trials across Locations , 2021, 2112.04723.

[5]  D. Fudenberg,et al.  Measuring the Completeness of Economic Models , 2019, Journal of Political Economy.

[6]  Matthew J. Salganik,et al.  Integrating explanation and prediction in computational social science , 2021, Nature.

[7]  Joshua C. Peterson,et al.  Using large-scale experiments and machine learning to discover theories of human decision-making , 2021, Science.

[8]  Chen Change Loy,et al.  Domain Generalization: A Survey , 2021, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Michael I. Jordan,et al.  Distribution-Free, Risk-Controlling Prediction Sets , 2021, J. ACM.

[10]  D. Fudenberg,et al.  Quantifying the Restrictiveness of Theories , 2020 .

[11]  Christoph Kuzmics,et al.  Comparing Theories of One-Shot Play Out of Treatment , 2020, J. Econ. Theory.

[12]  Joshua C. Peterson,et al.  Scaling up psychology via Scientific Regret Minimization , 2020, Proceedings of the National Academy of Sciences.

[13]  Zhaoran Wang,et al.  Behavioral Neural Networks , 2020, SSRN Electronic Journal.

[14]  B. Bernheim,et al.  On the Empirical Validity of Cumulative Prospect Theory: Experimental Evidence of Rank-Independent Probability Weighting , 2020, Econometrica.

[15]  Charles F. Manski,et al.  Econometrics for Decision Making: Building Foundations Sketched by Haavelmo and Wald , 2019, Econometrica.

[16]  Sanjay Mehrotra,et al.  Distributionally Robust Optimization: A Review , 2019, ArXiv.

[17]  M. Dean,et al.  The empirical relationship between nonstandard economic behaviors , 2019, Proceedings of the National Academy of Sciences.

[18]  Alexander Maedche,et al.  How effective is nudging? A quantitative review on the effect sizes and limits of empirical nudging studies , 2019, Journal of Behavioral and Experimental Economics.

[19]  Ryan J. Tibshirani,et al.  Predictive inference with the jackknife+ , 2019, The Annals of Statistics.

[20]  Taisuke Imai,et al.  Meta-Analysis of Present-Bias Estimation Using Convex Time Budgets , 2019, The Economic Journal.

[21]  Devin G. Pope,et al.  Stability of Experimental Results: Forecasts and Evidence , 2019, SSRN Electronic Journal.

[22]  Moshe Tennenholtz,et al.  Predicting human decisions with behavioral theories and machine learning , 2019, ArXiv.

[23]  Alec Smith,et al.  Dynamic Unstructured Bargaining with Private Information: Theory, Experiment, and Outcome Prediction via Machine Learning , 2019, Manag. Sci..

[24]  D. Budescu,et al.  Decisions With Compound Lotteries , 2019, Decision.

[25]  Drew Fudenberg,et al.  Predicting and Understanding Initial Play , 2019, American Economic Review.

[26]  Purchasing power parities , 2018 .

[27]  Robert B. Olsen,et al.  A Review of Statistical Methods for Generalizing From Evaluations of Educational Interventions , 2018, Educational Researcher.

[28]  Victor Chernozhukov,et al.  An Exact and Robust Conformal Inference Method for Counterfactual and Synthetic Controls , 2017, Journal of the American Statistical Association.

[29]  Ferdinand M. Vieider,et al.  Accommodating stake effects under prospect theory , 2017 .

[30]  Rachael Meager Aggregating Distributional Treatment Effects: A Bayesian Hierarchical Analysis of the Microcredit Literature , 2017, American Economic Review.

[31]  Isaiah Andrews,et al.  A Simple Approximation for Evaluating External Validity Bias , 2017, Economics Letters.

[32]  Katherine L. Milkman,et al.  Should Governments Invest More in Nudging? , 2017, Psychological science.

[33]  Susan Athey,et al.  Beyond prediction: Using big data for policy problems , 2017, Science.

[34]  Andrea Prat,et al.  Do Women Respond Less to Performance Pay? Building Evidence from Multiple Experiments , 2016 .

[35]  Amit Daniely,et al.  Behavior-Based Machine-Learning: A Hybrid Approach for Predicting Human Decision Making , 2016, ArXiv.

[36]  E. Dougherty,et al.  Big data need big theory too , 2016, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[37]  Moshe Tennenholtz,et al.  Psychological Forest: Predicting Human Behavior , 2016, AAAI.

[38]  Alessandro Rinaldo,et al.  Distribution-Free Predictive Inference for Regression , 2016, Journal of the American Statistical Association.

[39]  C. Starmer,et al.  How do risk attitudes affect measured confidence? , 2016 .

[40]  Mohammed Abdellaoui,et al.  Experiments on Compound Risk in Relation to Simple Risk and to Ambiguity , 2015, Manag. Sci..

[41]  Rajeev Dehejia,et al.  From Local to Global: External Validity in a Fertility Natural Experiment , 2015, Journal of Business & Economic Statistics.

[42]  Susan Athey,et al.  Recursive partitioning for heterogeneous causal effects , 2015, Proceedings of the National Academy of Sciences.

[43]  Guido W. Imbens,et al.  External Validity in Fuzzy Regression Discontinuity Designs , 2014, Journal of Business & Economic Statistics.

[44]  Elias Bareinboim,et al.  External Validity: From Do-Calculus to Transportability Across Populations , 2014, Probabilistic and Causal Inference.

[45]  Andrew J. Hill,et al.  Superstition in the Housing Market , 2014, SSRN Electronic Journal.

[46]  Shai Ben-David,et al.  Understanding Machine Learning: From Theory to Algorithms , 2014 .

[47]  Donald K. K. Lee,et al.  Interval estimation of population means under unknown but bounded probabilities of sample selection , 2013 .

[48]  Bernhard Schölkopf,et al.  Domain Generalization via Invariant Feature Representation , 2013, ICML.

[49]  V. Vovk,et al.  Combining P-Values Via Averaging , 2012, Biometrika.

[50]  Kevin P. Murphy,et al.  Machine learning - a probabilistic perspective , 2012, Adaptive computation and machine learning series.

[51]  Helga Fehr-Duda,et al.  Probability and Risk: Foundations and Economic Implications of Probability-Dependent Risk Preferences , 2012 .

[52]  John A. List,et al.  On the Generalizability of Experimental Results in Economics , 2012 .

[53]  Gilles Blanchard,et al.  Generalizing from Several Related Classification Tasks to a New Unlabeled Sample , 2011, NIPS.

[54]  Matthias Sutter,et al.  Working Papers in Economics and Statistics Group polarization in the team dictator game reconsidered , 2007 .

[55]  Elias Bareinboim,et al.  Transportability across studies: A formal approach , 2011 .

[56]  Olivier L’Haridon,et al.  Monetary incentives in the loss domain and behavior toward risk: An experimental comparison of three reward schemes including real losses , 2011 .

[57]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[58]  J. Angrist,et al.  Extrapolate-Ing: External Validity and Overidentification in the Late Framework , 2010 .

[59]  Ferdinand M. Vieider,et al.  Incentive Effects on Risk Attitude in Small Probability Prospects , 2010, SSRN Electronic Journal.

[60]  Angus Deaton Instruments, Randomization, and Learning about Development , 2010 .

[61]  Adrian Bruhin,et al.  Risk and Rationality: Uncovering Heterogeneity in Probability Distortion , 2009 .

[62]  R. Schubert,et al.  Rationality on the rise: Why relative risk aversion increases with stake size , 2009 .

[63]  G. Imbens,et al.  Better Late than Nothing: Some Comments on Deaton (2009) and Heckman and Urzua (2009) , 2009 .

[64]  Steven D. Levitt,et al.  Viewpoint: On the Generalizability of Lab Behaviour to the Field , 2007 .

[65]  Andreas Maurer,et al.  Concentration inequalities for functions of independent variables , 2006, Random Struct. Algorithms.

[66]  Yoram Halevy Ellsberg Revisited: An Experimental Study , 2005 .

[67]  V. J. Hotz,et al.  Predicting the efficacy of future training programs using past experiences at other locations , 2005 .

[68]  V. Bentkus On Hoeffding’s inequalities , 2004, math/0410159.

[69]  Werner Güth,et al.  On the Interaction of Risk and Time Preferences: An Experimental Study , 2001 .

[70]  Xiao-Li Meng,et al.  Posterior Predictive $p$-Values , 1994 .

[71]  A. Witte,et al.  The Influence of Probability on Risky Choice: A Parametric Examination , 1992 .

[72]  H. J. Einhorn,et al.  Expression theory and the preference reversal phenomena. , 1987 .

[73]  Uday S. Karmarkar,et al.  Subjectively weighted utility: A descriptive extension of the expected utility model , 1978 .

[74]  D. McFadden MEASUREMENT OF URBAN TRAVEL DEMAND , 1974 .

[75]  S. Zionts,et al.  Programming with linear fractional functionals , 1968 .

[76]  T. Haavelmo,et al.  The probability approach in econometrics , 1944 .

[77]  Stanislav Abaimov,et al.  Understanding Machine Learning , 2022, Machine Learning for Cyber Agents.

[78]  Galit Shmueli,et al.  Transporting Causal Effects Across Populations Using Structural Causal Modeling: The Example of Work-From-Home Productivity , 2021, Social Science Research Network.

[79]  Ferdinand M. Vieider,et al.  All over the map: A worldwide comparison of risk preferences , 2019, Quantitative Economics.

[80]  Rachael Meager,et al.  Understanding the Average Impact of Microcredit Expansions: A Bayesian Hierarchical Analysis of Seven Randomized Experiments , 2019, American Economic Journal: Applied Economics.

[81]  Kevin Leyton-Brown,et al.  Deep Learning for Predicting Human Strategic Behavior , 2016, NIPS.

[82]  Guido Imbens,et al.  Site Selection Bias in Program Evaluation , 2014 .

[83]  David Card,et al.  Time-Series Minimum-Wage Studies: A Meta-analysis , 1995 .

[84]  N. Fisher,et al.  Probability Inequalities for Sums of Bounded Random Variables , 1994 .

[85]  Stochastic Inequalities,et al.  RANDOM VARIABLES WITH MAXIMUM SUMS , 1982 .