The Impact of Machine Learning on Economics

This paper provides an assessment of the early contributions of machine learning to economics, as well as predictions about its future contributions. It begins by briefly overviewing some themes from the literature on machine learning, and then draws some contrasts with traditional approaches to estimating the impact of counterfactual policies in economics. Next, we review some of the initial “off-the-shelf” applications of machine learning to economics, including applications in analyzing text and images. We then describe new types of questions that have been posed surrounding the application of machine learning to policy problems, including “prediction policy problems,” as well as considerations of fairness and manipulability. We present some highlights from the emerging econometric literature combining machine learning and causal inference. Finally, we overview a set of broader predictions about the future impact of machine learning on economics, including its impacts on the nature of collaboration, funding, research tools, and research questions.

[1]  Susan Athey,et al.  Set-Asides and Subsidies in Auctions , 2011 .

[2]  Halbert White,et al.  Artificial Neural Networks: Approximation and Learning Theory , 1992 .

[3]  G. Imbens,et al.  Estimating Treatment Effects using Multiple Surrogates: The Role of the Surrogate Score and the Surrogate Index , 2016, 1603.09326.

[4]  Jon M. Kleinberg,et al.  Inherent Trade-Offs in the Fair Determination of Risk Scores , 2016, ITCS.

[5]  Sam Asher,et al.  Classification Trees for Heterogeneous Moment-Based Models , 2016 .

[6]  Lihong Li,et al.  Learning from Logged Implicit Exploration Data , 2010, NIPS.

[7]  Daniel Björkegren,et al.  Behavior Revealed in Mobile Phone Usage Predicts Loan Repayment , 2017, The World Bank Economic Review.

[8]  K. Do,et al.  Efficient and Adaptive Estimation for Semiparametric Models. , 1994 .

[9]  S. Athey,et al.  Generalized random forests , 2016, The Annals of Statistics.

[10]  J. Zubizarreta Stable Weights that Balance Covariates for Estimation With Incomplete Outcome Data , 2015 .

[11]  Sang Michael Xie,et al.  Combining satellite imagery and machine learning to predict poverty , 2016, Science.

[12]  Marc Ratkovic,et al.  Estimating treatment effect heterogeneity in randomized program evaluation , 2013, 1305.5682.

[13]  Christian Hansen,et al.  Valid Post-Selection and Post-Regularization Inference: An Elementary, General Approach , 2015 .

[14]  Susan Athey,et al.  Recursive partitioning for heterogeneous causal effects , 2015, Proceedings of the National Academy of Sciences.

[15]  John Langford,et al.  Doubly Robust Policy Evaluation and Optimization , 2014, ArXiv.

[16]  Jie Liu,et al.  Modeling Consumer Preferences and Price Sensitivities from Large-Scale Grocery Shopping Transaction Logs , 2017, WWW.

[17]  Alexander Peysakhovich,et al.  Combining observational and experimental data to find heterogeneous treatment effects , 2016, ArXiv.

[18]  Ramesh Raskar,et al.  Streetscore -- Predicting the Perceived Safety of One Million Streetscapes , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[19]  Philip A. Haile,et al.  Nonparametric Approaches to Auctions , 2007 .

[20]  Toru Kitagawa,et al.  Who should be Treated? Empirical Welfare Maximization Methods for Treatment Choice , 2015 .

[21]  Susan Athey,et al.  Estimation Considerations in Contextual Bandits , 2017, ArXiv.

[22]  Wei Chu,et al.  A contextual-bandit approach to personalized news article recommendation , 2010, WWW '10.

[23]  Jon Kleinberg,et al.  Making sense of recommendations , 2019, Journal of Behavioral Decision Making.

[24]  Wei Chu,et al.  An unbiased offline evaluation of contextual bandit algorithms with generalized linear models , 2011 .

[25]  John Langford,et al.  Doubly Robust Policy Evaluation and Learning , 2011, ICML.

[26]  Jonathan D. Levin,et al.  Comparing Open and Sealed Bid Auctions: Evidence from Timber Auctions , 2008 .

[27]  K. Hornik,et al.  Model-Based Recursive Partitioning , 2008 .

[28]  Christian Hansen,et al.  Double/Debiased/Neyman Machine Learning of Treatment Effects , 2017, 1701.08687.

[29]  Mohsen Bayati,et al.  Online Decision-Making with High-Dimensional Covariates , 2015 .

[30]  Christian Hansen,et al.  High-Dimensional Methods and Inference on Structural and Treatment Effects , 2013 .

[31]  Susan Athey,et al.  The State of Applied Econometrics - Causality and Policy Evaluation , 2016, 1607.00699.

[32]  Denis Nekipelov,et al.  Estimation of Treatment Effects from Combined Data: Identification versus Data Security , 2015 .

[33]  Sören R. Künzel,et al.  Meta-learners for Estimating Heterogeneous Treatment Effects using Machine Learning , 2017 .

[34]  Justin M. Rao,et al.  Precinct or Prejudice? Understanding Racial Disparities in New York City's Stop-and-Frisk Policy , 2016 .

[35]  A. Zeevi,et al.  A Linear Response Bandit Problem , 2013 .

[36]  Justin M. Rao,et al.  Precinct or Prejudice? Understanding Racial Disparities in New York City's Stop-and-Frisk Policy , 2015 .

[37]  M. J. van der Laan,et al.  The International Journal of Biostatistics Targeted Maximum Likelihood Learning , 2011 .

[38]  Lihong Li,et al.  Provable Optimal Algorithms for Generalized Linear Contextual Bandits , 2017, ArXiv.

[39]  Margaret E. Roberts,et al.  How to make causal inferences using texts , 2018, Science advances.

[40]  J. Bai,et al.  Large Dimensional Factor Analysis , 2008 .

[41]  Lihong Li,et al.  An Empirical Evaluation of Thompson Sampling , 2011, NIPS.

[42]  Andrew Y. Ng,et al.  Pharmacokinetics of a novel formulation of ivermectin after administration to goats , 2000, ICML.

[43]  Stefan Wager,et al.  Estimation and Inference of Heterogeneous Treatment Effects using Random Forests , 2015, Journal of the American Statistical Association.

[44]  Dean Eckles,et al.  Design and Analysis of Experiments in Networks: Reducing Bias from Interference , 2014, ArXiv.

[45]  David M. Blei,et al.  SHOPPER: A Probabilistic Model of Consumer Choice with Substitutes and Complements , 2017, The Annals of Applied Statistics.

[46]  D. Giannone,et al.  Now-Casting and the Real-time Data Flow , 2012, SSRN Electronic Journal.

[47]  David M. Blei,et al.  Scalable Recommendation with Hierarchical Poisson Factorization , 2015, UAI.

[48]  A. Roth The Economist as Engineer: Game Theory, Experimentation, and Computation as Tools for Design Economics , 2002 .

[49]  Nathan Kallus,et al.  Balanced Policy Evaluation and Learning , 2017, NeurIPS.

[50]  Hal R. Varian,et al.  Big Data: New Tricks for Econometrics , 2014 .

[51]  J. Laffont,et al.  ECONOMETRICS OF FIRST-PRICE AUCTIONS , 1995 .

[52]  G. Imbens,et al.  Balancing, Regression, Difference-in-Differences and Synthetic Control Methods: A Synthesis , 2016, 1610.07748.

[53]  Philip S. Thomas,et al.  Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning , 2016, ICML.

[54]  Stefan Wager,et al.  Estimating Average Treatment Effects: Supplementary Analyses and Remaining Challenges , 2017, 1702.01250.

[55]  David M. Blei,et al.  Probabilistic topic models , 2012, Commun. ACM.

[56]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[57]  Kevin Leyton-Brown,et al.  Counterfactual Prediction with Deep Instrumental Variables Networks , 2016, ArXiv.

[58]  P. Robinson ROOT-N-CONSISTENT SEMIPARAMETRIC REGRESSION , 1988 .

[59]  Nan Jiang,et al.  Doubly Robust Off-policy Value Evaluation for Reinforcement Learning , 2015, ICML.

[60]  Susan Athey,et al.  A Measure of Robustness to Misspecification , 2015 .

[61]  G. Imbens,et al.  Exact p-Values for Network Interference , 2015, 1506.02084.

[62]  G. Imbens,et al.  Matrix Completion Methods for Causal Panel Data Models , 2017, Journal of the American Statistical Association.

[63]  Ramesh Raskar,et al.  Computer vision uncovers predictors of physical urban change , 2017, Proceedings of the National Academy of Sciences.

[64]  J. Hahn On the Role of the Propensity Score in Efficient Semiparametric Estimation of Average Treatment Effects , 1998 .

[65]  David M. Blei,et al.  Estimating Heterogeneous Consumer Preferences for Restaurants and Travel Time Using Mobile Location Data , 2018, ArXiv.

[66]  G. Imbens,et al.  Approximate residual balancing: debiased inference of average treatment effects in high dimensions , 2016, 1604.07125.

[67]  Susan Athey,et al.  Counterfactual inference for consumer choice across many product categories , 2019, Quantitative Marketing and Economics.

[68]  Michael Luca,et al.  Crowdsourcing City Government: Using Tournaments to Improve Inspection Accuracy , 2016 .

[69]  Ryan Engstrom,et al.  Poverty from Space: Using High-Resolution Satellite Imagery for Estimating Economic Well-Being , 2017, The World Bank Economic Review.

[70]  Steven L. Scott,et al.  A modern Bayesian look at the multi-armed bandit , 2010 .

[71]  Susan Athey,et al.  The Impact of Aggregators on Internet News Consumption , 2017, SSRN Electronic Journal.

[72]  Justin M. Rao,et al.  The Unfavorable Economics of Measuring the Returns to Advertising , 2014 .

[73]  Lihong Li,et al.  Counterfactual Estimation and Optimization of Click Metrics for Search Engines , 2014, ArXiv.

[74]  Esther Duflo,et al.  The Economist as Plumber , 2017 .

[75]  Sendhil Mullainathan,et al.  Machine Learning: An Applied Econometric Approach , 2017, Journal of Economic Perspectives.

[76]  Thorsten Joachims,et al.  Batch learning from logged bandit feedback through counterfactual risk minimization , 2015, J. Mach. Learn. Res..

[77]  Susan Athey,et al.  Beyond prediction: Using big data for policy problems , 2017, Science.

[78]  Raphaël Féraud,et al.  Random Forest for the Contextual Bandit Problem , 2015, AISTATS.

[79]  Matthew Goldman,et al.  Experiments as Instruments: Heterogeneous Position Effects in Sponsored Search Auctions , 2015, EAI Endorsed Trans. Serious Games.

[80]  R. Tibshirani,et al.  Local Likelihood Estimation , 1987 .

[81]  J. Kleinberg,et al.  Prediction Policy Problems. , 2015, The American economic review.

[82]  Michael Luca,et al.  Big Data and Big Cities: The Promises and Limitations of Improved Measures of Urban Life , 2015 .

[83]  P. Bickel Efficient and Adaptive Estimation for Semiparametric Models , 1993 .

[84]  D. McFadden Conditional logit analysis of qualitative choice behavior , 1972 .