Beyond prediction: Using big data for policy problems

Machine-learning prediction methods have been extremely productive in applications ranging from medicine to allocating fire and health inspectors in cities. However, there are a number of gaps between making a prediction and making a decision, and underlying assumptions need to be understood in order to optimize data-driven decision-making.

[1]  John Langford,et al.  Doubly Robust Policy Evaluation and Learning , 2011, ICML.

[2]  Gabriel Cadamuro,et al.  Predicting poverty and wealth from mobile phone metadata , 2015, Science.

[3]  Jonathan Hersh,et al.  Poverty in HD : What Does High Resolution Satellite Imagery Reveal about Economic Welfare ? , 2016 .

[4]  Z. Obermeyer,et al.  Predicting the Future - Big Data, Machine Learning, and Clinical Medicine. , 2016, The New England journal of medicine.

[5]  Yejin Choi,et al.  Where Not to Eat? Improving Public Policy by Predicting Hygiene Inspections Using Online Reviews , 2013, EMNLP.

[6]  Julie Tibshirani,et al.  Solving Heterogeneous Estimating Equations with Gradient Forests , 2016 .

[7]  Eva Ascarza Retention Futility: Targeting High-Risk Customers Might be Ineffective , 2018 .

[8]  G. Imbens,et al.  Approximate residual balancing: debiased inference of average treatment effects in high dimensions , 2016, 1604.07125.

[9]  G. Imbens,et al.  Efficient Inference of Average Treatment Effects in High Dimensions via Approximate Residual Balancing , 2016 .

[10]  Steven L. Scott,et al.  Multi-armed bandit experiments in the online service economy , 2015 .

[11]  Mark Braverman,et al.  Data-Driven Decisions for Reducing Readmissions for Heart Failure: General Methodology and Case Study , 2014, PloS one.

[12]  Michael S. Bernstein,et al.  Designing and deploying online field experiments , 2014, WWW.

[13]  J. Kleinberg,et al.  Prediction Policy Problems. , 2015, The American economic review.

[14]  Michael Luca,et al.  Big Data and Big Cities: The Promises and Limitations of Improved Measures of Urban Life , 2015 .

[15]  Steven Tadelis,et al.  Consumer Heterogeneity and Paid Search Effectiveness: A Large Scale Field Experiment , 2014 .

[16]  Justin Grimmer,et al.  Text as Data: The Promise and Pitfalls of Automatic Content Analysis Methods for Political Texts , 2013, Political Analysis.

[17]  Jun Sakuma,et al.  Fairness-Aware Classifier with Prejudice Remover Regularizer , 2012, ECML/PKDD.

[18]  A. Belloni,et al.  Inference on Treatment Effects after Selection Amongst High-Dimensional Controls , 2011, 1201.0224.

[19]  Susan Athey,et al.  Auction-Based Timber Pricing and Complementary Market Reforms in British Columbia , 2002 .

[20]  César A. Hidalgo,et al.  Cities Are Physical Too: Using Computer Vision to Measure the Quality and Impact of Urban Appearance , 2016 .

[21]  R. Berk Criminal Justice Forecasts of Risk: A Machine Learning Approach , 2012 .

[22]  Richard Berk,et al.  Criminal Justice Forecasts of Risk , 2012, SpringerBriefs in Computer Science.

[23]  A. Belloni,et al.  SPARSE MODELS AND METHODS FOR OPTIMAL INSTRUMENTS WITH AN APPLICATION TO EMINENT DOMAIN , 2012 .

[24]  Michael Luca,et al.  Crowdsourcing City Government: Using Tournaments to Improve Inspection Accuracy , 2016 .

[25]  Steven Tadelis,et al.  Consumer Heterogeneity and Paid Search Effectiveness: A Large-Scale Field Experiment: Paid Search Effectiveness , 2015 .