Human Decisions and Machine Predictions

Can machine learning improve human decision making? Bail decisions provide a good test case. Millions of times each year, judges make jail-or-release decisions that hinge on a prediction of what a defendant would do if released. The concreteness of the prediction task combined with the volume of data available makes this a promising machine-learning application. Yet comparing the algorithm to judges proves complicated. First, the available data are generated by prior judge decisions. We only observe crime outcomes for released defendants, not for those judges detained. This makes it hard to evaluate counterfactual decision rules based on algorithmic predictions. Second, judges may have a broader set of preferences than the variable the algorithm predicts; for instance, judges may care specifically about violent crimes or about racial inequities. We deal with these problems using different econometric strategies, such as quasi-random assignment of cases to judges. Even accounting for these concerns, our results suggest potentially large welfare gains: one policy simulation shows crime reductions up to 24.7% with no change in jailing rates, or jailing rate reductions up to 41.9% with no increase in crime rates. Moreover, all categories of crime, including violent crimes, show reductions; and these gains can be achieved while simultaneously reducing racial disparities. These results suggest that while machine learning can be valuable, realizing this value requires integrating these tools into an economic framework: being clear about the link between predictions and decisions; specifying the scope of payoff functions; and constructing unbiased decision counterfactuals. JEL Codes: C10 (Econometric and statistical methods and methodology), C55 (Large datasets: Modeling and analysis), K40 (Legal procedure, the legal system, and illegal behavior).

[1]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[2]  B. Harcourt,et al.  Risk as a Proxy for Race , 2010 .

[3]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[4]  Susan Athey,et al.  Recursive partitioning for heterogeneous causal effects , 2015, Proceedings of the National Academy of Sciences.

[5]  Matthew O. Jackson Networks in the Understanding of Economic Behaviors , 2014 .

[6]  Cynthia Rudin,et al.  Learning Cost-Effective and Interpretable Treatment Regimes , 2017, AISTATS.

[7]  Bo Pang,et al.  The effect of wording on message propagation: Topic- and author-controlled natural experiments on Twitter , 2014, ACL.

[8]  Justin Bleich,et al.  Forecasts of Violence to Inform Sentencing Decisions , 2014 .

[9]  Jure Leskovec,et al.  A Bayesian Framework for Modeling Human Evaluations , 2015, SDM.

[10]  Sendhil Mullainathan,et al.  Machine Learning: An Applied Econometric Approach , 2017, Journal of Economic Perspectives.

[11]  Jeffrey R. Kling,et al.  Incarceration Length, Employment, and Earnings , 2006 .

[12]  Arjun Venkatesh,et al.  The Determinants of Productivity in Medical Testing: Intensity and Allocation of Care. , 2019, The American economic review.

[13]  Jon Kleinberg,et al.  Making sense of recommendations , 2019, Journal of Behavioral Decision Making.

[14]  Crystal S. Yang,et al.  The Effects of Pre-Trial Detention on Conviction, Future Crime, and Employment: Evidence from Randomly Assigned Judges , 2016 .

[15]  R. Dawes Judgment under uncertainty: The robust beauty of improper linear models in decision making , 1979 .

[16]  R. Dawes,et al.  Heuristics and Biases: Clinical versus Actuarial Judgment , 2002 .

[17]  A. Shleifer,et al.  Salience Theory of Choice Under Risk , 2010 .

[18]  Kevin P. Murphy,et al.  Machine learning - a probabilistic perspective , 2012, Adaptive computation and machine learning series.

[19]  Jaime Henderson,et al.  Using Regression Kernels to Forecast A Failure to Appear in Court , 2014, 1409.1798.

[20]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[21]  Cynthia Rudin,et al.  Interpretable classification models for recidivism prediction , 2015, 1503.07810.

[22]  Sonja B. Starr Evidence-Based Sentencing and the Scientific Rationalization of Discrimination , 2013 .

[23]  E. L. Kelly Clinical versus statistical prediction: A theoretical analysis and review of the evidence. , 1955 .

[24]  Hal R. Varian,et al.  Big Data: New Tricks for Econometrics , 2014 .

[25]  Michael R. Gottfredson,et al.  Policy Guidelines for Bail: An Experiment in Court Reform. , 1985 .

[26]  Tracey Kyckelhahn,et al.  Felony Defendants in Large Urban Counties, 2006: An Overview of How Defendants Charged with a Felony Offense Are Processed from Initial Appearance through Adjudication and Sentencing , 2010 .

[27]  Michele Banko,et al.  Scaling to Very Very Large Corpora for Natural Language Disambiguation , 2001, ACL.

[28]  Paul E. Meehl,et al.  Clinical Versus Statistical Prediction: A Theoretical Analysis and a Review of the Evidence , 1996 .

[29]  J. Kleinberg,et al.  Prediction Policy Problems. , 2015, The American economic review.

[30]  Aurelie Ouss,et al.  THE CRIMINAL AND LABOR MARKET IMPACTS OF INCARCERATION , 2014 .

[31]  Shawn D. Bushway,et al.  Sentencing Using Statistical Treatment Rules: What We Don’t Know Can Hurt Us , 2007 .

[32]  D. Goldstein,et al.  Simple Rules for Complex Decisions , 2017, 1702.04690.

[33]  Brian A. Jacob,et al.  Can Principals Identify Effective Teachers? Evidence on Subjective Performance Evaluation in Education , 2008, Journal of Labor Economics.

[34]  Richard A. Berk,et al.  An impact assessment of machine learning risk forecasts on parole board decisions and recidivism , 2017, Journal of Experimental Criminology.

[35]  Christian Hansen,et al.  High-Dimensional Methods and Inference on Structural and Treatment Effects , 2013 .

[36]  R. Berk Criminal Justice Forecasts of Risk: A Machine Learning Approach , 2012 .

[37]  A. Shleifer,et al.  Journal of Financial Economics] (]]]])]]]–]]] Contents lists available at ScienceDirect Journal of Financial Economics journal homepage: www.elsevier.com/locate/jfec Chasing noise $ , 2022 .

[38]  A. Tversky,et al.  Judgment under Uncertainty: Heuristics and Biases , 1974, Science.

[39]  Jennifer Marie Logg,et al.  Organizational Behavior and Human Decision Processes , 2019 .

[40]  Steven D. Levitt,et al.  What Does Performance in Graduate School Predict? Graduate Economics Education and Student Outcomes , 2007 .

[41]  Arpita Gupta,et al.  The Heavy Costs of High Bail: Evidence from Judge Randomization , 2016, The Journal of Legal Studies.

[42]  Bernard E. Harcourt,et al.  Risk as a Proxy for Race , 2010 .

[43]  R. Di Tella,et al.  Criminal Recidivism after Prison and Electronic Monitoring , 2009, Journal of Political Economy.

[44]  M. Stevenson Distortion of Justice: How the Inability to Pay Bail Affects Case Outcomes , 2018, The Journal of Law, Economics, and Organization.

[45]  W. Grove,et al.  Clinical versus mechanical prediction: a meta-analysis. , 2000, Psychological assessment.

[46]  O. D. Duncan,et al.  The Efficiency of Prediction in Criminology , 1949, American Journal of Sociology.

[47]  Andrew M. Rosenfield,et al.  NOISE: How to overcome the high, hidden cost of inconsistent decision making , 2016 .

[48]  R. Dawes A case study of graduate admissions: Application of three principles of human decision making. , 1971 .

[49]  Christian Henrichson,et al.  The Price of Jails: Measuring the Taxpayer Cost of Local Incarceration , 2015 .

[50]  A. Aizer,et al.  Juvenile Incarceration, Human Capital and Future Crime: Evidence from Randomly-Assigned Judges , 2013 .

[51]  Berkeley J. Dietvorst,et al.  Algorithm Aversion: People Erroneously Avoid Algorithms after Seeing Them Err , 2014, Journal of experimental psychology. General.

[52]  Chris Rohlfs,et al.  Optimal Bail and the Value of Freedom: Evidence from the Philadelphia Bail Experiment , 2007 .

[53]  D. Rubin,et al.  Assessing Sensitivity to an Unobserved Binary Covariate in an Observational Study with Binary Outcome , 1983 .

[54]  Michael Luca,et al.  Supplemental Appendix for : Productivity and Selection of Human Capital with Machine Learning , 2016 .