Algorithmic Risk Assessments Can Alter Human Decision-Making Processes in High-Stakes Government Contexts

Governments are increasingly turning to algorithmic risk assessments when making important decisions, such as whether to release criminal defendants before trial. Policymakers assert that providing public servants with algorithmic advice will improve human risk predictions and thereby lead to better (e.g., fairer) decisions. Yet because many policy decisions require balancing risk-reduction with competing goals, improving the accuracy of predictions may not necessarily improve the quality of decisions. If risk assessments make people more attentive to reducing risk at the expense of other values, these algorithms would diminish the implementation of public policy even as they lead to more accurate predictions. Through an experiment with 2,140 lay participants simulating two high-stakes government contexts, we provide the first direct evidence that risk assessments can systematically alter how people factor risk into their decisions. These shifts counteracted the potential benefits of improved prediction accuracy. In the pretrial setting of our experiment, the risk assessment made participants more sensitive to increases in perceived risk; this shift increased the racial disparity in pretrial detention by 1.9%. In the government loans setting of our experiment, the risk assessment made participants more risk-averse; this shift reduced government aid by 8.3%. These results demonstrate the potential limits and harms of attempts to improve public policy by incorporating predictive algorithms into multifaceted policy decisions. If these observed behaviors occur in practice, presenting risk assessments to public servants would generate unexpected and unjust shifts in public policy without being subject to democratic deliberation or oversight.

[1]  Dominique Makowski,et al.  bayestestR: Describing Effects and their Uncertainty, Existence and Significance within the Bayesian Framework , 2019, J. Open Source Softw..

[2]  Ben Green,et al.  The false promise of risk assessments: epistemic reform and the limits of fairness , 2020, FAT*.

[3]  John A. List,et al.  Do professional traders exhibit myopic loss aversion? An experimental analysis , 2005 .

[4]  John S. Hollywood,et al.  Predictions put into practice: a quasi-experimental evaluation of Chicago’s predictive policing pilot , 2016 .

[5]  Berkeley J. Dietvorst,et al.  Algorithm Aversion: People Erroneously Avoid Algorithms after Seeing Them Err , 2014, Journal of experimental psychology. General.

[6]  Peter Baumgartner,et al.  Public safety assessment , 2020, Criminology & Public Policy.

[7]  Sarah Desmarais Jay Singh,et al.  Risk Assessment Instruments Validated and Implemented in Correctional Settings in the United States , 2013 .

[8]  Katharina Reinecke,et al.  Crowdsourcing performance evaluations of user interfaces , 2013, CHI.

[9]  Dominique Makowski,et al.  Indices of Effect Existence and Significance in the Bayesian Framework , 2019, Front. Psychol..

[10]  Denis J. Hilton,et al.  Choice preferences without inferences: subconscious priming of risk attitudes , 2002 .

[11]  Jon Kleinberg,et al.  Making sense of recommendations , 2019, Journal of Behavioral Decision Making.

[12]  Chris Guthrie,et al.  Inside the Judicial Mind , 2001 .

[13]  Ernst Fehr,et al.  UvA-DARE ( Digital Academic Repository ) Evidence for Countercyclical Risk Aversion : An Experiment with Financial Professionals , 2015 .

[14]  Robert Fildes,et al.  Judgmental forecasts of time series affected by special events: does providing a statistical forecast improve accuracy? , 1999 .

[15]  M. Stevenson,et al.  Assessing Risk Assessment in Action , 2018 .

[16]  Ryanne A. Brown,et al.  Impact of a deep learning assistant on the histopathologic classification of liver cancer , 2020, npj Digital Medicine.

[17]  Brian F. Schaffner,et al.  Priming Risk: The Accessibility of Uncertainty in Public Policy Decision Making , 2011 .

[18]  Ben Green The Smart Enough City , 2019 .

[19]  Paul-Christian Bürkner,et al.  Advanced Bayesian Multilevel Modeling with the R Package brms , 2017, R J..

[20]  Don A. Moore,et al.  Organizational Behavior and Human Decision Processes , 2019 .

[21]  Eric Horvitz,et al.  Combining human and machine intelligence in large-scale crowdsourcing , 2012, AAMAS.

[22]  Yang Liu,et al.  Actionable Recourse in Linear Classification , 2018, FAT.

[23]  Peter Baumgartner,et al.  The Public Safety Assessment: A Re-Validation and Assessment of Predictive Utility and Differential Prediction by Race and Gender in Kentucky , 2018 .

[24]  V Kishore Ayyadevara,et al.  Gradient Boosting Machine , 2018 .

[25]  D. Yates,et al.  Street-Level Bureaucracy: Dilemmas of the Individual in Public Services , 1981, Michigan Law Review.

[26]  Crystal S. Yang,et al.  TOWARD AN OPTIMAL BAIL SYSTEM , 2017 .

[27]  Timothy D. Wilson,et al.  Telling more than we can know: Verbal reports on mental processes. , 1977 .

[28]  Juan M. Del Nido Metrics at Work: Journalism and the Contested Meaning of Algorithms , 2020 .

[29]  J. Travis Effects of Judges ’ Sentencing Decisions on Criminal Careers , 1999 .

[30]  J. Rachlinski,et al.  Gains, Losses, and Judges: Framing and the Judiciary , 2018 .

[31]  André Furlani The Smart Enough City: Putting Technology in Its Place to Reclaim Our Urban Future , 2021, The European Legacy.

[32]  Eric Potash,et al.  Predictive Modeling for Public Health: Preventing Childhood Lead Poisoning , 2015, KDD.

[33]  D. Kliger,et al.  Priming the Risk Attitudes of Professionals in Financial Decision Making , 2008 .

[34]  Jennifer L. Doleac,et al.  Algorithmic Risk Assessment in the Hands of Humans , 2019, SSRN Electronic Journal.

[35]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[36]  J. Kleinberg,et al.  Prediction Policy Problems. , 2015, The American economic review.

[37]  BEN GREEN,et al.  The Principles and Limits of Algorithm-in-the-Loop Decision Making , 2019, Proc. ACM Hum. Comput. Interact..

[38]  Vincent Aleven,et al.  Designing for Complementarity: Teacher and Student Needs for Orchestration Support in AI-Enhanced Classrooms , 2019, AIED.

[39]  Jiangtao Wang,et al.  Crowd-Assisted Machine Learning: Current Issues and Future Directions , 2019, Computer.

[40]  Alex Albright,et al.  IF YOU GIVE A JUDGE A RISK SCORE: EVIDENCE FROM KENTUCKY BAIL DECISIONS , 2019 .

[41]  Virginia E. Eubanks Automating Inequality: How High-Tech Tools Profile, Police, and Punish the Poor , 2018 .

[42]  Barbara Kiviat The Moral Limits of Predictive Practices: The Case of Credit-Based Insurance Scores , 2019, American Sociological Review.

[43]  J. Monahan,et al.  Impact of risk assessment on judges' fairness in sentencing relatively poor defendants. , 2020, Law and human behavior.

[44]  D. Citron Technological Due Process , 2007 .

[45]  R. John,et al.  The Effect of Framing Actuarial Risk Probabilities on Involuntary Civil Commitment Decisions , 2011, Law and human behavior.

[46]  Bo Cowgill The Impact of Algorithms on Judicial Discretion : Evidence from Regression Discontinuities , 2018 .

[47]  Alexandra Chouldechova,et al.  A Case for Humans-in-the-Loop: Decisions in the Presence of Erroneous Algorithmic Scores , 2020, CHI.

[48]  Jiqiang Guo,et al.  Stan: A Probabilistic Programming Language. , 2017, Journal of statistical software.

[49]  Vivian Lai,et al.  On Human Predictions with Explanations and Predictions of Machine Learning Models: A Case Study on Deception Detection , 2018, FAT.

[50]  S. B. Baughman Restoring the Presumption of Innocence , 2011 .

[51]  Youngjae Lee,et al.  Dangerous Defendants , 2019 .

[52]  Jure Leskovec,et al.  Human Decisions and Machine Predictions , 2017, The quarterly journal of economics.

[53]  Kristie B. Hadden,et al.  2020 , 2020, Journal of Surgical Orthopaedic Advances.

[54]  Sonja B. Starr Evidence-Based Sentencing and the Scientific Rationalization of Discrimination , 2013 .

[55]  M. O'Connor,et al.  Judgemental adjustment of initial forecasts: Its effectiveness and biases , 1995 .

[56]  Ben Green,et al.  Algorithmic realism: expanding the boundaries of algorithmic thought , 2020, FAT*.

[57]  K. Gummadi,et al.  Human Decision Making with Machine Assistance , 2019, Proc. ACM Hum. Comput. Interact..

[58]  Angèle Christin,et al.  Technologies of Crime Prediction: The Reception of Algorithms in Policing and Criminal Courts , 2020 .

[59]  A. Coppock Generalizing from Survey Experiments Conducted on Mechanical Turk: A Replication Approach , 2018, Political Science Research and Methods.

[60]  Chris Guthrie,et al.  Does Unconscious Racial Bias Affect Trial Judges , 2009 .

[61]  D. G. Robinson,et al.  Danger Ahead: Risk Assessment and the Future of Bail Reform , 2018 .

[62]  A. Tversky,et al.  The framing of decisions and the psychology of choice. , 1981, Science.

[63]  Ben Green,et al.  Disparate Interactions: An Algorithm-in-the-Loop Analysis of Fairness in Risk Assessments , 2019, FAT.

[64]  Ben Green Risk Assessments : A Precarious Approach for Criminal Justice Reform , 2018 .