Singular race models: addressing bias and accuracy in predicting prisoner recidivism

As machine learning based predictive systems pervade many aspects of our lives, an inherent bias and unfairness surface from time to time in the form of mispredictions in various domains. Recidivism, the tendency of offenders to reoffend after release from prison on parole, is one such domain where one race-based sub-population has been found to be treated more harshly than others. Current practices have focused on eliminating race information from datasets to reduce the predictive bias. In contrast to this, we built Singular Race Models, a novel approach of segmenting the dataset based on race, to train and test single race-based models to increase prediction accuracy and reduce racially inspired bias by considering only one race at a time. We created Singular Race Models for four different crime categories and compared these with base models created using all crimes and all races. This modeling choice helped us increase accuracy and analyze race related discrimination. A three-layered artificial neural network was utilized to do the heavy weight-lifting of recidivism prediction. With the help of several suitable metrics, in this paper, we demonstrate the increase in predictive accuracy of these Singular Race Models in various crime categories and analyze the causes and the secondary effect on bias.

[1]  Richard Berk,et al.  Criminal Justice Forecasts of Risk , 2012, SpringerBriefs in Computer Science.

[2]  Alexia Cooper,et al.  Multistate Criminal History Patterns of Prisoners Released in 30 States , 2015 .

[3]  Toon Calders,et al.  Three naive Bayes approaches for discrimination-free classification , 2010, Data Mining and Knowledge Discovery.

[4]  James Byrne,et al.  The Empirical Basis for the RNR Model with an Updated RNR Conceptual Framework , 2013 .

[5]  Hide Yamatani,et al.  Recidivism and Survival Time: Racial Disparity among Jail Ex-Inmates. , 2010 .

[6]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[7]  P. Langan,et al.  Recidivism of Prisoners Released in 1994 , 2002 .

[8]  Grant Duwe,et al.  Out With the Old and in With the New? An Empirical Comparison of Supervised Learning Algorithms to Predict Recidivism , 2017 .

[9]  Faye S. Taxman,et al.  Simulation Strategies to Reduce Recidivism , 2013 .

[10]  Hany Farid,et al.  The accuracy, fairness, and limits of predicting recidivism , 2018, Science Advances.

[11]  Turgut Ozkan,et al.  Predicting Recidivism Through Machine Learning , 2017 .

[12]  Osonde Osoba,et al.  An Intelligence in Our Image , 2017 .

[13]  Robert E. Schapire,et al.  Explaining AdaBoost , 2013, Empirical Inference.

[14]  William Welser,et al.  An Intelligence in Our Image: The Risks of Bias and Errors in Artificial Intelligence , 2017 .

[15]  Christopher T. Lowenkamp,et al.  False Positives, False Negatives, and False Analyses: A Rejoinder to "Machine Bias: There's Software Used across the Country to Predict Future Criminals. and It's Biased against Blacks" , 2016 .

[16]  R. Berk Criminal Justice Forecasts of Risk: A Machine Learning Approach , 2012 .

[17]  Anil K. Jain,et al.  Artificial Neural Networks: A Tutorial , 1996, Computer.

[18]  Caterina G. Roman,et al.  Evaluating and Quantifying the Specific Deterrent Effects of DNA Databases , 2014, Evaluation review.