Prediction of Recidivism in Thefts and Burglaries Using Machine Learning

Background/objectives: Theft and burglary are two crimes against property that have a great social impact. Their prevention drastically lowers victimization rates and the feeling of insecurity in the population. The objective of this investigation is to obtain an index that allows the prediction of repeat offenses by criminals in these types of crimes, in order to support decision-making with respect to preventative actions. Methodology: In order to obtain the index, a group of machines learning was trained, with information provided by the Criminal Analysis and Investigative Focus System (CAIFS) from the Regional Public Prosecutor’s Office in Biobío, Chile. The information provided was from thefts and burglaries committed between 2012 and 2017 in the city of Concepción. Findings/application: The results show a characterization of repeat offenders in these types of crime and a recurrence index that allows for a greater assertiveness in the prediction of recidivism than the method that is currently being used.

[1]  Mansi Ghodsi,et al.  A review of data mining applications in crime , 2016, Stat. Anal. Data Min..

[2]  A. M. Hay,et al.  The derivation of global estimates from a confusion matrix , 1988 .

[3]  Rekha Bhowmik,et al.  Detecting Auto Insurance Fraud by Data Mining Techniques , 2011 .

[4]  P. Thongtae,et al.  An Analysis of Data Mining Applications in Crime Domain , 2008, 2008 IEEE 8th International Conference on Computer and Information Technology Workshops.

[5]  Wei Ding,et al.  Crime Forecasting Using Data Mining Techniques , 2011, 2011 IEEE 11th International Conference on Data Mining Workshops.

[6]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[7]  Sotiris B. Kotsiantis,et al.  Machine learning: a review of classification and combining techniques , 2006, Artificial Intelligence Review.

[8]  Leonard J. Tashman,et al.  Out-of-sample tests of forecasting accuracy: an analysis and review , 2000 .

[9]  Fredy Troncoso,et al.  A novel approach to detect associations in criminal networks , 2020, Decis. Support Syst..

[10]  Teara Archwamety,et al.  Factors Related to Recidivism Among Delinquent Youths in a State Correctional Facility , 1997 .

[11]  Christie M. Fuller,et al.  An investigation of data and text mining methods for real world deception detection , 2011, Expert Syst. Appl..

[12]  Flora D. Salim,et al.  Crime event prediction with dynamic features , 2018, EPJ Data Science.

[13]  Vadlamani Ravi,et al.  Detecting phishing e-mails using text and data mining , 2012, 2012 IEEE International Conference on Computational Intelligence and Computing Research.

[14]  Ronald J. Brachman,et al.  The Process of Knowledge Discovery in Databases , 1996, Advances in Knowledge Discovery and Data Mining.

[15]  Robert P. Goldman,et al.  Imputation of Missing Data Using Machine Learning Techniques , 1996, KDD.

[16]  Ying Wang,et al.  Computer Crime Forensics Based on Improved Decision Tree Algorithm , 2014, J. Networks.

[17]  N. Tollenaar,et al.  Which method predicts recidivism best?: a comparison of statistical, machine learning and data mining predictive models , 2013 .

[18]  Shyam Varan Nath,et al.  Crime Pattern Detection Using Data Mining , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology Workshops.

[19]  Nikolaj Tollenaar,et al.  Optimizing predictive performance of criminal recidivism models using registration data with binary and survival outcomes , 2019, PloS one.

[20]  Mohammad Reza Keyvanpour,et al.  Detecting and investigating crime by means of data mining: a general crime matching framework , 2011, WCIT.

[21]  Nitesh V. Chawla,et al.  SPECIAL ISSUE ON LEARNING FROM IMBALANCED DATA SETS , 2004 .

[22]  Guy Lapalme,et al.  A systematic analysis of performance measures for classification tasks , 2009, Inf. Process. Manag..

[23]  Walter A. Kosters,et al.  Data Mining Approaches to Criminal Career Analysis , 2006, Sixth International Conference on Data Mining (ICDM'06).

[24]  Seyed Mohammad Seyedhosseini,et al.  Cluster analysis using data mining approach to develop CRM methodology to assess the customer loyalty , 2010, Expert Syst. Appl..

[25]  Padhraic Smyth,et al.  Knowledge Discovery and Data Mining: Towards a Unifying Framework , 1996, KDD.

[26]  Teara Archwamety,et al.  Factors Related to Recidivism Among Delinquent Females at a State Correctional Facility , 1997 .

[27]  Hany Farid,et al.  The accuracy, fairness, and limits of predicting recidivism , 2018, Science Advances.

[28]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[29]  Walter A. Kosters,et al.  A Distance Measure for Determining Similarity Between Criminal Investigations , 2006, Industrial Conference on Data Mining.

[30]  S. Appavu alias Balamurugan,et al.  Association Rule Mining for Suspicious Email Detection: A Data Mining Approach , 2007, 2007 IEEE Intelligence and Security Informatics.

[31]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[32]  Turgut Ozkan,et al.  Predicting Recidivism Through Machine Learning , 2017 .

[33]  Roderick J A Little,et al.  A Review of Hot Deck Imputation for Survey Non‐response , 2010, International statistical review = Revue internationale de statistique.

[34]  Yannis Manolopoulos,et al.  Data Mining techniques for the detection of fraudulent financial statements , 2007, Expert Syst. Appl..

[35]  Richard Simon,et al.  Bias in error estimation when using cross-validation for model selection , 2006, BMC Bioinformatics.

[36]  Gary M. Weiss Mining with rarity: a unifying framework , 2004, SKDD.

[37]  Gang Wang,et al.  Crime data mining: a general framework and some examples , 2004, Computer.

[38]  Aida Mustapha,et al.  An experimental study of classification algorithms for crime prediction. , 2013 .