Recidivism forecasting: A study on process of feature selection

Across the world, there are several factors attributed to high crime rates. The prevention of and the fight against crimes is a major concern of all countries. In the era of globalization and new information and communication technologies, reducing these crimes rate by using conventional methods (law enforcement, social interventions...) are not enough. In fact, they have many limits. Today, by analyzing a large volume of crimes data with machine learning algorithms, researchers can take important advantage of these technologies, especially in the context of the world's famous problem of recidivism. By using these recent innovations, security departments can predict how, when, and where reoffending will happen before it actually happens. However, the efficiency, the quality, and the accuracy of these forcasting models and software depend on several factors. The process of feature selection is one of these key factors. By improving the quality of this process, we can reduce over fitting and eliminating redundant data as well as training time. In this context, this investigation paid particular attention to the process of recidivism features selection (first phase of our future recidivism forcasting framework). Based on detailed study of recidivism theoretical factors, previous and recent methods used in recidivism features selection, we present a comparative study on all key elements used in this phase (features, categories of features and methods of features selection). Our main objective is to prepare an important knowledge database for recidivism features. This database will take into account different sets of recidivism features obtained by all previous and recent projects. It will also be used in our recidivism forcasting framework.

[1]  Jose Miguel Puerta,et al.  A GRASP algorithm for fast hybrid (filter-wrapper) feature subset selection in high-dimensional datasets , 2011, Pattern Recognit. Lett..

[2]  Arif Gülten,et al.  A Robust Multi-Class Feature Selection Strategy Based on Rotation Forest Ensemble Algorithm for Diagnosis of Erythemato-Squamous Diseases , 2012, Journal of Medical Systems.

[3]  Michel Lang,et al.  A Multicriteria Approach to Find Predictive and Sparse Models with Stable Feature Selection for High-Dimensional Data , 2017, Comput. Math. Methods Medicine.

[4]  Huan Liu,et al.  Chi2: feature selection and discretization of numeric attributes , 1995, Proceedings of 7th IEEE International Conference on Tools with Artificial Intelligence.

[5]  Donghai Guan,et al.  A Review of Ensemble Learning Based Feature Selection , 2014 .

[6]  W. Mohamed,et al.  Reducing Recidivism Rates through Vocational Education and Training , 2015 .

[7]  Pa-Chun Wang,et al.  Particle swarm optimization for feature selection with application in obstructive sleep apnea diagnosis , 2011, Neural Computing and Applications.

[8]  SahinFerat,et al.  A survey on feature selection methods , 2014 .

[9]  J. V. van Horn,et al.  Comparing the Central Eight Risk Factors: Do They Differ Across Age Groups of Sex Offenders? , 2018, International journal of offender therapy and comparative criminology.

[10]  R. Banse,et al.  Pro-criminal attitudes, intervention, and recidivism ☆ , 2013 .

[11]  J. Fleiss Statistical methods for rates and proportions , 1974 .

[12]  Colas Schretter,et al.  Information-Theoretic Feature Selection in Microarray Data Using Variable Complementarity , 2008, IEEE Journal of Selected Topics in Signal Processing.

[13]  Ron Kohavi,et al.  Feature Subset Selection Using the Wrapper Method: Overfitting and Dynamic Search Space Topology , 1995, KDD.

[14]  R. Reulke,et al.  Remote Sensing and Spatial Information Sciences , 2005 .

[15]  Mark A. Hall,et al.  Correlation-based Feature Selection for Machine Learning , 2003 .

[16]  J. Bonta,et al.  The Prediction of Recidivism with Aboriginal Offenders: A Theoretically Informed Meta-Analysis , 2013 .

[17]  Duncan Fyfe Gillies,et al.  A Review of Feature Selection and Feature Extraction Methods Applied on Microarray Data , 2015, Adv. Bioinformatics.

[18]  Verónica Bolón-Canedo,et al.  A review of feature selection methods on synthetic data , 2013, Knowledge and Information Systems.

[19]  Ezhilmaran Devarasan,et al.  Computing the Probability on Socio Economic Factors to Predict the Crime Locations by Means of Joint Probability Based AMABC-FCIL , 2016 .

[20]  Sangkyun Lee,et al.  Feature Selection for High-Dimensional Data with RapidMiner , 2012 .

[21]  Serkan Gunal Hybrid feature selection for text classification , 2012 .

[22]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[23]  Wagdy Loza Predicting violent and nonviolent recidivism of incarcerated male offenders. , 2003 .

[24]  Mengjie Zhang,et al.  A Comprehensive Comparison on Evolutionary Feature Selection Approaches to Classification , 2015, Int. J. Comput. Intell. Appl..

[25]  Peter A. Bandettini,et al.  Does feature selection improve classification accuracy? Impact of sample size and feature selection on classification using anatomical magnetic resonance images , 2012, NeuroImage.

[26]  B. Everitt,et al.  Statistical methods for rates and proportions , 1973 .

[27]  Huan Liu,et al.  A Probabilistic Approach to Feature Selection - A Filter Solution , 1996, ICML.

[28]  Li Guo,et al.  TCM-KNN scheme for network anomaly detection using feature-based optimizations , 2008, SAC '08.

[29]  Amparo Alonso-Betanzos,et al.  A Wrapper Method for Feature Selection in Multiple Classes Datasets , 2009, IWANN.

[30]  Chenn-Jung Huang,et al.  Application of wrapper approach and composite classifier to the stock trend prediction , 2008, Expert Syst. Appl..

[31]  Simon Fong,et al.  Feature selection methods: Case of filter and wrapper approaches for maximising classification accuracy , 2018 .

[32]  Aravind Seshadri,et al.  A FAST ELITIST MULTIOBJECTIVE GENETIC ALGORITHM: NSGA-II , 2000 .

[33]  Dana Kulic,et al.  An evaluation of classifier-specific filter measure performance for feature selection , 2015, Pattern Recognit..

[34]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[35]  Richard Weber,et al.  Feature selection for high-dimensional class-imbalanced data sets using Support Vector Machines , 2014, Inf. Sci..

[36]  Mahdi Eftekhari,et al.  A Hybrid Filter-Based Feature Selection Method via Hesitant Fuzzy and Rough Sets Concepts , 2018, How Fuzzy Concepts Contribute to Machine Learning.

[37]  Ferat Sahin,et al.  A survey on feature selection methods , 2014, Comput. Electr. Eng..

[38]  Huan Liu,et al.  Feature selection for classification: A review , 2014 .

[39]  Sofie Van Roeyen,et al.  Criminal careers in transition: the social context of desistance from crime , 2016 .

[40]  Amir-Massoud Bidgoli,et al.  A Hybrid Feature Selection Method to Improve Performance of a Group of Classification Algorithms , 2013, ArXiv.

[41]  Gavin Brown,et al.  Conditional Likelihood Maximisation: A Unifying Framework for Information Theoretic Feature Selection , 2012, J. Mach. Learn. Res..

[42]  Amir Reza Saffari Azar Alamdari Variable Selection using Correlation and Single Variable Classifier Methods: Applications , 2006 .

[43]  L. Motiuk The Statistical Information on Recidivism - Revised 1 (SIR-R1) Scale: A Psychometric Examination , 2002 .

[44]  Laurent Younes,et al.  A Stochastic Algorithm for Feature Selection in Pattern Recognition , 2007, J. Mach. Learn. Res..

[45]  Charles E. Catlett,et al.  Spatio-temporal crime predictions in smart cities: A data-driven approach and experiments , 2019, Pervasive Mob. Comput..

[46]  Jianyu Miao,et al.  A Survey on Feature Selection , 2016 .

[47]  J. Bonta,et al.  The prediction of criminal and violent recidivism among mentally disordered offenders: a meta-analysis. , 1998, Psychological bulletin.

[48]  Kevin T. Wolff,et al.  Dynamic risk factors and timing of recidivism for youth in residential placement , 2019, Journal of Criminal Justice.

[49]  Noor Maizura Mohamad Noor,et al.  A Comparative Study to Evaluate Filtering Methods for Crime Data Feature Selection , 2017, ICCSCI.

[50]  Antonio Andrés Pueyo,et al.  THE PSYCHOLOGY OF CRIMINAL CONDUCT , 2007 .

[51]  J. Anuradha,et al.  A Review of Feature Selection and Its Methods , 2019, Cybernetics and Information Technologies.

[52]  Cynthia Rudin,et al.  Interpretable classification models for recidivism prediction , 2015, 1503.07810.