Algorithms for Crime Prediction in Smart Cities Through Data Mining

The concentration of police resources in conflict zones contributes to the reduction of crime in the region and the optimization of those resources. This paper presents the use of regression techniques to predict the number of criminal acts in Colombian municipalities. To this end, a set of data was generated merging the data from the Guardia Civil with public data on the demographic structure and voting trends in the municipalities. The best regressor obtained (Random Forests) achieves a RRSE (Root Relative Squared Error) of 40.12% and opens the way to keep incorporating public data of another type with greater predictive power. In addition, M5Rules were used to interpret the results.

[1]  Hang-Bong Kang,et al.  Prediction of crime occurrence from multi-modal data using deep learning , 2017, PloS one.

[2]  Viloria Amelec,et al.  Relationship Between Variables of Performance Social and Financial of Microfinance Institutions , 2015 .

[3]  Luiz G A Alves,et al.  Crime prediction through urban metrics and statistical learning , 2017, Physica A: Statistical Mechanics and its Applications.

[4]  Amelec Viloria,et al.  Improvements for Determining the Number of Clusters in k-Means for Innovation Databases in SMEs , 2019, ANT/EDI40.

[5]  Charlie S. Marzan,et al.  Geospatial-temporal analysis and classification of criminal data in Manila , 2017, 2017 2nd IEEE International Conference on Computational Intelligence and Applications (ICCIA).

[6]  Donald E. Brown,et al.  Spatial and temporal analysis of sex crimes in Charlottesville, Virginia , 2015, 2015 Systems and Information Engineering Design Symposium.

[7]  Reynold Xin,et al.  Apache Spark , 2016 .

[8]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[9]  Amelec Viloria,et al.  Database Performance Tuning and Query Optimization , 2018, DMBD.

[10]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[11]  Jorge C. Leitao,et al.  Is this scaling nonlinear? , 2016, Royal Society Open Science.

[12]  Reda Alhajj,et al.  EFFECTIVENESS OF SUPPORT VECTOR MACHINE FOR CRIME HOT-SPOTS PREDICTION , 2008, Appl. Artif. Intell..

[13]  Rajeev Motwani,et al.  Scalable Techniques for Mining Causal Structures , 1998, Data Mining and Knowledge Discovery.

[14]  Harris Drucker,et al.  Improving Regressors using Boosting Techniques , 1997, ICML.

[15]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[16]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[17]  Anton Borg,et al.  Finding Influential Users in Social Media Using Association Rule Learning , 2016, Entropy.

[18]  Michael Hahsler,et al.  Visualizing association rules in hierarchical groups , 2016, Journal of Business Economics.

[19]  Amelec Viloria,et al.  Integration of Data Mining Techniques to PostgreSQL Database Manager System , 2019, Procedia Computer Science.