An architecture for emergency event prediction using LSTM recurrent neural networks

Abstract Emergency event prediction is a crucial topic since the events could involve human injuries or even deaths. Many countries record a considerable number of emergency events (EVs) that are caused by a variety of incidents such as murder and robbery. Emergency response systems based on more accurate EV prediction can help to allocate the required resources and resolve the emergencies through more rapid and effective risk management. Most real-time EV prediction systems are based on traditional time series analysis techniques such as moving average or autoregressive integrated moving average (ARIMA) models. To improve the accuracy of EV prediction, we propose a new architecture for EV prediction based on recurrent neural networks (RNN), specifically a long short-term memory (LSTM) architecture. A comparative analysis is presented to show the effectiveness of the proposed architecture compared to traditional time series analysis and machine learning methods through the evaluation of historical EV data provided by the national police of Guatemala.

[1]  Etienne G Krug,et al.  Global Status Report on Violence Prevention 2014. , 2015, American journal of preventive medicine.

[2]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[3]  Jerry H. Ratcliffe,et al.  Aoristic Signatures and the Spatio-Temporal Analysis of High Volume Crime Patterns , 2002 .

[4]  Yoshua Bengio,et al.  Learning long-term dependencies with gradient descent is difficult , 1994, IEEE Trans. Neural Networks.

[5]  Jürgen Schmidhuber,et al.  Learning Precise Timing with LSTM Recurrent Networks , 2003, J. Mach. Learn. Res..

[6]  C. Holt Author's retrospective on ‘Forecasting seasonals and trends by exponentially weighted moving averages’ , 2004 .

[7]  Konstantinos Pelechrinis,et al.  Urban navigation beyond shortest route: The case of safe paths , 2016, Inf. Syst..

[8]  B. Bowerman,et al.  Forecasting, time series, and regression : an applied approach , 2005 .

[9]  Li-Yen Chang,et al.  Analysis of freeway accident frequencies: Negative binomial regression versus artificial neural network , 2005 .

[10]  Giles Oatley,et al.  Crimes analysis software: 'pins in maps', clustering and Bayes net prediction , 2003, Expert Syst. Appl..

[11]  Wei Ding,et al.  Crime Forecasting Using Spatio-temporal Pattern with Ensemble Learning , 2014, PAKDD.

[12]  E. McKenzie General exponential smoothing and the equivalent arma process , 1984 .

[13]  Vipin Kumar,et al.  Introduction to Data Mining , 2022, Data Mining and Machine Learning Applications.

[14]  Donald W. Bouldin,et al.  A Cluster Separation Measure , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[16]  Mangai Natarajan,et al.  Police Response to Domestic Violence: A Case Study of TecSOS Mobile Phone Use in the London Metropolitan Police Service , 2016 .

[17]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[18]  Wilpen L. Gorr,et al.  Short-term forecasting of crime , 2003 .

[19]  David A. Landgrebe,et al.  A survey of decision tree classifier methodology , 1991, IEEE Trans. Syst. Man Cybern..

[20]  May Yuan,et al.  Assessing Similarity of Geographic Processes and Events , 2005, Trans. GIS.

[21]  Hans-Peter Kriegel,et al.  A Database Interface for Clustering in Large Spatial Databases , 1995, KDD.

[22]  Sankar K. Pal,et al.  Multilayer perceptron, fuzzy sets, and classification , 1992, IEEE Trans. Neural Networks.

[23]  Laurene V. Fausett,et al.  Fundamentals Of Neural Networks , 1994 .

[24]  Roberto Rosas-Romero,et al.  Forecasting of stock return prices with sparse representation of financial time series over redundant dictionaries , 2016, Expert Syst. Appl..

[25]  Anil K. Jain Data clustering: 50 years beyond K-means , 2010, Pattern Recognit. Lett..

[26]  Deyi Li,et al.  Spatial Data Mining: Theory and Application , 2016 .

[27]  Hermann Ney,et al.  Cross-entropy vs. squared error training: a theoretical and experimental comparison , 2013, INTERSPEECH.

[28]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[29]  Rob J Hyndman,et al.  Automatic Time Series Forecasting: The forecast Package for R , 2008 .

[30]  Slava Kisilevich,et al.  Spatio-temporal clustering , 2010, Data Mining and Knowledge Discovery Handbook.

[31]  I. D. Wilson,et al.  Predicting the geo-temporal variations of crime and disorder , 2003 .

[32]  Sheng-Tun Li,et al.  An intelligent decision-support model using FSOM and rule extraction for crime prevention , 2010, Expert Syst. Appl..

[33]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[34]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[35]  Yoshua Bengio,et al.  Random Search for Hyper-Parameter Optimization , 2012, J. Mach. Learn. Res..

[36]  Pascal Vincent,et al.  Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[37]  D. McFadden Conditional logit analysis of qualitative choice behavior , 1972 .

[38]  Jürgen Schmidhuber,et al.  LSTM: A Search Space Odyssey , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[39]  Zhenliang Liao,et al.  Environmental emergency decision support system based on Artificial Neural Network , 2012 .

[40]  Donald E. Brown,et al.  A decision model for spatial site selection by criminals: a foundation for law enforcement decision support , 2003, IEEE Trans. Syst. Man Cybern. Part C.

[41]  Valentina Janev,et al.  Application of Complex Event Processing Paradigm in Situation Awareness and Management , 2011, 2011 22nd International Workshop on Database and Expert Systems Applications.