Hierarchical Spatio-Temporal Pattern Discovery and Predictive Modeling

We propose a new approach, CCRBoost, to identify the hierarchical structure of spatio-temporal patterns at different resolution levels and subsequently construct a predictive model based on the identified structure. To accomplish this, we first obtain indicators within different spatio-temporal spaces from the raw data. A distributed spatio-temporal pattern (DSTP) is extracted from a distribution, which consists of the locations with similar indicators from the same time period, generated by multi-clustering. Next, we use a greedy searching and pruning algorithm to combine the DSTPs in order to form an ensemble spatio-temporal pattern (ESTP). An ESTP can represent the spatio-temporal pattern of various regularities or a non-stationary pattern. To consider all the possible scenarios of a real-world ST pattern, we then build a model with layers of weighted ESTPs. By evaluating all the indicators of one location, this model can predict whether a target event will occur at this location. In the case study of predicting crime events, our results indicate that the predictive model can achieve 80 percent accuracy in predicting residential burglary, which is better than other methods.

[1]  J. Friedman Special Invited Paper-Additive logistic regression: A statistical view of boosting , 2000 .

[2]  Robi Polikar,et al.  Incremental Learning of Concept Drift in Nonstationary Environments , 2011, IEEE Transactions on Neural Networks.

[3]  T. Pratt,et al.  THE EMPIRICAL STATUS OF GOTTFREDSON AND HIRSCHI'S GENERAL THEORY OF CRIME: A META‐ANALYSIS , 2000 .

[4]  Samuel Greengard,et al.  Policing the future , 2012, Commun. ACM.

[5]  K. Leong,et al.  A review of spatio-temporal pattern analysis approaches on crime analysis , 2015 .

[6]  Stan Matwin,et al.  Addressing the Curse of Imbalanced Training Sets: One-Sided Selection , 1997, ICML.

[7]  Gordon F. Mulligan,et al.  Using Geographically Weighted Regression to Explore Local Crime Patterns , 2007 .

[8]  Wilpen L. Gorr,et al.  Development of Crime Forecasting and Mapping Systems for Use by Police , 2005 .

[9]  Shane D. Johnson,et al.  The Stability of Space-Time Clusters of Burglary , 2004 .

[10]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[11]  M. Townsley,et al.  Infectious Burglaries. A Test of the Near Repeat Hypothesis , 2003 .

[12]  Trevor Hastie,et al.  Additive Logistic Regression : a Statistical , 1998 .

[13]  Trevor Bennett,et al.  Preventing Residential Burglary in Cambridge: From Crime Audits to Targeted Strategies , 1999 .

[14]  Graham Farrell,et al.  CRIM SEASONALITY: Domestic Disputes and Residential Burglary in Merseyside 1988–90 , 1994 .

[15]  Wei Ding,et al.  Crime Forecasting Using Data Mining Techniques , 2011, 2011 IEEE 11th International Conference on Data Mining Workshops.

[16]  Shashi Shekhar,et al.  Cascading Spatio-Temporal Pattern Discovery , 2012, IEEE Transactions on Knowledge and Data Engineering.

[17]  Bryan A. Garner,et al.  A Dictionary of Modern Legal Usage , 1987 .

[18]  Andrea L. Bertozzi,et al.  Nonlinear Patterns in Urban Crime: Hotspots, Bifurcations, and Suppression , 2010, SIAM J. Appl. Dyn. Syst..

[19]  J. W. Tukey,et al.  The Measurement of Power Spectra from the Point of View of Communications Engineering , 1958 .

[20]  Yiyi Wang ANTICIPATING LAND USE CHANGE USING GEOGRAPHICALLY WEIGHTED REGRESSION MODELS FOR DISCRETE RESPONSE , 2011 .

[21]  Thomas G. Dietterich Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms , 1998, Neural Computation.

[22]  Omar F. El-Gayar,et al.  Discovering Predictive Event Sequences in Criminal Careers , 2013 .

[23]  Yoram Singer,et al.  Improved Boosting Algorithms Using Confidence-rated Predictions , 1998, COLT' 98.

[24]  Kam C. Wong Black's theory on the behavior of law revisited , 1995 .

[25]  George E. Tita,et al.  Self-Exciting Point Process Modeling of Crime , 2011 .

[26]  G. Villarini,et al.  Nonstationary modeling of a long record of rainfall and temperature over Rome , 2010 .

[27]  A. Malathi,et al.  An Enhanced Algorithm to Predict a Future Crime using Data Mining , 2011 .

[28]  Nikos Mamoulis,et al.  Mining frequent spatio-temporal sequential patterns , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[29]  Geoff Holmes,et al.  Multiclass Alternating Decision Trees , 2002, ECML.

[30]  M. Charlton,et al.  Geographically Weighted Regression: A Natural Evolution of the Expansion Method for Spatial Data Analysis , 1998 .

[31]  John G. Cleary,et al.  K*: An Instance-based Learner Using and Entropic Distance Measure , 1995, ICML.

[32]  Christopher J. Sullivan,et al.  Local Life Circumstances and Offending Specialization/Versatility , 2007 .

[33]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[34]  Hwan-Seung Yong,et al.  Mining Spatio-Temporal Patterns in Trajectory Data , 2010, J. Inf. Process. Syst..

[35]  Aida Mustapha,et al.  An experimental study of classification algorithms for crime prediction. , 2013 .

[36]  Gang Wang,et al.  Crime data mining: a general framework and some examples , 2004, Computer.

[37]  Yoshua Bengio Scaling up deep learning , 2014, KDD.

[38]  Yoav Freund,et al.  Large Margin Classification Using the Perceptron Algorithm , 1998, COLT.

[39]  Beth Pearsall,et al.  Predictive Policing: The Future of Law Enforcement? , 2010 .

[40]  Srinivasan Parthasarathy,et al.  A generalized framework for mining spatio-temporal patterns in scientific data , 2005, KDD '05.

[41]  Jerry H. Ratcliffe,et al.  Aoristic Signatures and the Spatio-Temporal Analysis of High Volume Crime Patterns , 2002 .

[42]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[43]  Shashi Shekhar,et al.  Focal-Test-Based Spatial Decision Tree Learning , 2015, IEEE Transactions on Knowledge and Data Engineering.

[44]  Vipin Kumar,et al.  Introduction to Data Mining, (First Edition) , 2005 .

[45]  Yoram Singer,et al.  A simple, fast, and effective rule learner , 1999, AAAI 1999.

[46]  Aiko M. Hormann,et al.  Programs for Machine Learning. Part I , 1962, Inf. Control..

[47]  Ilias Fountalis,et al.  Spatio-temporal network analysis for studying climate patterns , 2014, Climate Dynamics.

[48]  M. Vijaya Kumar,et al.  Spatial Clustering Simulation on Analysis of Spatial- Temporal Crime Hotspot for Predicting Crime activities , 2011 .

[49]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.