Measuring Ambient Population from Location-Based Social Networks to Describe Urban Crime

Recently, a lot of attention has been given to crime prediction, both by the general public and by the research community. Most of the latest work has concentrated on showing the potential of novel data sources like social media, mobile phone data, points of interest, or transportation data for the crime prediction task and researchers have focused mostly on techniques from supervised machine learning to show their predictive potential. Yet, the question remains if indeed this data can be used to better describe urban crime. In this paper, we investigate the potential of data harvested from location-based social networks (specifically Foursquare) to describe urban crime. Towards this end, we apply techniques from spatial econometrics. We show that this data, seen as a measurement for the ambient population of a neighborhood, is able to further describe crime levels in comparison to models built solely on census data, seen as measurement for the resident population of a neighborhood. In an analysis of crime on census tract level in New York City, the total number of incidents can be described by our models with up to \(R^2 = 56\%\), while the best model for the different crime subtypes is achieved for larcenies with roughly \(67\%\) of the variance explained.

[1]  M. Maguire The Crime Reduction Programme in England and Wales , 2004 .

[2]  Robert K. Peet,et al.  The Measurement of Species Diversity , 1974 .

[3]  J. Ord,et al.  Spatial Processes. Models and Applications , 1982 .

[4]  Giovanni Quattrone,et al.  Mining Mobile Phone Data to Investigate Urban Crime Theories at Scale , 2014, SocInfo.

[5]  L. Anselin,et al.  Digital neighborhoods , 2016 .

[6]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[7]  Daniel Arribas-Bel,et al.  The sociocultural sources of urban buzz , 2016 .

[8]  P. Brantingham,et al.  Nodes, paths and edges: Considerations on the complexity of crime and the physical environment , 1993 .

[9]  Lawrence E. Cohen,et al.  Social Change and Crime Rate Trends: A Routine Activity Approach , 1979 .

[10]  L. Anselin Spatial Econometrics: Methods and Models , 1988 .

[11]  D. Cox,et al.  An Analysis of Transformations , 1964 .

[12]  P. Brantingham,et al.  Criminality of place , 1995 .

[13]  Travis C. Pratt,et al.  Assessing Macro-Level Predictors and Theories of Crime: A Meta-Analysis , 2005, Crime and Justice.

[14]  Daniel Kifer,et al.  Crime Rate Inference with Big Data , 2016, KDD.

[15]  Robert J. Sampson,et al.  Spatial Heterogeneity in the Effects of Immigration and Diversity on Neighborhood Homicide Rates , 2009, Homicide studies.

[16]  Alex Pentland,et al.  Once Upon a Crime: Towards Crime Prediction from Demographics and Mobile Data , 2014, ICMI.

[17]  J. Neter,et al.  Applied Linear Regression Models , 1983 .

[18]  Nicu Sebe,et al.  The Death and Life of Great Italian Cities: A Mobile Phone Data Perspective , 2016, WWW.

[19]  L. Anselin Local Indicators of Spatial Association—LISA , 2010 .

[20]  Zbigniew Smoreda,et al.  An analytical framework to nowcast well-being using mobile phone data , 2016, International Journal of Data Science and Analytics.

[21]  Sergio J. Rey,et al.  PySAL: A Python Library of Spatial Analytical Methods , 2010 .

[22]  A. Sheldon,et al.  Equitability Indices: Dependence on the Species Count , 1969 .

[23]  Cecilia Mascolo,et al.  Geo-spotting: mining online location-based services for optimal retail store placement , 2013, KDD.

[24]  M. Goodchild Citizens as sensors: the world of volunteered geography , 2007 .

[25]  S. Raudenbush,et al.  Neighborhoods and violent crime: a multilevel study of collective efficacy. , 1997, Science.

[26]  James R. Kluegel,et al.  Social Inequality and Predatory Criminal Victimization: An Exposition and Test of a Formal Theory , 1981 .

[27]  Galit Shmueli,et al.  To Explain or To Predict? , 2010, 1101.0891.

[28]  Matthew S. Gerber,et al.  Predicting crime using Twitter and kernel density estimation , 2014, Decis. Support Syst..

[29]  J. Jacobs The Death and Life of Great American Cities , 1962 .

[30]  N. Eagle,et al.  Network Diversity and Economic Development , 2010, Science.