Network-based likelihood modeling of event occurrences in space and time: a case study of traffic accidents in Dallas, Texas, USA

ABSTRACT We propose a novel approach to network-based event likelihood modeling that estimates the probabilities of event occurrences on a network and identifies the influences of site and situation characteristics. Our premise is that the occurrences of events that involve human activities are subject to site and situational characteristics, and an understanding of event occurrences serves the basis for preparation or mitigation. Using data from Dallas, Texas, USA, we take the proposed approach to estimate the likelihood of traffic accidents based on binary (event or nonevent) space–time atoms of 100-m road segments and 1-h intervals. We choose 12 variables representing time, site characteristics, and situational conditions based on literature reviews to develop logistic regression and random forest models. The traffic accident data on even days were used for model construction and data on odd days for model testing. Both models result in comparable accuracy at 84.11% (logistic regression) and 85.42% (random forest) with significant differences in the spatial patterns of how site and situation correlate to traffic accidents. The difference signals the dynamic influence of site and situation characteristics on the event likelihood over time. The proposed approach shall be applicable to other point events on a network.

[1]  Wei Qin,et al.  Modeling urban expansion by using variable weights logistic cellular automata: a case study of Nanjing, China , 2017, Int. J. Geogr. Inf. Sci..

[2]  Gary King,et al.  Logistic Regression in Rare Events Data , 2001, Political Analysis.

[3]  M. Townsley,et al.  Infectious Burglaries. A Test of the Near Repeat Hypothesis , 2003 .

[4]  William J. Elliot,et al.  Spatial Prediction of Landslide Hazard Using Logistic Regression and ROC Analysis , 2006, Trans. GIS.

[5]  Haavard Rue,et al.  A toolbox for fitting complex spatial point process models using integrated nested Laplace approximation (INLA) , 2012, 1301.1817.

[6]  W. Siddall,et al.  Geography as Spatial Interaction , 1980 .

[7]  Jun Yan,et al.  Kernel Density Estimation of traffic accidents in a network space , 2008, Comput. Environ. Urban Syst..

[8]  Shenjun Yao,et al.  Traffic collisions in space: four decades of advancement in applied GIS , 2016, Ann. GIS.

[9]  Trevor Hastie,et al.  An Introduction to Statistical Learning , 2013, Springer Texts in Statistics.

[10]  E. Renshaw,et al.  Gibbs point processes for studying the development of spatial-temporal stochastic processes , 2001 .

[11]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[12]  L. Chapman,et al.  The influence of traffic on road surface temperatures: implications for thermal mapping studies , 2005 .

[13]  Christopher J. Paciorek,et al.  Computational techniques for spatial logistic regression with large data sets , 2007, Comput. Stat. Data Anal..

[14]  Virgilio Gómez-Rubio,et al.  Spatial Point Patterns: Methodology and Applications with R , 2016 .

[15]  Marta Blangiardo,et al.  A space–time multivariate Bayesian model to analyse road traffic accidents by severity , 2017 .

[16]  Harvey J. Miller,et al.  Exploring traffic flow databases using space-time plots and data cubes , 2011, Transportation.

[17]  Paul P Jovanis,et al.  Analysis of Road Crash Frequency with Spatial Models , 2008 .

[18]  D. Call An Assessment of National Weather Service Warning Procedures for Ice Storms , 2009 .

[19]  M. Parimala,et al.  A Survey on Density Based Clustering Algorithms for Mining Large Spatial Databases , 2011 .

[20]  Victoria Gitelman,et al.  Evaluating Crash Risk in Urban Areas Based on Vehicle and Pedestrian Modeling , 2017 .

[21]  Seiji Hashimoto,et al.  Development and application of traffic accident density estimation models using kernel density estimation , 2016 .

[22]  D. Ruppert The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .

[23]  S. MacEachern,et al.  Bayesian Nonparametric Spatial Modeling With Dirichlet Process Mixing , 2005 .

[24]  D. Stoyan,et al.  Statistical Analysis and Modelling of Spatial Point Patterns , 2008 .

[25]  B. Warf,et al.  Introduction: the reinsertion of space in the humanities and social sciences , 2009 .

[26]  Bill Hillier,et al.  Exploring countrywide spatial systems: Spatio-structural correlates at the regional and national scales , 2015 .

[27]  Michael F. Goodchild,et al.  Towards a general theory of geographic representation in GIS , 2007, Int. J. Geogr. Inf. Sci..

[28]  Atsuyuki Okabe,et al.  A kernel density estimation method for networks, its computational method and a GIS‐based tool , 2009, Int. J. Geogr. Inf. Sci..

[29]  Alasdair Turner,et al.  From Axial to Road-Centre Lines: A New Representation for Space Syntax and a New Model of Route Choice for Transport Network Analysis , 2007 .

[30]  M. N. M. Lieshout,et al.  ADRIAN BADDELEY, EGE RUBAK, AND ROLF TURNER, Spatial Point Patterns: Methodology and Applications with R. Boca Raton, FL: CRC Press , 2016 .

[31]  Chao Wang,et al.  The effect of traffic and road characteristics on road safety: A review and future research direction , 2013 .

[32]  R. Waagepetersen,et al.  Modern Statistics for Spatial Point Processes * , 2007 .

[33]  Tessa K Anderson,et al.  Kernel density estimation and K-means clustering to profile road accident hotspots. , 2009, Accident; analysis and prevention.

[34]  M. Kulldorff Spatial Scan Statistics: Models, Calculations, and Applications , 1999 .

[35]  Richard Silberglitt,et al.  The Road to Zero: A Vision for Achieving Zero Roadway Deaths by 2050. , 2018, Rand health quarterly.

[36]  Steven M. Manson,et al.  Spatial simulation: exploring pattern and process , 2015, Int. J. Geogr. Inf. Sci..

[37]  Metin Senbil,et al.  Spatio-temporal accident analysis for accident prevention in relation to behavioral factors in driving: The case of South Anatolian Motorway , 2015 .

[38]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[39]  Benjamin Romano,et al.  Visualizing Traffic Accident Hotspots Based on Spatial-Temporal Network Kernel Density Estimation , 2017, SIGSPATIAL/GIS.

[40]  Lin Qiu,et al.  Effects of Adverse Weather on Traffic Crashes , 2008 .

[41]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[42]  Bill Hillier,et al.  Network and Psychological Effects in Urban Movement , 2005, COSIT.