Bayesian ranking of sites for engineering safety improvements: decision parameter, treatability concept, statistical criterion, and spatial dependence.

In recent years, there has been a renewed interest in applying statistical ranking criteria to identify sites on a road network, which potentially present high traffic crash risks or are over-represented in certain type of crashes, for further engineering evaluation and safety improvement. This requires that good estimates of ranks of crash risks be obtained at individual intersections or road segments, or some analysis zones. The nature of this site ranking problem in roadway safety is related to two well-established statistical problems known as the small area (or domain) estimation problem and the disease mapping problem. The former arises in the context of providing estimates using sample survey data for a small geographical area or a small socio-demographic group in a large area, while the latter stems from estimating rare disease incidences for typically small geographical areas. The statistical problem is such that direct estimates of certain parameters associated with a site (or a group of sites) with adequate precision cannot be produced, due to a small available sample size, the rareness of the event of interest, and/or a small exposed population or sub-population in question. Model based approaches have offered several advantages to these estimation problems, including increased precision by "borrowing strengths" across the various sites based on available auxiliary variables, including their relative locations in space. Within the model based approach, generalized linear mixed models (GLMM) have played key roles in addressing these problems for many years. The objective of the study, on which this paper is based, was to explore some of the issues raised in recent roadway safety studies regarding ranking methodologies in light of the recent statistical development in space-time GLMM. First, general ranking approaches are reviewed, which include naïve or raw crash-risk ranking, scan based ranking, and model based ranking. Through simulations, the limitation of using the naïve approach in ranking is illustrated. Second, following the model based approach, the choice of decision parameters and consideration of treatability are discussed. Third, several statistical ranking criteria that have been used in biomedical, health, and other scientific studies are presented from a Bayesian perspective. Their applications in roadway safety are then demonstrated using two data sets: one for individual urban intersections and one for rural two-lane roads at the county level. As part of the demonstration, it is shown how multivariate spatial GLMM can be used to model traffic crashes of several injury severity types simultaneously and how the model can be used within a Bayesian framework to rank sites by crash cost per vehicle-mile traveled (instead of by crash frequency rate). Finally, the significant impact of spatial effects on the overall model goodness-of-fit and site ranking performances are discussed for the two data sets examined. The paper is concluded with a discussion on possible directions in which the study can be extended.

[1]  Peter J. Diggle,et al.  Statistical analysis of spatial point patterns , 1983 .

[2]  Christian P. Robert,et al.  Monte Carlo Statistical Methods , 2005, Springer Texts in Statistics.

[3]  H. Goldstein Multilevel Statistical Models , 2006 .

[4]  R Tunaru Hierarchical Bayesian models for road accident data , 1999 .

[5]  Eric R. Green,et al.  DEVELOPMENT OF PROCEDURES FOR IDENTIFYING HIGH-CRASH LOCATIONS AND PRIORITIZING SAFETY IMPROVEMENTS , 2003 .

[6]  J. Besag,et al.  Bayesian Computation and Stochastic Systems , 1995 .

[7]  Bhavani Shankar,et al.  Bayesian Spatial Probit Estimation: A primer and an application to HYV rice adoption , 2002 .

[8]  Jake Kononov Identifying Locations with Potential for Accident Reductions: Use of Direct Diagnostics and Pattern Recognition Methodologies , 2002 .

[9]  Luc Anselin,et al.  Spatial Externalities, Spatial Multipliers, And Spatial Econometrics , 2003 .

[10]  A. Lawson Disease map reconstruction , 2001, Statistics in medicine.

[11]  Craig Lyon,et al.  Empirical Bayes Procedure for Ranking Sites for Safety Investigation by Potential for Safety Improvement , 1999 .

[12]  Bradley P. Carlin,et al.  BAYES AND EMPIRICAL BAYES METHODS FOR DATA ANALYSIS , 1996, Stat. Comput..

[13]  Noel A Cressie,et al.  Statistics for Spatial Data. , 1992 .

[14]  Alexis Raffalovich The State Monopoly of Spirits in Russia, and its Influence on the Prosperity of the Population , 1901 .

[15]  Peter Congdon Bayesian statistical modelling , 2002 .

[16]  Andrew B. Lawson,et al.  Spatial cluster modelling , 2002 .

[17]  T. Louis,et al.  Triple‐goal estimates in two‐stage hierarchical models , 1998 .

[18]  Trevor C. Bailey,et al.  Interactive Spatial Data Analysis , 1995 .

[19]  Ezra Hauer,et al.  Screening the Road Network for Sites with Promise , 2002 .

[20]  B. Mallick,et al.  Bayesian multivariate spatial models for roadway traffic crash mapping , 2006 .

[21]  Sylvia Richardson,et al.  Markov Chain Monte Carlo in Practice , 1997 .

[22]  Sw. Banerjee,et al.  Hierarchical Modeling and Analysis for Spatial Data , 2003 .

[23]  Bani K. Mallick,et al.  ROADWAY TRAFFIC CRASH MAPPING: A SPACE-TIME MODELING APPROACH , 2003 .

[24]  J. LeSage A Family of Geographically Weighted Regression Models , 2004 .

[25]  Mike Rees,et al.  5. Statistics for Spatial Data , 1993 .

[26]  Shaw-Pin Miaou,et al.  Modeling Traffic Crash-Flow Relationships for Intersections: Dispersion Parameter, Functional Form, and Bayes Versus Empirical Bayes Methods , 2003 .

[27]  Dongchu Sun,et al.  Spatio-temporal interaction with disease mapping. , 2000, Statistics in medicine.

[28]  C C Wright,et al.  ACCIDENT "MIGRATION" AFTER REMEDIAL TREATMENT AT ACCIDENT BLACKSPOTS , 1984 .

[29]  R. Haining Spatial Data Analysis in the Social and Environmental Sciences , 1990 .

[30]  B. G. Heydecker,et al.  Identification of sites for road accident remedial work by Bayesian statistical methods: an example of uncertain inference , 2001 .

[31]  Malay Ghosh,et al.  Hierarchical Bayes GLMs for the analysis of spatial data: An application to disease mapping , 1999 .

[32]  David B. Dunson,et al.  Bayesian Data Analysis , 2010 .

[33]  Alan Nicholson,et al.  Ranking and selecting motor vehicle accident sites by using a hierarchical Bayesian model , 1997 .

[34]  Daniel A. Griffith,et al.  Advanced Spatial Statistics , 1988 .

[35]  M J Maher,et al.  A bivariate negative binomial model to explain traffic accident migration. , 1990, Accident; analysis and prevention.

[36]  Bhagwant Persaud,et al.  Accident Prediction Models With and Without Trend: Application of the Generalized Estimating Equations Procedure , 2000 .

[37]  Dominique Lord,et al.  The prediction of accidents on digital networks, characteristics and issues related to the application of accident prediction models , 2000 .

[38]  Harvey Goldstein,et al.  League Tables and Their Limitations: Statistical Issues in Comparisons of Institutional Performance , 1996 .

[39]  E. Lesaffre,et al.  Disease mapping and risk assessment for public health. , 1999 .

[40]  Jesper Møller,et al.  Spatial statistics and computational methods , 2003 .

[41]  Chris Brunsdon,et al.  Geographically Weighted Regression: The Analysis of Spatially Varying Relationships , 2002 .

[42]  Luc Anselin,et al.  Do spatial effects really matter in regression analysis , 2005 .

[43]  J. Rao Small Area Estimation , 2003 .

[44]  Shauna L. Hallmark,et al.  EVALUATION OF THE IOWA DOT'S SAFETY IMPROVEMENT CANDIDATE LIST PROCESS , 2002 .