Applying quantile regression for modeling equivalent property damage only crashes to identify accident blackspots.

Hot spot identification (HSID) aims to identify potential sites-roadway segments, intersections, crosswalks, interchanges, ramps, etc.-with disproportionately high crash risk relative to similar sites. An inefficient HSID methodology might result in either identifying a safe site as high risk (false positive) or a high risk site as safe (false negative), and consequently lead to the misuse the available public funds, to poor investment decisions, and to inefficient risk management practice. Current HSID methods suffer from issues like underreporting of minor injury and property damage only (PDO) crashes, challenges of accounting for crash severity into the methodology, and selection of a proper safety performance function to model crash data that is often heavily skewed by a preponderance of zeros. Addressing these challenges, this paper proposes a combination of a PDO equivalency calculation and quantile regression technique to identify hot spots in a transportation network. In particular, issues related to underreporting and crash severity are tackled by incorporating equivalent PDO crashes, whilst the concerns related to the non-count nature of equivalent PDO crashes and the skewness of crash data are addressed by the non-parametric quantile regression technique. The proposed method identifies covariate effects on various quantiles of a population, rather than the population mean like most methods in practice, which more closely corresponds with how black spots are identified in practice. The proposed methodology is illustrated using rural road segment data from Korea and compared against the traditional EB method with negative binomial regression. Application of a quantile regression model on equivalent PDO crashes enables identification of a set of high-risk sites that reflect the true safety costs to the society, simultaneously reduces the influence of under-reported PDO and minor injury crashes, and overcomes the limitation of traditional NB model in dealing with preponderance of zeros problem or right skewed dataset.

[1]  Bhagwant Persaud DO TRAFFIC SIGNALS AFFECT SAFETY? SOME METHODOLOGICAL ISSUES , 1988 .

[2]  Li-Yen Chang,et al.  Analysis of freeway accident frequencies: Negative binomial regression versus artificial neural network , 2005 .

[3]  Simon Washington,et al.  On the nature of over-dispersion in motor vehicle crash prediction models. , 2007, Accident; analysis and prevention.

[4]  Bhagwant Persaud,et al.  Accident Prediction Models With and Without Trend: Application of the Generalized Estimating Equations Procedure , 2000 .

[5]  Xiao Qin,et al.  Conditional Quantile Analysis for Crash Count Data , 2011 .

[6]  Fred L Mannering,et al.  A note on modeling vehicle accident frequencies with random-parameters count models. , 2009, Accident; analysis and prevention.

[7]  Ezra Hauer,et al.  Observational Before-After Studies in Road Safety , 1997 .

[8]  T R Miller,et al.  DATABOOK ON NONFATAL INJURY: INCIDENCE, COSTS, AND CONSEQUENCES , 1995 .

[9]  Xiao Qin,et al.  Identifying crash-prone locations with quantile regression. , 2010, Accident; analysis and prevention.

[10]  Wen Cheng,et al.  Experimental evaluation of hotspot identification methods. , 2005, Accident; analysis and prevention.

[11]  Fred L. Mannering,et al.  The statistical analysis of crash-frequency data: A review and assessment of methodological alternatives , 2010 .

[12]  J L Higle,et al.  Bayesian identification of hazardous locations , 1988 .

[13]  Craig Lyon,et al.  Empirical Bayes Procedure for Ranking Sites for Safety Investigation by Potential for Safety Improvement , 1999 .

[14]  Shaw-Pin Miaou,et al.  Modeling Traffic Crash-Flow Relationships for Intersections: Dispersion Parameter, Functional Form, and Bayes Versus Empirical Bayes Methods , 2003 .

[15]  Ted R. Miller,et al.  THE ECONOMIC IMPACT OF MOTOR VEHICLE CRASHES, 2000 , 2002 .

[16]  R. B. Albin,et al.  Evaluating Median Crossover Likelihoods with Clustered Accident Counts: An Empirical Inquiry Using the Random Effects Negative Binomial Model , 1998 .

[17]  Simon Washington,et al.  Property Damage Crash Equivalency Factors to Solve Crash Frequency–Severity Dilemma: Case Study on South Korean Rural Roads , 2010 .

[18]  Ezra Hauer,et al.  Estimation of safety at signalized intersections , 1988 .

[19]  Joyoung Lee,et al.  Accident Frequency Model Using Zero Probability Process , 2006 .

[20]  A. Shalaby,et al.  Macrolevel Accident Prediction Models for Evaluating Safety of Urban Transportation Systems , 2003 .

[21]  Wen Cheng,et al.  New Criteria for Evaluating Methods of Identifying Hot Spots , 2008 .

[22]  Dominique Lord,et al.  Further notes on the application of zero-inflated models in highway safety. , 2007, Accident; analysis and prevention.

[23]  F Mannering,et al.  Modeling accident frequencies as zero-altered probability processes: an empirical inquiry. , 1997, Accident; analysis and prevention.

[24]  Paul Damien,et al.  A multivariate Poisson-lognormal regression model for prediction of crash counts by severity, using Bayesian methods. , 2008, Accident; analysis and prevention.

[25]  Xiang Liu,et al.  Accident Analysis and Prevention Analysis of U.s. Freight-train Derailment Severity Using Zero-truncated Negative Binomial Regression and Quantile Regression , 2022 .

[26]  Hoong Chor Chin,et al.  Applying Bayesian hierarchical models to examine motorcycle crashes at signalized intersections. , 2010, Accident; analysis and prevention.

[27]  Dominique Lord,et al.  Application of finite mixture models for vehicle crash data analysis. , 2009, Accident; analysis and prevention.

[28]  Md. Mazharul Haque,et al.  An estimate of road accident costs in Singapore , 2006 .

[29]  Hoong Chor Chin,et al.  Modeling Count Data with Excess Zeroes , 2003 .

[30]  Md. Mazharul Haque,et al.  Empirical Evaluation of Alternative Approaches in Identifying Crash Hot Spots , 2009 .

[31]  Simon Washington,et al.  On the commonly accepted assumptions regarding observed motor vehicle crash counts at transport system locations , 2013 .

[32]  Dominique Lord,et al.  Poisson, Poisson-gamma and zero-inflated regression models of motor vehicle crashes: balancing statistical fit and theory. , 2005, Accident; analysis and prevention.

[33]  Bhagwant Persaud,et al.  ESTIMATING ACCIDENT POTENTIAL OF ONTARIO ROAD SECTIONS , 1991 .

[34]  Bhagwant Persaud,et al.  PROBLEM OF IDENTIFYING HAZARDOUS LOCATIONS USING ACCIDENT DATA , 1984 .

[35]  Xiao Qin Quantile Effects of Causal Factors on Crash Distributions , 2012 .

[36]  J. Higle,et al.  A COMPARISON OF TECHNIQUES FOR THE IDENTIFICATION OF HAZARDOUS LOCATIONS , 1989 .