Comparison of Sichel and Negative Binomial Models in Estimating Empirical Bayes Estimates

Traditionally, transportation safety analysts have used the empirical Bayes (EB) method to improve the estimate of the long-term mean of individual sites and to identify hotspot locations. The EB method combines two sources of information: (a) the expected number of crashes estimated by crash prediction models and (b) the observed number of crashes at individual sites. Because of the overdispersion commonly found in crash data, a negative binomial (NB) modeling framework has been used extensively in crash prediction estimation models. Recent studies have shown that the Sichel (SI) distribution provides a promising avenue for developing crash prediction models. The objective of this study was to examine the application of the SI model in calculating EB estimates. The study used crash data collected at four-lane undivided rural highways in Texas to develop SI models with fixed and varying dispersion terms. The results led to the following main conclusions: (a) the selection of the crash prediction model (i.e., the SI or the NB model) affected the value of the weight factor used for estimating the EB output and (b) the identification of hazardous sites, based on the EB method, could be different when the SI model was used. Finally, a simulation study that is designed to examine which crash prediction model can identify hotspots better is recommended for future research.

[1]  Chandra R. Bhat,et al.  A latent variable representation of count data models to accommodate spatial and temporal dependence: application to predicting crash frequency at intersections , 2011 .

[2]  Srinivas Reddy Geedipally,et al.  The negative binomial-Lindley generalized linear model: characteristics and application using crash data. , 2012, Accident; analysis and prevention.

[3]  Shaw-Pin Miaou,et al.  Modeling Traffic Crash-Flow Relationships for Intersections: Dispersion Parameter, Functional Form, and Bayes Versus Empirical Bayes Methods , 2003 .

[4]  K. El-Basyouny,et al.  Comparison of Two Negative Binomial Regression Techniques in Developing Accident Prediction Models , 2006 .

[5]  E Hauer,et al.  Empirical Bayes approach to the estimation of "unsafety": the multivariate regression method. , 1992, Accident; analysis and prevention.

[6]  Doohee Nam,et al.  Accident prediction model for railway-highway interfaces. , 2006, Accident; analysis and prevention.

[7]  Yajie Zou,et al.  Analyzing Highly Dispersed Crash Data Using the Sichel Generalized Additive Models for Location , Scale and Shape , 2011 .

[8]  H. S. Sichel,et al.  Repeat‐Buying and the Generalized Inverse Gaussian–Poisson Distribution , 1982 .

[9]  F Mannering,et al.  Modeling accident frequencies as zero-altered probability processes: an empirical inquiry. , 1997, Accident; analysis and prevention.

[10]  Luis F. Miranda-Moreno,et al.  Effects of low sample mean values and small sample size on the estimation of the fixed dispersion parameter of Poisson-gamma models for modeling motor vehicle crashes: a Bayesian perspective , 2008 .

[11]  Liping Fu,et al.  Alternative Risk Models for Ranking Locations for Safety Improvement , 2005 .

[12]  Seng-Huat Ong,et al.  Analysis of Long-Tailed Count Data by Poisson Mixtures , 2005 .

[13]  R. A. Rigby,et al.  Article in Press Computational Statistics and Data Analysis a Framework for Modelling Overdispersed Count Data, including the Poisson-shifted Generalized Inverse Gaussian Distribution , 2022 .

[14]  Fred L. Mannering,et al.  The statistical analysis of crash-frequency data: A review and assessment of methodological alternatives , 2010 .

[15]  Srinivas Reddy Geedipally,et al.  Application of the Conway-Maxwell-Poisson generalized linear model for analyzing motor vehicle crashes. , 2008, Accident; analysis and prevention.

[16]  Liping Fu,et al.  Bayesian multiple testing procedures for hotspot identification. , 2007, Accident; analysis and prevention.

[17]  Srinivas Reddy Geedipally,et al.  Analyzing Different Parameterizations of the Varying Dispersion Parameter as a Function of Segment Length , 2009 .

[18]  Wen Cheng,et al.  Experimental evaluation of hotspot identification methods. , 2005, Accident; analysis and prevention.

[19]  Bhagwant Persaud,et al.  Comparison of empirical Bayes and full Bayes approaches for before-after road safety evaluations. , 2010, Accident; analysis and prevention.

[20]  J. Hilbe Negative Binomial Regression: Preface , 2007 .

[21]  J. Hilbe Negative Binomial Regression: Index , 2011 .

[22]  Dominique Lord,et al.  Examining the effects of site selection criteria for evaluating the effectiveness of traffic safety countermeasures. , 2012, Accident; analysis and prevention.

[23]  Ezra Hauer,et al.  OBSERVATIONAL BEFORE-AFTER STUDIES IN ROAD SAFETY -- ESTIMATING THE EFFECT OF HIGHWAY AND TRAFFIC ENGINEERING MEASURES ON ROAD SAFETY , 1997 .

[24]  Walter Zucchini,et al.  Parameter Estimation for the Sichel Distribution and its Multivariate Extension , 1987 .

[25]  Yajie Zou,et al.  Application of finite mixture of negative binomial regression models with varying weight parameters for vehicle crash data analysis. , 2013, Accident; analysis and prevention.

[26]  Dominique Lord,et al.  Investigating the effects of the fixed and varying dispersion parameters of Poisson-gamma models on empirical Bayes estimates. , 2008, Accident; analysis and prevention.

[27]  Joseph Hilbe,et al.  Negative Binomial Regression: Negative binomial regression , 2011 .