Transportation agencies in the United States have started implementing SafetyAnalyst, highway safety management software. Some agencies have been developing or considering developing safety performance functions (SPFs), a core of the software, using local data to maximize the benefits of the software. With multiple years of data on hand, several regression model types are available for developing the SPFs, yet no reliable guide has been provided in selecting appropriate model types. This study examined 11 model types including 8 panel and 3 cross-sectional models to recommend appropriate model types for intersection SPFs for use with SafetyAnalyst. The models were developed using the data collected from 18,356 intersections in Virginia from 2003 through 2008 and were evaluated using 4 comparison approaches. Appropriate model types for each of the 8 intersection subtypes were found; four all-way stop control subtypes were excluded because of the lack of sufficient observations. In general, a cross-sectional model with summed or averaged crash frequencies was found to underestimate a dispersion parameter that plays a critical role in the empirical Bayes method. Depending on intersection subtype, appropriate model types were found to vary. A panel model with independent correlation and a pooled cross-sectional model were found to be appropriate across all 8 subtypes; if the same model type is desired for the 8 subtypes, these 2 models are recommended for developing SPFs using local data for SafetyAnalyst. Although the conclusions and recommendations were based on sufficient data, they are valid for Virginia and may not be applicable to other states.
[1]
S. Zeger,et al.
Longitudinal data analysis using generalized linear models
,
1986
.
[2]
W. Pan.
Akaike's Information Criterion in Generalized Estimating Equations
,
2001,
Biometrics.
[3]
A. Raftery.
Bayesian Model Selection in Social Research
,
1995
.
[4]
F B Hu,et al.
Comparison of population-averaged and subject-specific approaches for analyzing repeated binary outcomes.
,
1998,
American journal of epidemiology.
[5]
Pravin K. Trivedi,et al.
Regression Analysis of Count Data
,
1998
.
[6]
Zhehui Luo,et al.
Fixed effects, random effects and GEE: What are the differences?
,
2009,
Statistics in medicine.
[7]
Fred L Mannering,et al.
A note on modeling vehicle accident frequencies with random-parameters count models.
,
2009,
Accident; analysis and prevention.
[8]
Young-Jun Kweon,et al.
Appropriate Regression Model Types for Empirical Bayes Method and SafetyAnalyst
,
2010
.
[9]
R Kulmala,et al.
Measuring the contribution of randomness, exposure, weather, and daylight to the variation in road accident counts.
,
1995,
Accident; analysis and prevention.
[10]
Darren J Torbic,et al.
SafetyAnalyst: Software Tools for Safety Management of Specific Highway Sites
,
2007
.