Adjustment for Maximum Likelihood Estimate of Negative Binomial Dispersion Parameter

The negative binomial (NB) (or Poisson–gamma) model has been used extensively by highway safety analysts because it can accommodate the overdispersion often exhibited in crash data. However, it has been reported in the literature that the maximum likelihood estimate of the dispersion parameter of NB models can be significantly affected when the data are characterized by small sample size and low sample mean. Given the important roles of the dispersion parameter in various types of highway safety analyses, there is a need to determine whether the bias could be potentially corrected or minimized. The objectives of this study are to explore whether a systematic relationship exists between the estimated and true dispersion parameters, determine the bias as a function of the sample size and sample mean, and develop a procedure for correcting the bias caused by these two conditions. For this purpose, simulated data were used to derive the relationship under the various combinations of sample mean, dispersion parameter, and sample size, which encompass all simulation conditions performed in previous research. The dispersion parameter was estimated by using the maximum likelihood method. The results confirmed previous studies and developed a reasonable relationship between the estimated and true dispersion parameters for reducing the bias. Details for the application of the correction procedure were also provided by using the crash data collected at 458 three-leg unsignalized intersections in California. Finally, the study provided several discussion points for further work.

[1]  J. Lawless Negative binomial and mixed Poisson regression , 1987 .

[2]  Joe N. Perry,et al.  Estimation of the Negative Binomial Parameter κ by Maximum Quasi -Likelihood , 1989 .

[3]  W. Piegorsch Maximum likelihood estimation for the negative binomial dispersion parameter. , 1990, Biometrics.

[4]  C. Dean,et al.  Modified pseudo-likelihood estimator of the overdispersion parameter in Poisson mixture models , 1994 .

[5]  S-P Miaou,et al.  MEASURING THE GOODNESS-OF-FIT OF ACCIDENT PREDICTION MODELS , 1996 .

[6]  Pravin K. Trivedi,et al.  Regression Analysis of Count Data , 1998 .

[7]  E Hauer,et al.  Overdispersion in modelling accidents on road sections and in empirical bayes estimation. , 2001, Accident; analysis and prevention.

[8]  R. H. Myers Generalized Linear Models: With Applications in Engineering and the Sciences , 2001 .

[9]  J. T. Wulu,et al.  Regression analysis of count data , 2002 .

[10]  Dominique Lord,et al.  Poisson, Poisson-gamma and zero-inflated regression models of motor vehicle crashes: balancing statistical fit and theory. , 2005, Accident; analysis and prevention.

[11]  G R Wood Confidence and prediction intervals for generalised linear accident models. , 2005, Accident; analysis and prevention.

[12]  James A Bonneson,et al.  Calibration of Predictive Models for Estimating Safety of Ramp Design Configurations , 2005 .

[13]  Sudhir Paul,et al.  Bias-corrected maximum likelihood estimator of the negative binomial dispersion parameter. , 2005, Biometrics.

[14]  Abdel H. El-Shaarawi,et al.  Negative Binomial Distribution , 2006 .

[15]  Nils Toft,et al.  The Gamma-Poisson model as a statistical method to determine if micro-organisms are randomly distributed in a food matrix. , 2006, Food microbiology.

[16]  Dominique Lord,et al.  Modeling motor vehicle crashes using Poisson-gamma models: examining the effects of low sample mean values and small sample size on the estimation of the fixed dispersion parameter. , 2006, Accident; analysis and prevention.

[17]  Kay Fitzpatrick,et al.  Development of Tools for Evaluating the Safety Implications of Highway Design Decisions , 2007 .

[18]  J. Lloyd-Smith Maximum Likelihood Estimation of the Negative Binomial Dispersion Parameter for Highly Overdispersed Data, with Applications to Infectious Diseases , 2007, PloS one.

[19]  James A Bonneson,et al.  Development of Accident Modification Factors for Rural Frontage Road Segments in Texas , 2007 .