Modeling unobserved heterogeneity using finite mixture random parameters for spatially correlated discrete count data

Road segments with identical site-specific attributes often exhibit significantly different crash counts due to unobserved reasons. The extent of unobserved heterogeneity associated with a road feature is to be estimated prior to selecting the relevant safety treatment. Moreover, crash count data is often over-dispersed and spatially correlated. This paper proposes a spatial negative binomial specification with random parameters for modeling crash counts of contiguous road segments. The unobserved heterogeneity is incorporated using a finite multi-variate normal mixture prior on the random parameters; this allows for non-normality, skewness in the distribution of the random parameters, facilitates correlation across the random parameters, and relaxes any distributional assumptions. The model extracts the inherent groups of road segments with crash counts that are equally sensitive to the road attributes on an average; the heterogeneity within these groups is also allowed in the proposed framework. The specification simultaneously accounts for potential spatial correlation of the crash counts from neighboring road segments. A Gibbs sampling framework is proposed that leverages recent theoretical developments on data-augmentation algorithms, and elegantly sidesteps many of the computational difficulties usually associated with Bayesian inference of count models. Empirical results suggests the presence of two latent groups and spatial correlation within the study road network. Road features with significantly different effect on crash counts across two latent groups of road segments were identified.

[1]  Sylvia Frühwirth-Schnatter,et al.  Capturing consumer heterogeneity in metric conjoint analysis using Bayesian mixture models , 2004 .

[2]  Fred L Mannering,et al.  A note on modeling vehicle accident frequencies with random-parameters count models. , 2009, Accident; analysis and prevention.

[3]  Satish V. Ukkusuri,et al.  Random Parameter Model Used to Explain Effects of Built-Environment Characteristics on Pedestrian Crash Frequency , 2011 .

[4]  Chandra R. Bhat,et al.  Unobserved heterogeneity and the statistical analysis of highway accident data , 2016 .

[5]  Fred L. Mannering,et al.  The heterogeneous effects of guardian supervision on adolescent driver-injury severities: A finite-mixture random-parameters approach , 2013 .

[6]  Erdong Chen,et al.  Modeling safety of highway work zones with random parameters and random effects models , 2014 .

[7]  S. Chib,et al.  Bayesian analysis of binary and polychotomous response data , 1993 .

[8]  C. Bhat Simulation estimation of mixed discrete choice models using randomized and scrambled Halton sequences , 2003 .

[9]  Ezra Hauer,et al.  OBSERVATIONAL BEFORE-AFTER STUDIES IN ROAD SAFETY -- ESTIMATING THE EFFECT OF HIGHWAY AND TRAFFIC ENGINEERING MEASURES ON ROAD SAFETY , 1997 .

[10]  D. Rubin,et al.  Inference from Iterative Simulation Using Multiple Sequences , 1992 .

[11]  Dirk Eddelbuettel,et al.  Rcpp: Seamless R and C++ Integration , 2011 .

[12]  P. Deb,et al.  Demand for Medical Care by the Elderly: A Finite Mixture Approach , 1997 .

[13]  Sw. Banerjee,et al.  Hierarchical Modeling and Analysis for Spatial Data , 2003 .

[14]  Markus Jochmann,et al.  Estimating the demand for health care with panel data: a semiparametric Bayesian approach. , 2004, Health economics.

[15]  Ahmet Tortum,et al.  Accident analysis with aggregated data: the random parameters negative binomial panel count data model , 2015 .

[16]  S. Frühwirth-Schnatter,et al.  Bayesian Analysis of the Heterogeneity Model , 2004 .

[17]  Paul P Jovanis,et al.  Analysis of Road Crash Frequency with Spatial Models , 2008 .

[18]  Gudmundur F. Ulfarsson,et al.  Random parameter models of interstate crash frequencies by severity, number of vehicles involved, collision and location type. , 2013, Accident; analysis and prevention.

[19]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[20]  Fred L Mannering,et al.  A study of factors affecting highway accident rates using the random-parameters tobit model. , 2012, Accident; analysis and prevention.

[21]  Mohamed M. Ahmed,et al.  Exploring a Bayesian hierarchical approach for developing safety performance functions for a mountainous freeway. , 2011, Accident; analysis and prevention.

[22]  Michel Wedel,et al.  A Latent Class Poisson Regression Model for Heterogeneous Count Data , 1993 .

[23]  Bradley P. Carlin,et al.  Bayesian measures of model complexity and fit , 2002 .

[24]  Eduard Zaloshnja,et al.  The Economic and Societal Impact of Motor Vehicle Crashes, 2010 (Revised) , 2015 .

[25]  Xiao-Li Meng,et al.  The Art of Data Augmentation , 2001 .

[26]  Dani Gamerman,et al.  Markov Chain Monte Carlo: Stochastic Simulation for Bayesian Inference , 1997 .

[27]  P. Green,et al.  On Bayesian Analysis of Mixtures with an Unknown Number of Components (with discussion) , 1997 .

[28]  Md. Tazul Islam,et al.  Effects of spatial correlation in random parameters collision count-data models , 2015 .

[29]  Yajie Zou,et al.  Application of finite mixture of negative binomial regression models with varying weight parameters for vehicle crash data analysis. , 2013, Accident; analysis and prevention.

[30]  Shaw-Pin Miaou,et al.  Bayesian ranking of sites for engineering safety improvements: decision parameter, treatability concept, statistical criterion, and spatial dependence. , 2005, Accident; analysis and prevention.

[31]  Dominique Lord,et al.  Application of finite mixture models for vehicle crash data analysis. , 2009, Accident; analysis and prevention.

[32]  Chandra R. Bhat,et al.  The maximum approximate composite marginal likelihood (MACML) estimation of multinomial probit-based unordered response choice models , 2011 .

[33]  Chandra R. Bhat,et al.  Analytic methods in accident research: Methodological frontier and future directions , 2014 .

[34]  Petros Dellaportas,et al.  Bayesian Analysis of Extreme Values by Mixture Modeling , 2003 .

[35]  F. Mannering,et al.  Safety impacts of signal-warning flashers and speed control at high-speed signalized intersections. , 2013, Accident; analysis and prevention.

[36]  Nicholas K Tulach,et al.  Do lower income areas have more pedestrian casualties? , 2013, Accident; analysis and prevention.

[37]  James G. Scott,et al.  Bayesian Inference for Logistic Models Using Pólya–Gamma Latent Variables , 2012, 1205.0310.

[38]  Baoshan Huang,et al.  Multivariate random-parameters zero-inflated negative binomial regression model: an application to estimate crash frequencies at intersections. , 2014, Accident; analysis and prevention.

[39]  D. Karlis,et al.  Mixed Poisson Distributions , 2005 .

[40]  W. Greene,et al.  Functional Form and Heterogeneity in Models for Count Data , 2007 .

[41]  Hoong Chor Chin,et al.  Applying the random effect negative binomial model to examine traffic accident occurrence at signalized intersections. , 2003, Accident; analysis and prevention.

[42]  Z. Griliches,et al.  Econometric Models for Count Data with an Application to the Patents-R&D Relationship , 1984 .

[43]  Kara M Kockelman,et al.  A Poisson-lognormal conditional-autoregressive model for multivariate spatial analysis of pedestrian crash counts across neighborhoods. , 2013, Accident; analysis and prevention.

[44]  M. Harding,et al.  A Poisson mixture model of discrete choice , 2011 .

[45]  Hong Yang,et al.  Crash frequency modeling for signalized intersections in a high-density urban road network , 2014 .

[46]  Qiang Zeng,et al.  Bayesian spatial joint modeling of traffic crashes on an urban road network. , 2014, Accident; analysis and prevention.

[47]  Sudip Barua,et al.  Multivariate random parameters collision count data models with spatial heterogeneity , 2016 .

[48]  Quenouille Mh,et al.  A relation between the logarithmic, Poisson, and negative binomial series. , 1949 .