Improved Support Vector Machine Models for Work Zone Crash Injury Severity Prediction and Analysis

Work zones are a high priority issue in the field of road transportation because of their impacts on traffic safety. A better understanding of work zone crashes can help to identify the contributing factors and countermeasures to enhance roadway safety. This study investigates the prediction of work zone crash severity and the contributing factors by employing a parametric approach using the mixed logit modeling framework and a non-parametric machine learning approach using the support vector machine (SVM). The mixed logit model belongs to the class of random parameter models in which the effects of flexible variables across different observations are identified, that is, data heterogeneity is taken into account. The performance of the SVM model is enhanced by applying three metaheuristic algorithms: particle swarm optimization (PSO), harmony search (HS), and the whale optimization algorithm (WOA). Empirical findings indicate that SVM provides higher prediction accuracy and outperforms the mixed logit model. Estimation results reveal key factors that increase the likelihood of severe work zone crashes. Furthermore, the analysis illustrates the ability of the three metaheuristics to enhance the SVM and the superiority of the harmony search algorithm in improving the performance of the SVM model.

[1]  Fan Ye,et al.  Comparing Three Commonly Used Crash Severity Models on Sample Size Requirements : Multinomial Logit , Ordered Probit and Mixed Logit Models , 2013 .

[2]  Jyh-Hone Wang,et al.  THE IMPACT OF TEXT DRIVING ON DRIVING SAFETY , 2016 .

[3]  J. Halton On the efficiency of certain quasi-random sequences of points in evaluating multi-dimensional integrals , 1960 .

[4]  C. Bhat Simulation estimation of mixed discrete choice models using randomized and scrambled Halton sequences , 2003 .

[5]  Nataliya V Malyshkina,et al.  Empirical assessment of the impact of highway design exceptions on the frequency and severity of vehicle accidents. , 2009, Accident; analysis and prevention.

[6]  Salvador Hernandez,et al.  Heavy-Vehicle Crash Rate Analysis: Comparison of Heterogeneity Methods Using Idaho Crash Data , 2017 .

[7]  Wen Chen,et al.  Parameter Optimization of SVM Based on Improved ACO for Data Classification , 2016, MUE 2016.

[8]  J. Weston,et al.  Support Vector Machine Solvers , 2007 .

[9]  A. Moudon,et al.  The risk of pedestrian injury and fatality in collisions with motor vehicles, a social ecological study of state routes and city streets in King County, Washington. , 2011, Accident; analysis and prevention.

[10]  Fred L Mannering,et al.  Highway accident severities and the mixed logit model: an exploratory empirical analysis. , 2008, Accident; analysis and prevention.

[11]  Aamer Athar,et al.  Potential Operation and Maintenance (O&M) Opportunities at the Florida Department of Highway Safety and Motor Vehicles , 1996 .

[12]  Jason Anderson,et al.  Roadway classifications and the accident injury severities of heavy-vehicle drivers , 2017 .

[13]  Sabyasachee Mishra,et al.  Analysis of injury severity of large truck crashes in work zones. , 2016, Accident; analysis and prevention.

[14]  Ajith Abraham,et al.  Traffic Accident Analysis Using Machine Learning Paradigms , 2005, Informatica.

[15]  Fred L Mannering,et al.  A multivariate tobit analysis of highway accident-injury-severity rates. , 2012, Accident; analysis and prevention.

[16]  Erdong Chen,et al.  Modeling safety of highway work zones with random parameters and random effects models , 2014 .

[17]  Florian Heiss,et al.  Discrete Choice Methods with Simulation , 2016 .

[18]  Faming Liang,et al.  Crash Injury Severity Analysis Using a Bayesian Ordered Probit Model , 2007 .

[19]  Asad J. Khattak,et al.  Extracting Useful Information from Basic Safety Message Data: An Empirical Study of Driving Volatility Measures and Crash Frequency at Intersections , 2018, 1803.02433.

[20]  Wei Wang,et al.  Using support vector machine models for crash injury severity analysis. , 2012, Accident; analysis and prevention.

[21]  Sabyasachee Mishra,et al.  Analysis of passenger-car crash injury severity in different work zone configurations. , 2018, Accident; analysis and prevention.

[22]  Mohamed Ahmed,et al.  A Tree-Based Ordered Probit Approach to Identify Factors Affecting Work Zone Weather-Related Crashes Severity in North Carolina Using the Highway Safety Information System Dataset , 2017 .

[23]  Fang Yuan,et al.  INCIDENT DETECTION USING SUPPORT VECTOR MACHINES , 2003 .

[24]  Xuedong Yan,et al.  Investigating the influence of curbs on single-vehicle crash injury severity utilizing zero-inflated ordered probit models. , 2013, Accident; analysis and prevention.

[25]  Shakil Mohammad Rifaat,et al.  Effect of street pattern on the severity of crashes involving vulnerable road users. , 2011, Accident; analysis and prevention.

[26]  Seyedmirsajad Mokhtarimousavi,et al.  A Non-Dominated Sorting Genetic Algorithm Approach for Optimization of Multi-Objective Airport Gate Assignment Problem , 2017, Transportation Research Record: Journal of the Transportation Research Board.

[27]  Andrew P. Bradley,et al.  The use of the area under the ROC curve in the evaluation of machine learning algorithms , 1997, Pattern Recognit..

[28]  Fei Ye,et al.  An improved chaotic fruit fly optimization based on a mutation strategy for simultaneous feature selection and parameter optimization for SVM and its applications , 2017, PloS one.

[29]  Zong Woo Geem,et al.  A New Heuristic Optimization Algorithm: Harmony Search , 2001, Simul..

[30]  Carol A C Flannagan,et al.  Identification and validation of a logistic regression model for predicting serious injuries associated with motor vehicle crashes. , 2011, Accident; analysis and prevention.

[31]  Sandeep Mudigonda,et al.  Analysis of Incident-Induced Capacity Reductions for Improved Delay Estimation , 2019, Journal of Transportation Engineering, Part A: Systems.

[32]  K. Train Halton Sequences for Mixed Logit , 2000 .

[33]  D. McFadden,et al.  MIXED MNL MODELS FOR DISCRETE RESPONSE , 2000 .

[34]  Chandra R. Bhat,et al.  Unobserved heterogeneity and the statistical analysis of highway accident data , 2016 .

[35]  Nanning Zheng,et al.  Pedestrian detection using sparse Gabor filter and support vector machine , 2005, IEEE Proceedings. Intelligent Vehicles Symposium, 2005..

[36]  Timothy C. Coburn,et al.  Statistical and Econometric Methods for Transportation Data Analysis , 2004, Technometrics.

[37]  Mohamed Abdel-Aty,et al.  Examining traffic crash injury severity at unsignalized intersections. , 2010, Journal of safety research.

[38]  Mohamed Ahmed,et al.  Parametric Ordinal Logistic Regression and Non-Parametric Decision Tree Approaches for Assessing the Impact of Weather Conditions on Driver Speed Selection Using Naturalistic Driving Data , 2018 .

[39]  Ye Fei,et al.  Simultaneous feature with support vector selection and parameters optimization using GA-based SVM solve the binary classification , 2016, 2016 First IEEE International Conference on Computer Communication and the Internet (ICCCI).

[40]  Mohamed Abdel-Aty,et al.  Analyzing crash injury severity for a mountainous freeway incorporating real-time traffic and weather data , 2014 .

[41]  Dominique Lord,et al.  Comparing Three Commonly Used Crash Severity Models on Sample Size Requirements: Multinomial Logit, Ordered Probit, and Mixed Logit Models , 2014 .

[42]  Salvador Hernandez,et al.  Large-Truck Involved Crashes: An Exploratory Injury Severity Analysis , 2013 .

[43]  Fred L Mannering,et al.  A study of factors affecting highway accident rates using the random-parameters tobit model. , 2012, Accident; analysis and prevention.

[44]  Sherif Sakr,et al.  Using machine learning on cardiorespiratory fitness data for predicting hypertension: The Henry Ford ExercIse Testing (FIT) Project , 2018, PloS one.

[45]  Mohamed Ahmed,et al.  Crash Characteristics and Injury Severity at Work Zones Considering Adverse Weather Conditions in Washington Using SHRP 2 Roadway Information Database , 2016 .

[46]  Fred L Mannering,et al.  An empirical assessment of fixed and random parameter logit models using crash- and non-crash-specific injury data. , 2011, Accident; analysis and prevention.

[47]  Hossam Faris,et al.  Optimizing connection weights in neural networks using the whale optimization algorithm , 2016, Soft Computing.

[48]  Dursun Delen,et al.  Investigating injury severity risk factors in automobile crashes with predictive analytics and sensitivity analysis methods , 2017 .

[49]  Russell C. Eberhart,et al.  A new optimizer using particle swarm theory , 1995, MHS'95. Proceedings of the Sixth International Symposium on Micro Machine and Human Science.

[50]  George Cybenko,et al.  Approximation by superpositions of a sigmoidal function , 1989, Math. Control. Signals Syst..

[51]  B. Walczak,et al.  Particle swarm optimization (PSO). A tutorial , 2015 .

[52]  Chandra R. Bhat,et al.  Analytic methods in accident research: Methodological frontier and future directions , 2014 .

[53]  Jasmine Pahukula,et al.  A time of day analysis of crashes involving large trucks in urban areas. , 2015, Accident; analysis and prevention.

[54]  Andrew Lewis,et al.  The Whale Optimization Algorithm , 2016, Adv. Eng. Softw..

[55]  Reza Vatani Nezafat,et al.  A comparative study on using meta-heuristic algorithms for road maintenance planning: Insights from field study in a developing country , 2017 .

[56]  Hiren M. Mandalia,et al.  Using Support Vector Machines for Lane-Change Detection , 2005 .

[57]  Mahmoud Saffarzadeh,et al.  Determination of the Aircraft Landing Sequence by Two Meta-Heuristic Algorithms , 2014 .

[58]  Madhar Taamneh,et al.  Severity Prediction of Traffic Accident Using an Artificial Neural Network , 2017 .

[59]  F Mannering,et al.  Analysis of injury severity and vehicle occupancy in truck- and non-truck-involved accidents. , 1999, Accident; analysis and prevention.

[60]  Zong Tian,et al.  Investigating driver injury severity patterns in rollover crashes using support vector machine models. , 2016, Accident; analysis and prevention.

[61]  Salvador Hernandez,et al.  Modeling Injury Outcomes of Crashes Involving Heavy Vehicles on Texas Highways , 2013 .

[62]  Andrew P Tarko,et al.  Tobit analysis of vehicle accident rates on interstate highways. , 2008, Accident; analysis and prevention.

[63]  Ryan Doczy,et al.  Analysis of Gap Acceptance Behavior for Unprotected Right and Left Turning Maneuvers at Signalized Intersections using Data Mining Methods: A Driving Simulation Approach , 2018, Transportation Research Record: Journal of the Transportation Research Board.