Application of XGBoost for Hazardous Material Road Transport Accident Severity Analysis

Hazardous material road transport accidents pose a serious threat to public life, property and the environment. Therefore, studying the factors influencing road transport accidents involving hazardous materials can help identify the main causes behind them and contribute to the adoption of specific and targeted measures to reduce casualty rates and improve traffic safety. However, most existing research either adopted methods based on statistical analysis or neglected to further evaluate the spatial relationships. This study aims to use the eXtreme Gradient Boosting (XGBoost) algorithm to analyze hazardous material road transport accident data from seven regions of China. Considering the rarity of these events, the classification performance of different methods is compared based on precision, recall, F-score and Area Under Curve (AUC). The results indicate that the proposed XGBoost method has the best modeling performance. There is some variation between regions in the features that have a significant impact on accident severity. The influence of the same feature on the severity of an accident even varies from region to region. The aforementioned results provide a theoretical basis for exploring the issues, sustainability, challenges, and tasks of safe transportation activities for hazardous materials in the future. These results can help regions develop targeted prevention and response policies to efficiently reduce the incidence and severity of accidents.

[1]  He-Da Zhang,et al.  Characteristics of hazardous chemical accidents in China: A statistical investigation , 2012 .

[2]  Jian Lu,et al.  Exploring Risk Factors Contributing to the Severity of Hazardous Material Transportation Accidents in China , 2020, International journal of environmental research and public health.

[3]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[4]  Donghong Ji,et al.  Novel framework for image attribute annotation with gene selection XGBoost algorithm and relative attribute model , 2019, Appl. Soft Comput..

[5]  Xiaowei Li,et al.  Cause Analysis of Unsafe Behaviors in Hazardous Chemical Accidents: Combined with HFACs and Bayesian Network , 2019, International journal of environmental research and public health.

[6]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[7]  Jun Ma,et al.  Analyzing the Leading Causes of Traffic Fatalities Using XGBoost and Grid-Based Analysis: A City Management Perspective , 2019, IEEE Access.

[8]  Sven-Eric Andersson SAFE TRANSPORT OF DANGEROUS GOODS: ROAD, RAIL OR SEA? A SCREENING OF TECHNICAL AND ADMINISTRATIVE FACTORS , 1993 .

[9]  Suren Chen,et al.  Injury severities of truck drivers in single- and multi-vehicle accidents on rural highways. , 2011, Accident; analysis and prevention.

[10]  Wang Dian-hong,et al.  Tracking laser Doppler measurement for velocity of moving target , 2012, 2012 International Conference on Computer Science and Information Processing (CSIP).

[11]  Jürgen Schmidhuber,et al.  Deep learning in neural networks: An overview , 2014, Neural Networks.

[12]  Jungyeol Hong,et al.  Application of association rules mining algorithm for hazardous materials transportation crashes on expressway. , 2020, Accident; analysis and prevention.

[13]  Jun Bi,et al.  A survey on hazardous materials accidents during road transport in China from 2000 to 2008. , 2010, Journal of hazardous materials.

[14]  Hongchao Liu,et al.  Factor Identification and Prediction for Teen Driver Crash Severity Using Machine Learning: A Case Study , 2020, Applied Sciences.

[15]  Michèle Sebag,et al.  Machine Learning and Knowledge Discovery in Databases , 2015, Lecture Notes in Computer Science.

[16]  Saleh R Mousa,et al.  A Comprehensive Railroad-Highway Grade Crossing Consolidation Model: A Machine Learning Approach. , 2019, Accident; analysis and prevention.

[17]  Mohammad Mehdi Besharati,et al.  A data mining approach to investigate the factors influencing the crash severity of motorcycle pillion passengers. , 2014, Journal of safety research.

[18]  Yong Qi,et al.  An accident prediction approach based on XGBoost , 2017, 2017 12th International Conference on Intelligent Systems and Knowledge Engineering (ISKE).

[19]  Mark D. Abkowitz,et al.  Assessing the Economic Effect of Incidents Involving Truck Transport of Hazardous Materials , 2001 .

[20]  J. Current,et al.  A MODEL TO ASSESS RISK, EQUITY AND EFFICIENCY IN FACILITY LOCATION AND TRANSPORTATION OF HAZARDOUS MATERIALS. , 1995 .

[21]  Luís Torgo,et al.  OpenML: networked science in machine learning , 2014, SKDD.

[22]  Ciro Caliendo,et al.  Quantitative Risk Analysis on the Transport of Dangerous Goods Through a Bi-Directional Road Tunnel. , 2017, Risk analysis : an official publication of the Society for Risk Analysis.

[23]  Igor Radun,et al.  Convicted of fatigued driving: who, why and how? , 2009, Accident; analysis and prevention.

[24]  Dongjoo Park,et al.  Transport Management Characteristics of Urban Hazardous Material Handling Business Entities , 2019 .

[25]  Hamidreza Asgari,et al.  Severity analysis for large truck rollover crashes using a random parameter ordered logit model. , 2019, Accident; analysis and prevention.

[26]  Patrick Patterson,et al.  Are safety and performance affected by navigation system display size, environmental illumination, and gender when driving in both urban and rural areas? , 2020, Accident; analysis and prevention.

[27]  Xulei Wang,et al.  Analysis of Factors that Influence Hazardous Material Transportation Accidents Based on Bayesian Networks: A Case Study in China , 2012 .

[28]  Bruno Fabiano,et al.  Dangerous good transportation by road: from risk analysis to emergency planning , 2005 .

[29]  Xiaonan Li,et al.  Analysis on Tank Truck Accidents Involved in Road Hazardous Materials Transportation in China , 2014, Traffic injury prevention.

[30]  Richard Williams Generalized Ordered Logit/Partial Proportional Odds Models for Ordinal Dependent Variables , 2006 .

[31]  Changxi Ma,et al.  Causation Analysis of Hazardous Material Road Transportation Accidents Based on the Ordered Logit Regression Model , 2020, International journal of environmental research and public health.

[32]  M R J Baldock,et al.  An examination of the environmental, driver and vehicle factors associated with the serious and fatal crashes of older rural drivers. , 2013, Accident; analysis and prevention.

[33]  Wei Zou,et al.  Truck crash severity in New York city: An investigation of the spatial and the time of day effects. , 2017, Accident; analysis and prevention.

[34]  Anders Krogh,et al.  Neural Network Ensembles, Cross Validation, and Active Learning , 1994, NIPS.

[35]  Xun Zhang,et al.  Traffic accidents involving fatigue driving and their extent of casualties. , 2016, Accident; analysis and prevention.

[36]  Mark D. Abkowitz,et al.  Emerging Technologies Applicable to Hazardous Materials Transportation Safety and Security , 2011 .

[37]  Chengcheng Xu,et al.  Analysis of the Risk Factors Affecting the Size of Fatal Accidents Involving Trucks Based on the Structural Equation Model , 2019, Transportation Research Record: Journal of the Transportation Research Board.

[38]  A. Khattak,et al.  RISK FACTORS IN LARGE TRUCK ROLLOVERS AND INJURY SEVERITY: ANALYSIS OF SINGLE-VEHICLE COLLISIONS , 2003 .

[39]  David M. Goldberg,et al.  Minimizing the Risks of Highway Transport of Hazardous Materials , 2019, Sustainability.

[40]  Berrin Tansel,et al.  A transportation network assessment tool for hazardous material cargo routing: Weighing exposure health risks, proximity to vulnerable areas, delay costs and trucking expenses , 2016 .

[41]  W. Duan,et al.  The situation of hazardous chemical accidents in China between 2000 and 2006. , 2011, Journal of hazardous materials.

[42]  D. Powers Evaluation: From Precision, Recall and F-Factor to ROC, Informedness, Markedness & Correlation , 2008 .

[43]  Ali Movahedi,et al.  Toward safer highways, application of XGBoost and SHAP for real-time accident detection and feature analysis. , 2019, Accident; analysis and prevention.

[44]  Aemal J. Khattak,et al.  Modeling the Probability of Hazardous Materials Release in Crashes at Highway–Rail Grade Crossings , 2018 .

[45]  Kwasi Kwafo Adarkwa,et al.  Next to suffer: Population exposure risk to hazardous material transportation in Ghana , 2018, Journal of Transport & Health.

[46]  Amirfarrokh Iranitalab,et al.  Comparison of four statistical and machine learning methods for crash severity prediction. , 2017, Accident; analysis and prevention.

[47]  Qin Shi,et al.  Identification of black spots on highways using fault tree analysis and vehicle safety boundaries , 2019, Journal of Transportation Safety & Security.

[48]  Dimitris Kanellopoulos,et al.  Data Preprocessing for Supervised Leaning , 2007 .

[49]  Laijun Zhao,et al.  An Analysis of Hazardous Chemical Accidents in China between 2006 and 2017 , 2018, Sustainability.

[50]  Fang Liu,et al.  Lane-changes prediction based on adaptive fuzzy neural network , 2018, Expert Syst. Appl..

[51]  Ryan Doczy,et al.  Machine Learning Methods to Analyze Injury Severity of Drivers from Different Age and Gender Groups , 2018, Transportation Research Record: Journal of the Transportation Research Board.

[52]  C. Tyler Dick,et al.  Risk-Based Optimization of Rail Defect Inspection Frequency for Petroleum Crude Oil Transportation , 2016 .