Malaysian Road Accident Severity: Variables and Predictive Models

Road accident refers to an incident where at least one land vehicle with one or more people injured or killed. While there are many variables attributed to road accident, ranging from human to environmental factors, the work presented in this paper focused only on identifying predictors that could potentially lead to fatality. In this study, the raw dataset obtained from the Malaysian Institute of Road Safety Research (MIROS) was firstly preprocessed and subsequently transformed into analytical dataset by removing missing values and outliers. Such transformation, however, resort to large feature space. To overcome such challenge, feature selection algorithms were employed before constructing predictive models. Empirical study revealed that there were 26 important predictors for predicting accident fatality and the top five variables are month, speed limit, collision type, vehicle model and vehicle movement. In this work, six predictive models constructed were Random Forest, XGBoost, CART, Neural Net, Naive Bayes and SVM; with Random Forest outperformed the rest with an accuracy of 95.46%.

[1]  Junhua Wang,et al.  Utilizing the eigenvectors of freeway loop data spatiotemporal schematic for real time crash prediction. , 2016, Accident; analysis and prevention.

[2]  Ana Fernandes,et al.  An approach to accidents modeling based on compounds road environments. , 2013, Accident; analysis and prevention.

[3]  Jeff Linkenbach,et al.  White Papers for: "Toward Zero Deaths: A National Strategy on Highway Safety" - White Paper No. 2 - White Paper on Traffic Safety Culture , 2010 .

[4]  Ana de Almeida,et al.  Prediction of Road Accident Severity Using the Ordered Probit Model , 2014 .

[5]  Fridulv Sagberg,et al.  Road accidents caused by sleepy drivers: Update of a Norwegian survey. , 2013, Accident; analysis and prevention.

[6]  Huan Liu,et al.  Feature selection for classification: A review , 2014 .

[7]  A. Çelik,et al.  A multinomial logit analysis of risk factors influencing road traffic injury severities in the Erzurum and Kars Provinces of Turkey. , 2014, Accident; analysis and prevention.

[8]  S. N. Sachdeva,et al.  Analysis of road accidents on NH-1 between RD 98 km to 148 km , 2016 .

[9]  Athanasios Theofilatos,et al.  Incorporating real-time traffic and weather data to explore road accident likelihood and severity in urban arterials. , 2017, Journal of safety research.

[10]  Amirfarrokh Iranitalab,et al.  Comparison of four statistical and machine learning methods for crash severity prediction. , 2017, Accident; analysis and prevention.

[11]  Yannis George,et al.  Investigation of road accident severity per vehicle type , 2017 .

[12]  Shlomo Bekhor,et al.  The Relationship between Free-Flow Travel Speeds, Infrastructure Characteristics and Accidents, on Single-Carriageway Roads , 2017 .

[13]  Senén Barro,et al.  Do we need hundreds of classifiers to solve real world classification problems? , 2014, J. Mach. Learn. Res..

[14]  Graham Currie,et al.  Factors affecting the probability of bus drivers being at-fault in bus-involved accidents. , 2014, Accident; analysis and prevention.

[15]  Jonathan J. Forster,et al.  Modelling trends in road accident frequency - Bayesian inference for rates with uncertain exposure , 2014, Comput. Stat. Data Anal..

[16]  Youngchan Jang,et al.  Classification of motor vehicle crash injury severity: A hybrid approach for imbalanced data. , 2018, Accident; analysis and prevention.

[17]  George Yannis,et al.  Predicting road accidents: a rare-events modeling approach , 2016 .

[18]  Vipin Kumar,et al.  Feature Selection: A literature Review , 2014, Smart Comput. Rev..

[19]  Yiyi Wang,et al.  Modeling Crash and Fatality Counts Along Mainlanes and Frontage Roads Across Texas: The Roles of Design, the Built Environment, and Weather , 2014 .

[20]  Faustino Prieto,et al.  Modelling road accident blackspots data with the discrete generalized Pareto distribution. , 2013, Accident; analysis and prevention.

[21]  Jian John Lu,et al.  Hit-and-run crashes in urban river-crossing road tunnels. , 2016, Accident; analysis and prevention.

[22]  Ali Azadeh,et al.  A novel framework for improvement of road accidents considering decision-making styles of drivers in a large metropolitan area. , 2016, Accident; analysis and prevention.