Inclusion of phone use while driving data in predicting distraction-affected crashes.

Abstract Introduction: Given the tremendous number of lives lost or injured, distracted driving is an important safety area to study. With the widespread use of cellphones, phone use while driving has become the most common distracted driving behavior. Although researchers have developed safety performance functions (SPFs) for various crash types, SPFs for distraction-affected crashes are rarely studied in the literature. One possible reason is the lack of critical distracted behavior information in the commonly used safety data (i.e., roadway inventory, traffic, and crash counts). Recently, the frequency of phone use while driving (referred to as phone use data) is recorded by mobile application companies and has become available to safety researchers. The primary objective of this study is to examine if phone use data can potentially predict distracted-affected crashes. Method: The authors first integrated phone use data with roadway inventory, traffic, and crash data in Texas. Then, the Random Forest (RF) algorithm was applied to assess the significance of the feature - phone use while driving - for predicting the number of distraction-affected crashes on a road segment. Further, this study developed two SPFs for distraction-affected crashes with and without the phone use data, separately. Both SPFs were assessed in terms of model fitting and prediction performances. Results: RF results rank the frequency of phone use as an important factor contributing to the number of distraction-affected crashes. Performance evaluations indicated that the inclusion of phone use data in the SPFs consistently improved both fitting and prediction abilities to predict distracted-affected crashes. Practical Applications: The phone use data provide new insights into the safety analyses of distraction-affected crashes, which cannot be achieved by only using the conventional roadway inventory and crash data. Therefore, safety researchers and practitioners are encouraged to incorporate the emerging data sources in reducing distraction-affected crashes.

[1]  Lindsay S. Arnold,et al.  Effectiveness of Distracted Driving Countermeasures: A Review of the Literature , 2019 .

[2]  Simon Washington,et al.  On the nature of over-dispersion in motor vehicle crash prediction models. , 2007, Accident; analysis and prevention.

[3]  Zhenhua Chen,et al.  Influence of built environment on the severity of vehicle crashes caused by distracted driving: A multi-state comparison. , 2020, Accident; analysis and prevention.

[4]  Jean-Michel Poggi,et al.  Variable selection using random forests , 2010, Pattern Recognit. Lett..

[5]  Y. Zou,et al.  Modeling over-dispersed crash data with a long tail: Examining the accuracy of the dispersion parameter in Negative Binomial models , 2015 .

[6]  Rob J Hyndman,et al.  Another look at measures of forecast accuracy , 2006 .

[7]  Hong Han,et al.  Variable selection using Mean Decrease Accuracy and Mean Decrease Gini based on Random Forest , 2016, 2016 7th IEEE International Conference on Software Engineering and Service Science (ICSESS).

[8]  Bhagwant Persaud,et al.  Revisiting Variability of Dispersion Parameter of Safety Performance for Two-Lane Rural Roads , 2010 .

[9]  Beth E Ebel,et al.  “I wasn't texting; I was just reading an email …”: a qualitative study of distracted driving enforcement in Washington State , 2016, Injury Prevention.

[10]  Ioannis Tsapakis,et al.  Safety performance functions for low-volume rural minor collector two-lane roadways , 2021 .

[11]  Zhenhua Chen,et al.  The influence of built environment on distracted driving related crashes in Ohio , 2021 .

[12]  Xiaoyu Guo,et al.  A comparative analysis of intersection hotspot identification: Fixed vs. varying dispersion parameters in negative binomial models , 2020 .

[13]  Michael Fitzharris,et al.  Driver inattention and driver distraction in serious casualty crashes: data from the Australian National Crash In-depth Study. , 2013, Accident; analysis and prevention.

[14]  James A Bonneson,et al.  Development of Accident Modification Factors for Rural Frontage Road Segments in Texas , 2007 .

[15]  Fred Wegman,et al.  Analyzing road design risk factors for run-off-road crashes in The Netherlands with crash prediction models. , 2014, Journal of safety research.

[16]  H. Akaike A new look at the statistical model identification , 1974 .

[17]  Oscar Oviedo-Trespalacios,et al.  Driving behaviour while self-regulating mobile phone interactions: A human-machine system approach. , 2018, Accident; analysis and prevention.

[18]  Yajie Zou,et al.  Incorporating survival analysis into the safety effectiveness evaluation of treatments: Jointly modeling crash counts and time intervals between crashes , 2020, Journal of Transportation Safety & Security.

[19]  Maurizio Guida,et al.  A crash-prediction model for multilane roads. , 2007, Accident; analysis and prevention.

[20]  Fred L. Mannering,et al.  The statistical analysis of crash-frequency data: A review and assessment of methodological alternatives , 2010 .

[21]  Yajie Zou,et al.  Comparative Analysis of Empirical Bayes and Bayesian Hierarchical Models in Hotspot Identification , 2019, Transportation Research Record: Journal of the Transportation Research Board.

[22]  Alfonso Montella,et al.  Crash Prediction Models for Rural Motorways , 2008 .

[23]  D. Lord,et al.  Examining driver distraction in the context of driving speed: An observational study using disruptive technology and naturalistic data. , 2021, Accident; analysis and prevention.

[24]  Soheil Sohrabi,et al.  Impacts of Red-Light Cameras on Intersection Safety: A Bayesian Hierarchical Spatial Model , 2019 .

[25]  J. Hilbe Negative Binomial Regression: Index , 2011 .

[26]  Simon Washington,et al.  Effects of road infrastructure and traffic complexity in speed adaptation behaviour of distracted drivers. , 2017, Accident; analysis and prevention.

[27]  Ziyuan Pu,et al.  Evaluating the Nonlinear Correlation between Vertical Curve Features and Crash Frequency on Highways Using Random Forests , 2020 .

[28]  I. Tsapakis,et al.  Application of different negative binomial parameterizations to develop safety performance functions for non-federal aid system roads. , 2021, Accident; analysis and prevention.

[29]  Paul F Hanley,et al.  Bias caused by self-reporting distraction and its impact on crash estimates. , 2012, Accident; analysis and prevention.

[30]  Mark J. King,et al.  User preferences and design recommendations for voluntary smartphone applications to prevent distracted driving , 2019, Transportation Research Part F: Traffic Psychology and Behaviour.

[31]  Zhenhua Chen,et al.  Does space influence on the frequency and severity of the distraction-affected vehicle crashes? An empirical evidence from the Central Ohio. , 2020, Accident; analysis and prevention.

[32]  Subasish Das,et al.  Characterizing phone usage while driving: Safety impact from road and operational perspectives using factor analysis. , 2021, Accident; analysis and prevention.

[33]  Xiaoyu Guo,et al.  Generalized criteria for evaluating hotspot identification methods. , 2020, Accident; analysis and prevention.

[34]  Ramón Díaz-Uriarte,et al.  Gene selection and classification of microarray data using random forest , 2006, BMC Bioinformatics.

[35]  Mohamed Abdel-Aty,et al.  Aggregate nonparametric safety analysis of traffic zones. , 2012, Accident; analysis and prevention.

[36]  Shaw-Pin Miaou,et al.  Modeling Traffic Crash-Flow Relationships for Intersections: Dispersion Parameter, Functional Form, and Bayes Versus Empirical Bayes Methods , 2003 .

[37]  Srinivas R. Geedipally,et al.  Effects of Varying Dispersion Parameter of Poisson–Gamma Models on Estimation of Confidence Intervals of Crash Prediction Models , 2008 .

[38]  Salissou Moutari,et al.  Driver distraction is an under-reported cause of road accidents: An examination of the discrepancy between police officers' views and road accident reports , 2017 .