Driver identification using in-vehicle digital data in the forensic context of a hit and run accident

Abstract One major focus in forensics is the identification of individuals based on different kinds of evidence found at a crime scene and in the digital domain. In the present study, we assessed the potential of using in-vehicle digital data to capture the natural driving behavior of individuals in order to identify them. Freely available data was used to classify drivers by their natural driving behavior. We formulated a forensic scenario of a hit and run car accident with three known suspects. Suggestions are provided for an understandable and useful reporting of model results in the light of the requirements in digital forensics. Specific aims of this study were 1) to develop a workflow for driver identification in digital forensics, 2) to apply a simple but sound method for model validation with time series data and 3) to transfer the model results to answers to the two forensic questions a) to which suspect does the evidence most likely belong to and b) how certain is the evidence claim. Based on freely available data (Kwak et al., 2017) the first question could be answered by unsupervised classification using a random forest model validated by random block splitting. To answer the second question we used model accuracy and false detection rate (FDR) which were 93% and 7%, respectively. Furthermore, we reported the random match probability (RMP) as well as the opportunity of a visual interpretation of the prediction on the time series for the evidence data in our hypothetical hit and run accident.

[1]  José Manuel Benítez,et al.  On the use of cross-validation for time series predictor evaluation , 2012, Inf. Sci..

[2]  Rok Sosic,et al.  Driver identification using automobile sensor data from a single turn , 2016, 2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC).

[3]  Yinghui Yang,et al.  Web user behavioral profiling for user identification , 2010, Decis. Support Syst..

[4]  Wen-Hui Chen,et al.  Comparisons of Machine Learning Algorithms for Driving Behavior Recognition Using In-Vehicle CAN Bus Data , 2019, 2019 Joint 8th International Conference on Informatics, Electronics & Vision (ICIEV) and 2019 3rd International Conference on Imaging, Vision & Pattern Recognition (icIVPR).

[5]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[6]  Ferat Sahin,et al.  A survey on feature selection methods , 2014, Comput. Electr. Eng..

[7]  Jacques Droulez,et al.  Role of Lateral Acceleration in Curve Driving: Driver Model and Experiments on a Real Vehicle and a Driving Simulator , 2001, Hum. Factors.

[8]  Kai Zhang,et al.  Driver Evaluation and Identification Based on Driving Behavior Data , 2018, 2018 5th International Conference on Information Science and Control Engineering (ICISCE).

[9]  W. Thompson,et al.  Lay understanding of forensic statistics: Evaluation of random match probabilities, likelihood ratios, and verbal equivalents. , 2015, Law and human behavior.

[10]  Jeremy S. Daily,et al.  Automobile Event Data Recorder Forensics , 2008, IFIP Int. Conf. Digital Forensics.

[11]  Hans Godthelp,et al.  Speed Choice and Steering Behavior in Curve Driving , 1996, Hum. Factors.

[12]  Douglas M. Hawkins,et al.  The Problem of Overfitting , 2004, J. Chem. Inf. Model..

[13]  Samuel Lindsey,et al.  The Random Match Probability (RMP) in DNA Evidence: Irrelevant and Prejudicial? , 1995 .

[14]  Klara Dolos,et al.  How canopy shadow affects invasive plant species classification in high spatial resolution remote sensing , 2019, Remote Sensing in Ecology and Conservation.

[15]  Hiok Chai Quek,et al.  Driving Profile Modeling and Recognition Based on Soft Computing Approach , 2009, IEEE Transactions on Neural Networks.

[16]  Damaris Zurell,et al.  Collinearity: a review of methods to deal with it and a simulation study evaluating their performance , 2013 .

[17]  Driss Matrouf,et al.  Forensic speaker recognition , 2009, IEEE Signal Process. Mag..

[18]  Tadayoshi Kohno,et al.  Automobile Driver Fingerprinting , 2016, Proc. Priv. Enhancing Technol..

[19]  Adrian D. C. Chan,et al.  Driver identification using vehicle acceleration and deceleration events from naturalistic driving of older drivers , 2017, 2017 IEEE International Symposium on Medical Measurements and Applications (MeMeA).

[20]  Oludare Isaac Abiodun,et al.  Digital Forensics: Review of Issues in Scientific Validation of Digital Evidence , 2018, J. Inf. Process. Syst..

[21]  Arun Kumar Sangaiah,et al.  Human behavior characterization for driving style recognition in vehicle system , 2020, Comput. Electr. Eng..

[22]  Carsten F. Dormann,et al.  Cross-validation strategies for data with temporal, spatial, hierarchical, or phylogenetic structure , 2017 .

[23]  Jessica J. Fridrich,et al.  Large scale test of sensor fingerprint camera identification , 2009, Electronic Imaging.

[24]  Klara Dolos,et al.  Silvicultural climatic turning point for European beech and sessile oak in Western Europe derived from national forest inventories , 2016 .

[25]  Rob J. Hyndman,et al.  A note on the validity of cross-validation for evaluating autoregressive time series prediction , 2018, Comput. Stat. Data Anal..