Distracted driver detection by combining in-vehicle and image data using deep learning

Abstract Distracted driving is among the most important reasons for traffic accidents today. Recently, there is an increasing interest in building driver assistance systems that detect the actions of the drivers and help them drive safer. In these studies, although some distinct data types such as the physical conditions of the driver, audio and visual features, car information are used; the main data source is the images of the driver that include the face, arms, and hands taken with a camera placed inside the car. In this work, we propose to integrate sensor data into the vision-based distracted driver detection model to improve the generalization ability of the system. With this purpose, we created a new data set that includes driver images and sensor data collected from real-world drives. Then, we constructed a two-stage distracted driving detection system to detect nine distracted behaviors. In the first stage, vision-based Convolutional Neural Network (CNN) models were created by transfer learning and fine-tuning methods. In the second stage, Long-Short Term Memory-Recurrent Neural Network (LSTM-RNN) models were created using sensor and image data together. We evaluate our system by two different fusion techniques and show that integrating sensor data to image-based driver detection significantly increases the overall performance with both of the fusion techniques. We also show that the accuracy of the vision-based model increases by fine-tuning the pre-trained CNN model using a related public dataset.

[1]  Avinash Wesley,et al.  A novel method to monitor driver's distractions , 2010, CHI EA '10.

[2]  Rafael A. Berri,et al.  A pattern recognition system for detecting use of mobile phones while driving , 2014, 2014 International Conference on Computer Vision Theory and Applications (VISAPP).

[3]  H. Abdul Shabeer,et al.  Averting mobile phone use while driving and technique to locate the mobile phone used vehicle , 2012 .

[4]  Yomi Kastro,et al.  Real-time prediction of online shoppers’ purchasing intention using multilayer perceptron and LSTM recurrent neural networks , 2019, Neural Computing and Applications.

[5]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.

[6]  Bailing Zhang,et al.  Recognition of driving postures by contourlet transform and random forests , 2012 .

[7]  Li Meng,et al.  Detecting Distracted Driving with Deep Learning , 2017, ICR.

[8]  K. Bora,et al.  Comparative assessment of CNN architectures for classification of breast FNAC images. , 2019, Tissue & cell.

[9]  Jianwei Yu,et al.  Gaussian Process Lstm Recurrent Neural Network Language Models for Speech Recognition , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[10]  Javier Echanobe,et al.  Driver identification and impostor detection based on driving behavior signals , 2016, 2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC).

[11]  Mohan M. Trivedi,et al.  Head, Eye, and Hand Patterns for Driver Activity Recognition , 2014, 2014 22nd International Conference on Pattern Recognition.

[12]  Mohan M. Trivedi,et al.  Driving style recognition using a smartphone as a sensor platform , 2011, 2011 14th International IEEE Conference on Intelligent Transportation Systems (ITSC).

[13]  Yoshua Bengio,et al.  Learning long-term dependencies with gradient descent is difficult , 1994, IEEE Trans. Neural Networks.

[14]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[15]  Xiu-Shen Wei,et al.  Mask-CNN: Localizing parts and selecting descriptors for fine-grained bird species categorization , 2018, Pattern Recognit..

[16]  Byung-Geun Lee,et al.  Background subtraction based on Gaussian mixture models using color and depth information , 2014, The 2014 International Conference on Control, Automation and Information Sciences (ICCAIS 2014).

[17]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[18]  Paulius Lengvenis,et al.  Driving style classification using long-term accelerometer information , 2014, 2014 19th International Conference on Methods and Models in Automation and Robotics (MMAR).

[19]  Mark S. Young,et al.  The impact of smart driving aids on driving performance and driver distraction , 2011 .

[20]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[21]  Amith Khandakar,et al.  Portable System for Monitoring and Controlling Driver Behavior and the Use of a Mobile Phone While Driving , 2019, Sensors.

[22]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[23]  Yongsheng Gao,et al.  Recognition of driving postures by multiwavelet transform and multilayer perceptron classifier , 2012, Eng. Appl. Artif. Intell..

[24]  Qiang Ji,et al.  Real-Time Eye, Gaze, and Face Pose Tracking for Monitoring Driver Vigilance , 2002, Real Time Imaging.

[25]  P. Ehrlich,et al.  Preventing distracted driving: A program from initiation through to evaluation. , 2020, American journal of surgery.

[26]  Önsen Toygar,et al.  On the use of DAG-CNN architecture for age estimation with multi-stage features fusion , 2019, Neurocomputing.

[27]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[28]  Xiang-Yang Li,et al.  You're driving and texting: detecting drivers using personal smart phones by leveraging inertial sensors , 2013, MobiCom.

[29]  Wu Jun,et al.  Fatigue driving recognition network: fatigue driving recognition via convolutional neural network and long short‐term memory units , 2019, IET Intelligent Transport Systems.

[30]  Bailing Zhang,et al.  Classification of Driving Postures by Support Vector Machines , 2011, 2011 Sixth International Conference on Image and Graphics.

[31]  Necip Gozuacik,et al.  Turkish Movie Genre Classification from Poster Images using Convolutional Neural Networks , 2019, 2019 11th International Conference on Electrical and Electronics Engineering (ELECO).

[32]  Stephanie Carey,et al.  Development and evaluation of a dynamic virtual reality driving simulator , 2015, PETRA.

[33]  Céline Craye,et al.  A Multi-Modal Driver Fatigue and Distraction Assessment System , 2015, International Journal of Intelligent Transportation Systems Research.

[34]  Lin Hong,et al.  Fatigue driving detection model based on multi‐feature fusion and semi‐supervised active learning , 2019, IET Intelligent Transport Systems.

[35]  Sriram Chellappan,et al.  Leveraging Smartphone Sensors to Detect Distracted Driving Activities , 2019, IEEE Transactions on Intelligent Transportation Systems.

[36]  Hesham M. Eraqi,et al.  Driver Distraction Identification with an Ensemble of Convolutional Neural Networks , 2019, Journal of Advanced Transportation.

[37]  Bailing Zhang,et al.  Erratum to: Recognition of driving postures by combined features and random subspace ensemble of multilayer perceptron classifiers , 2012, Neural Computing and Applications.

[38]  Imen Jegham,et al.  Safe Driving : Driver Action Recognition using SURF Keypoints , 2018, 2018 30th International Conference on Microelectronics (ICM).

[39]  H. M. N. Dilum Bandara,et al.  Cloud-based driver monitoring and vehicle diagnostic with OBD2 telematics , 2015, 2015 Fifteenth International Conference on Advances in ICT for Emerging Regions (ICTer).

[40]  Muhammad Qasim Khan,et al.  A Comprehensive Survey of Driving Monitoring and Assistance Systems , 2019, Sensors.

[41]  Feiran Huang,et al.  Sentiment analysis of social images via hierarchical deep fusion of content and links , 2019, Appl. Soft Comput..

[42]  Feiran Huang,et al.  Visual question answering via Attention-based syntactic structure tree-LSTM , 2019, Appl. Soft Comput..

[43]  Dongpu Cao,et al.  Driver Activity Recognition for Intelligent Vehicles: A Deep Learning Approach , 2019, IEEE Transactions on Vehicular Technology.

[44]  Prabhat Kumar,et al.  RNN based online handwritten word recognition in Devanagari and Bengali scripts using horizontal zoning , 2019, Pattern Recognit..

[45]  Ardhendu Behera,et al.  Context-driven Multi-stream LSTM (M-LSTM) for Recognizing Fine-Grained Activity of Drivers , 2018, GCPR.

[46]  Zunlei Feng,et al.  Neural Style Transfer: A Review , 2017, IEEE Transactions on Visualization and Computer Graphics.

[47]  Mara Tanelli,et al.  Quantitative Driving Style Estimation for Energy-Oriented Applications in Road Vehicles , 2013, 2013 IEEE International Conference on Systems, Man, and Cybernetics.

[48]  Mudhakar Srivatsa,et al.  Darnet: a deep learning solution for distracted driving detection , 2017, Middleware '17.

[49]  Pushpendra Singh,et al.  Using mobile phone sensors to detect driving behavior , 2013, ACM DEV '13.

[50]  Junichi Yamagishi,et al.  Investigating different representations for modeling and controlling multiple emotions in DNN-based speech synthesis , 2018, Speech Commun..

[51]  Fernando De la Torre,et al.  Driver Gaze Tracking and Eyes Off the Road Detection System , 2015, IEEE Transactions on Intelligent Transportation Systems.

[52]  Frank Gauterin,et al.  Online driving style recognition using fuzzy logic , 2014, 17th International IEEE Conference on Intelligent Transportation Systems (ITSC).

[53]  Frans Coenen,et al.  Driving posture recognition by convolutional neural networks , 2016, IET Comput. Vis..

[54]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[55]  Sergio M. Savaresi,et al.  Driving style estimation via inertial measurements , 2010, 13th International IEEE Conference on Intelligent Transportation Systems.

[56]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[57]  Ram Dantu,et al.  Safe Driving Using Mobile Phones , 2012, IEEE Transactions on Intelligent Transportation Systems.

[58]  Carlos Busso,et al.  Predicting Perceived Visual and Cognitive Distractions of Drivers With Multimodal Features , 2015, IEEE Transactions on Intelligent Transportation Systems.