Machine Learning and End-to-End Deep Learning for Monitoring Driver Distractions From Physiological and Visual Signals

It is only a matter of time until autonomous vehicles become ubiquitous; however, human driving supervision will remain a necessity for decades. To assess the driver’s ability to take control over the vehicle in critical scenarios, driver distractions can be monitored using wearable sensors or sensors that are embedded in the vehicle, such as video cameras. The types of driving distractions that can be sensed with various sensors is an open research question that this study attempts to answer. This study compared data from physiological sensors (palm electrodermal activity (pEDA), heart rate and breathing rate) and visual sensors (eye tracking, pupil diameter, nasal EDA (nEDA), emotional activation and facial action units (AUs)) for the detection of four types of distractions. The dataset was collected in a previous driving simulation study. The statistical tests showed that the most informative feature/modality for detecting driver distraction depends on the type of distraction, with emotional activation and AUs being the most promising. The experimental comparison of seven classical machine learning (ML) and seven end-to-end deep learning (DL) methods, which were evaluated on a separate test set of 10 subjects, showed that when classifying windows into distracted or not distracted, the highest F1-score of 79% was realized by the extreme gradient boosting (XGB) classifier using 60-second windows of AUs as input. When classifying complete driving sessions, XGB’s F1-score was 94%. The best-performing DL model was a spectro-temporal ResNet, which realized an F1-score of 75% when classifying segments and an F1-score of 87% when classifying complete driving sessions. Finally, this study identified and discussed problems, such as label jitter, scenario overfitting and unsatisfactory generalization performance, that may adversely affect related ML approaches.

[1]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[2]  Jeffrey J. Rodriguez,et al.  Drive-Net: Convolutional Network for Driver Distraction Detection , 2018, 2018 IEEE Southwest Symposium on Image Analysis and Interpretation (SSIAI).

[3]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[4]  I. Pavlidis,et al.  Causality Effects of Interventions and Stressors on Driving Behaviors under Typical Conditions , 2018, Mathematics.

[5]  Yoshua Bengio,et al.  A Neural Probabilistic Language Model , 2003, J. Mach. Learn. Res..

[6]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[7]  Russell Beale,et al.  Affect and Emotion in Human-Computer Interaction, From Theory to Applications , 2008, Affect and Emotion in Human-Computer Interaction.

[8]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[9]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[10]  Mitja Lustrek,et al.  Blood Pressure Estimation from Photoplethysmogram Using a Spectro-Temporal Deep Neural Network , 2019, Sensors.

[11]  Thierry Derrmann,et al.  Driver Behavior Profiling Using Smartphones: A Low-Cost Platform for Driver Monitoring , 2015, IEEE Intelligent Transportation Systems Magazine.

[12]  Luis M. Bergasa,et al.  Real-time system for monitoring driver vigilance , 2005, ISIE 2005.

[13]  Makoto Itoh,et al.  Driver Assistance System With a Dual Control Scheme: Effectiveness of Identifying Driver Drowsiness and Preventing Lane Departure Accidents , 2016, IEEE Transactions on Human-Machine Systems.

[14]  Jacob Scharcanski,et al.  Yawning Detection Using Embedded Smart Cameras , 2016, IEEE Transactions on Instrumentation and Measurement.

[15]  Charlene Hallett,et al.  Driver distraction and driver inattention: definition, relationship and taxonomy. , 2011, Accident; analysis and prevention.

[16]  Sangram Ganguly,et al.  DeepSD: Generating High Resolution Climate Change Projections through Single Image Super-Resolution , 2017, KDD.

[17]  Jianqiang Wang,et al.  A Forward Collision Warning Algorithm With Adaptation to Driver Behaviors , 2016, IEEE Transactions on Intelligent Transportation Systems.

[18]  Martin Gjoreski,et al.  Machine Learning and End-to-End Deep Learning for the Detection of Chronic Heart Failure From Heart Sounds , 2020, IEEE Access.

[19]  Dong Yu,et al.  Deep Learning: Methods and Applications , 2014, Found. Trends Signal Process..

[20]  Mohan M. Trivedi,et al.  Head Pose Estimation and Augmented Reality Tracking: An Integrated System and Evaluation for Monitoring Driver Awareness , 2010, IEEE Transactions on Intelligent Transportation Systems.

[21]  Erik Cambria,et al.  Recent Trends in Deep Learning Based Natural Language Processing , 2017, IEEE Comput. Intell. Mag..

[22]  I. Pavlidis,et al.  Dissecting Driver Behaviors Under Cognitive, Emotional, Sensorimotor, and Mixed Stressors , 2016, Scientific Reports.

[23]  Jürgen Schmidhuber,et al.  Recurrent nets that time and count , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[24]  Anwar M. Mirza,et al.  Fully automated real time fatigue detection of drivers through Fuzzy Expert Systems , 2014, Appl. Soft Comput..

[25]  Gregory D. Abowd,et al.  Handling annotation uncertainty in human activity recognition , 2019, UbiComp.

[26]  Andrei Aksjonov,et al.  Detection and Evaluation of Driver Distraction Using Machine Learning and Fuzzy Logic , 2019, IEEE Transactions on Intelligent Transportation Systems.

[27]  Jennifer Healey,et al.  Detecting stress during real-world driving tasks using physiological sensors , 2005, IEEE Transactions on Intelligent Transportation Systems.

[28]  Yann LeCun,et al.  What is the best multi-stage architecture for object recognition? , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[29]  F. Wilcoxon Individual Comparisons by Ranking Methods , 1945 .

[30]  Richard J. Hanowski,et al.  Driver Distraction in Commercial Motor Vehicle Operations , 2017 .

[31]  Ning Qian,et al.  On the momentum term in gradient descent learning algorithms , 1999, Neural Networks.

[32]  Hongxun Yao,et al.  Deep feature extraction and combination for remote sensing image classification based on pre-trained CNN models , 2017, International Conference on Digital Image Processing.

[33]  Sriram Chellappan,et al.  Detecting Distracted Driving Using a Wrist-Worn Wearable , 2018, 2018 IEEE International Conference on Pervasive Computing and Communications Workshops (PerCom Workshops).

[34]  Björn W. Schuller,et al.  Emotion on the Road - Necessity, Acceptance, and Feasibility of Affective Computing in the Car , 2010, Adv. Hum. Comput. Interact..

[35]  Gamini Dissanayake,et al.  Driver Drowsiness Classification Using Fuzzy Wavelet-Packet-Based Feature-Extraction Algorithm , 2011, IEEE Transactions on Biomedical Engineering.

[36]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[37]  Honglak Lee,et al.  Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations , 2009, ICML '09.

[38]  J. Ross Quinlan,et al.  Improved Use of Continuous Attributes in C4.5 , 1996, J. Artif. Intell. Res..

[39]  Andrew J Belyavin,et al.  Modeling performance and alertness: the QinetiQ approach. , 2004, Aviation, space, and environmental medicine.

[40]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[41]  Shahzad Anwar,et al.  Driver Fatigue Detection Systems: A Review , 2019, IEEE Transactions on Intelligent Transportation Systems.

[42]  Frans Coenen,et al.  Driving posture recognition by convolutional neural networks , 2015, 2015 11th International Conference on Natural Computation (ICNC).

[43]  Jeffrey F. Cohn,et al.  Observer-based measurement of facial expression with the Facial Action Coding System. , 2007 .

[44]  D. Kibler,et al.  Instance-based learning algorithms , 2004, Machine Learning.

[45]  Jürgen Schmidhuber,et al.  Learning to Forget: Continual Prediction with LSTM , 2000, Neural Computation.

[46]  Archisman Sarkar,et al.  A health monitoring system using multiple non-contact ECG sensors for automotive drivers , 2016, 2016 IEEE International Instrumentation and Measurement Technology Conference Proceedings.

[47]  Wan-Young Chung,et al.  Standalone Wearable Driver Drowsiness Detection System in a Smartwatch , 2016, IEEE Sensors Journal.

[48]  Jia Xu,et al.  Learning to See in the Dark , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[49]  Andreas Savakis,et al.  Distracted Driver Detection: Deep Learning vs Handcrafted Features , 2017 .

[50]  Ute Schmid,et al.  A Practical Approach to Fuse Shape and Appearance Information in a Gaussian Facial Action Estimation Framework , 2016, ECAI.

[51]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[52]  Alexander Zelinsky,et al.  Vision In and Out of Vehicles: Integrated Driver and Road Scene Monitoring , 2002, ISER.

[53]  Sergey Ioffe,et al.  Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning , 2016, AAAI.

[54]  Taxonomy and definitions for terms related to driving automation systems for on-road motor vehicles , 2022 .

[55]  A. Borbély A two process model of sleep regulation. , 1982, Human neurobiology.

[56]  Arief Koesdwiady,et al.  End-to-End Deep Learning for Driver Distraction Recognition , 2017, ICIAR.

[57]  M. N. Doja,et al.  Detecting distraction of drivers using Convolutional Neural Network , 2020, Pattern Recognit. Lett..

[58]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[59]  Wan-Young Chung,et al.  Smartwatch-Based Driver Vigilance Indicator With Kernel-Fuzzy-C-Means-Wavelet Method , 2016, IEEE Sensors Journal.

[60]  Katherine B. Martin,et al.  Facial Action Coding System , 2015 .

[61]  Matjaz Gams,et al.  Monitoring stress with a wrist device using context , 2017, J. Biomed. Informatics.

[62]  W S McCulloch,et al.  A logical calculus of the ideas immanent in nervous activity , 1990, The Philosophy of Artificial Intelligence.

[63]  Fernando De la Torre,et al.  Driver Gaze Tracking and Eyes Off the Road Detection System , 2015, IEEE Transactions on Intelligent Transportation Systems.

[64]  Dvijesh Shastri,et al.  Perinasal Imaging of Physiological Stress and Its Affective Potential , 2012, IEEE Transactions on Affective Computing.

[65]  Yan Zhou,et al.  Spatiotemporal Smoothing as a Basis for Facial Tissue Tracking in Thermal Imaging , 2013, IEEE Transactions on Biomedical Engineering.

[66]  Xuelong Li,et al.  Speed up deep neural network based pedestrian detection by sharing features across multi-scale models , 2016, Neurocomputing.

[67]  Rubén Usamentiaga,et al.  Driver Distraction Using Visual-Based Sensors and Algorithms , 2016, Sensors.

[68]  Jani Bizjak,et al.  Classical and deep learning methods for recognizing human activities and modes of transportation with smartphone sensors , 2020, Inf. Fusion.

[69]  Mubarak Shah,et al.  Determining driver visual attention with one camera , 2003, IEEE Trans. Intell. Transp. Syst..

[70]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[71]  Karen Kafadar,et al.  Letter-Value Plots: Boxplots for Large Data , 2017 .

[72]  Wan-Young Chung,et al.  Driver Alertness Monitoring Using Fusion of Facial Features and Bio-Signals , 2012, IEEE Sensors Journal.

[73]  Chin-Teng Lin,et al.  Development of Wireless Brain Computer Interface With Embedded Multitask Scheduling and its Application on Real-Time Driver's Drowsiness Detection and Warning , 2008, IEEE Transactions on Biomedical Engineering.

[74]  Tong Zhang,et al.  An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods , 2001, AI Mag..