Robust and Subject-Independent Driving Manoeuvre Anticipation through Domain-Adversarial Recurrent Neural Networks

Abstract Through deep learning and computer vision techniques, driving manoeuvres can be predicted accurately a few seconds in advance. Even though adapting a learned model to new drivers and different vehicles is key for robust driver-assistance systems, this problem has received little attention so far. This work proposes to tackle this challenge through domain adaptation, a technique closely related to transfer learning. A proof of concept for the application of a Domain-Adversarial Recurrent Neural Network (DA-RNN) to multi-modal time series driving data is presented, in which domain-invariant features are learned by maximising the loss of an auxiliary domain classifier. Our implementation is evaluated using a leave-one-driver-out approach on individual drivers from the Brain4Cars dataset, as well as using a new dataset acquired through driving simulations, yielding an average increase in performance of 30% and 114% respectively compared to no adaptation. We also show the importance of fine-tuning sections of the network to optimise the extraction of domain-independent features. The results demonstrate the applicability of the approach to driver-assistance systems as well as training and simulation environments.

[1]  Christopher Kermorvant,et al.  Where to apply dropout in recurrent neural networks for handwriting recognition? , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[2]  Roland Kuhn,et al.  Discriminative Instance Weighting for Domain Adaptation in Statistical Machine Translation , 2010, EMNLP.

[3]  ChengXiang Zhai,et al.  Instance Weighting for Domain Adaptation in NLP , 2007, ACL.

[4]  Mohan M. Trivedi,et al.  On-road prediction of driver's intent with multimodal sensory cues , 2011, IEEE Pervasive Computing.

[5]  G. Gredebäck,et al.  Eye Movements During Action Observation , 2015, Perspectives on psychological science : a journal of the Association for Psychological Science.

[6]  Mohan M. Trivedi,et al.  Looking-in and looking-out vision for Urban Intelligent Assistance: Estimation of driver attentive state and dynamic surround for safe merging and braking , 2014, 2014 IEEE Intelligent Vehicles Symposium Proceedings.

[7]  Peter Robinson,et al.  OpenFace: An open source facial behavior analysis toolkit , 2016, 2016 IEEE Winter Conference on Applications of Computer Vision (WACV).

[8]  Ashutosh Saxena,et al.  Robobarista: Object Part Based Transfer of Manipulation Trajectories from Crowd-Sourcing in 3D Pointclouds , 2015, ISRR.

[9]  Trevor Darrell,et al.  Simultaneous Deep Transfer Across Domains and Tasks , 2015, ICCV.

[10]  Jürgen Schmidhuber,et al.  Learning to forget: continual prediction with LSTM , 1999 .

[11]  Junqiang Xi,et al.  Human-Centered Feed-Forward Control of a Vehicle Steering System Based on a Driver's Path-Following Characteristics , 2017, IEEE Transactions on Intelligent Transportation Systems.

[12]  Alexander Yates,et al.  Distributional Representations for Handling Sparsity in Supervised Sequence-Labeling , 2009, ACL.

[13]  Philip S. Yu,et al.  Deep Learning of Transferable Representation for Scalable Domain Adaptation , 2016, IEEE Transactions on Knowledge and Data Engineering.

[14]  ZuWhan Kim,et al.  Robust Lane Detection and Tracking in Challenging Scenarios , 2008, IEEE Transactions on Intelligent Transportation Systems.

[15]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[16]  Yoonsook Hwang,et al.  A Validation Study on a Subjective Driving Workload Prediction Tool , 2014, IEEE Transactions on Intelligent Transportation Systems.

[17]  Christian Wolf,et al.  Sequential Deep Learning for Human Action Recognition , 2011, HBU.

[18]  Yuan Shi,et al.  Geodesic flow kernel for unsupervised domain adaptation , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Marco Gamba,et al.  BORIS: a free, versatile open‐source event‐logging software for video/audio coding and live observations , 2016 .

[20]  R. Johansson,et al.  Action plans used in action observation , 2003, Nature.

[21]  Sebastian Ruder,et al.  An overview of gradient descent optimization algorithms , 2016, Vestnik komp'iuternykh i informatsionnykh tekhnologii.

[22]  Trevor Darrell,et al.  Adapting Visual Category Models to New Domains , 2010, ECCV.

[23]  A. ZelinskyResearch,et al.  The Essential Components of Human-Friendly Robot Systems , 1999 .

[24]  Rama Chellappa,et al.  Visual Domain Adaptation: A survey of recent advances , 2015, IEEE Signal Processing Magazine.

[25]  Brian C. Lovell,et al.  Unsupervised Domain Adaptation by Domain Invariant Projection , 2013, 2013 IEEE International Conference on Computer Vision.

[26]  M.M. Trivedi,et al.  Looking-in and looking-out of a vehicle: selected investigations in computer vision based enhanced vehicle safety , 2005, IEEE International Conference on Vehicular Electronics and Safety, 2005..

[27]  Mohan M. Trivedi,et al.  Dynamic Probabilistic Drivability Maps for Lane Change and Merge Driver Assistance , 2014, IEEE Transactions on Intelligent Transportation Systems.

[28]  Masamichi Shimosaka,et al.  Modeling risk anticipation and defensive driving on residential roads with inverse reinforcement learning , 2014, 17th International IEEE Conference on Intelligent Transportation Systems (ITSC).

[29]  Amandeep Kaur,et al.  Face detection techniques: a review , 2018, Artificial Intelligence Review.

[30]  Qiang Ji,et al.  A joint cascaded framework for simultaneous eye detection and eye state estimation , 2017, Pattern Recognit..

[31]  Tinne Tuytelaars,et al.  Unsupervised Visual Domain Adaptation Using Subspace Alignment , 2013, 2013 IEEE International Conference on Computer Vision.

[32]  Junqiang Xi,et al.  Statistical Pattern Recognition for Driving Styles Based on Bayesian Probability and Kernel Density Estimation , 2016, ArXiv.

[33]  Koby Crammer,et al.  Analysis of Representations for Domain Adaptation , 2006, NIPS.

[34]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[35]  Kazuya Takeda,et al.  Driver-Behavior Modeling Using On-Road Driving Data: A new application for behavior signal processing , 2016, IEEE Signal Processing Magazine.

[36]  Koby Crammer,et al.  A theory of learning from different domains , 2010, Machine Learning.

[37]  David R. Thom,et al.  MOTORCYCLE ACCIDENT CAUSE FACTORS AND IDENTIFICATION OF COUNTERMEASURES. VOLUME II: APPENDIX/SUPPLEMENTAL DATA , 1980 .

[38]  Luke Fletcher,et al.  Driver Inattention Detection based on Eye Gaze—Road Event Correlation , 2009, Int. J. Robotics Res..

[39]  Wojciech Zaremba,et al.  An Empirical Exploration of Recurrent Network Architectures , 2015, ICML.

[40]  Massimo Bergamasco,et al.  Fork-lift truck simulator for training in industrial environment , 2005, 2005 IEEE Conference on Emerging Technologies and Factory Automation.

[41]  François Laviolette,et al.  Domain-Adversarial Training of Neural Networks , 2015, J. Mach. Learn. Res..

[42]  Andrew Y. Ng,et al.  Parsing Natural Scenes and Natural Language with Recursive Neural Networks , 2011, ICML.

[43]  Guy Lapalme,et al.  A systematic analysis of performance measures for classification tasks , 2009, Inf. Process. Manag..

[44]  Kilian Q. Weinberger,et al.  Marginalized Denoising Autoencoders for Domain Adaptation , 2012, ICML.

[45]  Yoshua Bengio,et al.  Gated Feedback Recurrent Neural Networks , 2015, ICML.

[46]  Junqiang Xi,et al.  Statistical-based approach for driving style recognition using Bayesian probability with kernel density estimation , 2019 .

[47]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[48]  Gang Wang,et al.  Spatio-Temporal LSTM with Trust Gates for 3D Human Action Recognition , 2016, ECCV.

[49]  Min Sun,et al.  Anticipating Accidents in Dashcam Videos , 2016, ACCV.

[50]  Juhan Nam,et al.  Multimodal Deep Learning , 2011, ICML.