论文信息 - Robust and Subject-Independent Driving Manoeuvre Anticipation through Domain-Adversarial Recurrent Neural Networks

Robust and Subject-Independent Driving Manoeuvre Anticipation through Domain-Adversarial Recurrent Neural Networks

Abstract Through deep learning and computer vision techniques, driving manoeuvres can be predicted accurately a few seconds in advance. Even though adapting a learned model to new drivers and different vehicles is key for robust driver-assistance systems, this problem has received little attention so far. This work proposes to tackle this challenge through domain adaptation, a technique closely related to transfer learning. A proof of concept for the application of a Domain-Adversarial Recurrent Neural Network (DA-RNN) to multi-modal time series driving data is presented, in which domain-invariant features are learned by maximising the loss of an auxiliary domain classifier. Our implementation is evaluated using a leave-one-driver-out approach on individual drivers from the Brain4Cars dataset, as well as using a new dataset acquired through driving simulations, yielding an average increase in performance of 30% and 114% respectively compared to no adaptation. We also show the importance of fine-tuning sections of the network to optimise the extraction of domain-independent features. The results demonstrate the applicability of the approach to driver-assistance systems as well as training and simulation environments.

[1] Christopher Kermorvant,et al. Where to apply dropout in recurrent neural networks for handwriting recognition? , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[2] Roland Kuhn,et al. Discriminative Instance Weighting for Domain Adaptation in Statistical Machine Translation , 2010, EMNLP.

[3] ChengXiang Zhai,et al. Instance Weighting for Domain Adaptation in NLP , 2007, ACL.

[4] Mohan M. Trivedi,et al. On-road prediction of driver's intent with multimodal sensory cues , 2011, IEEE Pervasive Computing.

[5] G. Gredebäck,et al. Eye Movements During Action Observation , 2015, Perspectives on psychological science : a journal of the Association for Psychological Science.

[6] Mohan M. Trivedi,et al. Looking-in and looking-out vision for Urban Intelligent Assistance: Estimation of driver attentive state and dynamic surround for safe merging and braking , 2014, 2014 IEEE Intelligent Vehicles Symposium Proceedings.

[7] Peter Robinson,et al. OpenFace: An open source facial behavior analysis toolkit , 2016, 2016 IEEE Winter Conference on Applications of Computer Vision (WACV).

[8] Ashutosh Saxena,et al. Robobarista: Object Part Based Transfer of Manipulation Trajectories from Crowd-Sourcing in 3D Pointclouds , 2015, ISRR.

[9] Trevor Darrell,et al. Simultaneous Deep Transfer Across Domains and Tasks , 2015, ICCV.

[10] Jürgen Schmidhuber,et al. Learning to forget: continual prediction with LSTM , 1999 .

[11] Junqiang Xi,et al. Human-Centered Feed-Forward Control of a Vehicle Steering System Based on a Driver's Path-Following Characteristics , 2017, IEEE Transactions on Intelligent Transportation Systems.

[12] Alexander Yates,et al. Distributional Representations for Handling Sparsity in Supervised Sequence-Labeling , 2009, ACL.

[13] Philip S. Yu,et al. Deep Learning of Transferable Representation for Scalable Domain Adaptation , 2016, IEEE Transactions on Knowledge and Data Engineering.

[14] ZuWhan Kim,et al. Robust Lane Detection and Tracking in Challenging Scenarios , 2008, IEEE Transactions on Intelligent Transportation Systems.

[15] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[16] Yoonsook Hwang,et al. A Validation Study on a Subjective Driving Workload Prediction Tool , 2014, IEEE Transactions on Intelligent Transportation Systems.

[17] Christian Wolf,et al. Sequential Deep Learning for Human Action Recognition , 2011, HBU.

[18] Yuan Shi,et al. Geodesic flow kernel for unsupervised domain adaptation , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[19] Marco Gamba,et al. BORIS: a free, versatile open‐source event‐logging software for video/audio coding and live observations , 2016 .

[20] R. Johansson,et al. Action plans used in action observation , 2003, Nature.

[21] Sebastian Ruder,et al. An overview of gradient descent optimization algorithms , 2016, Vestnik komp'iuternykh i informatsionnykh tekhnologii.

[22] Trevor Darrell,et al. Adapting Visual Category Models to New Domains , 2010, ECCV.

[23] A. ZelinskyResearch,et al. The Essential Components of Human-Friendly Robot Systems , 1999 .

[24] Rama Chellappa,et al. Visual Domain Adaptation: A survey of recent advances , 2015, IEEE Signal Processing Magazine.

[25] Brian C. Lovell,et al. Unsupervised Domain Adaptation by Domain Invariant Projection , 2013, 2013 IEEE International Conference on Computer Vision.

[26] M.M. Trivedi,et al. Looking-in and looking-out of a vehicle: selected investigations in computer vision based enhanced vehicle safety , 2005, IEEE International Conference on Vehicular Electronics and Safety, 2005..

[27] Mohan M. Trivedi,et al. Dynamic Probabilistic Drivability Maps for Lane Change and Merge Driver Assistance , 2014, IEEE Transactions on Intelligent Transportation Systems.

[28] Masamichi Shimosaka,et al. Modeling risk anticipation and defensive driving on residential roads with inverse reinforcement learning , 2014, 17th International IEEE Conference on Intelligent Transportation Systems (ITSC).

[29] Amandeep Kaur,et al. Face detection techniques: a review , 2018, Artificial Intelligence Review.

[30] Qiang Ji,et al. A joint cascaded framework for simultaneous eye detection and eye state estimation , 2017, Pattern Recognit..

[31] Tinne Tuytelaars,et al. Unsupervised Visual Domain Adaptation Using Subspace Alignment , 2013, 2013 IEEE International Conference on Computer Vision.

[32] Junqiang Xi,et al. Statistical Pattern Recognition for Driving Styles Based on Bayesian Probability and Kernel Density Estimation , 2016, ArXiv.

[33] Koby Crammer,et al. Analysis of Representations for Domain Adaptation , 2006, NIPS.

[34] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[35] Kazuya Takeda,et al. Driver-Behavior Modeling Using On-Road Driving Data: A new application for behavior signal processing , 2016, IEEE Signal Processing Magazine.

[36] Koby Crammer,et al. A theory of learning from different domains , 2010, Machine Learning.

[37] David R. Thom,et al. MOTORCYCLE ACCIDENT CAUSE FACTORS AND IDENTIFICATION OF COUNTERMEASURES. VOLUME II: APPENDIX/SUPPLEMENTAL DATA , 1980 .

[38] Luke Fletcher,et al. Driver Inattention Detection based on Eye Gaze—Road Event Correlation , 2009, Int. J. Robotics Res..

[39] Wojciech Zaremba,et al. An Empirical Exploration of Recurrent Network Architectures , 2015, ICML.

[40] Massimo Bergamasco,et al. Fork-lift truck simulator for training in industrial environment , 2005, 2005 IEEE Conference on Emerging Technologies and Factory Automation.

[41] François Laviolette,et al. Domain-Adversarial Training of Neural Networks , 2015, J. Mach. Learn. Res..

[42] Andrew Y. Ng,et al. Parsing Natural Scenes and Natural Language with Recursive Neural Networks , 2011, ICML.

[43] Guy Lapalme,et al. A systematic analysis of performance measures for classification tasks , 2009, Inf. Process. Manag..

[44] Kilian Q. Weinberger,et al. Marginalized Denoising Autoencoders for Domain Adaptation , 2012, ICML.

[45] Yoshua Bengio,et al. Gated Feedback Recurrent Neural Networks , 2015, ICML.

[46] Junqiang Xi,et al. Statistical-based approach for driving style recognition using Bayesian probability with kernel density estimation , 2019 .

[47] Geoffrey E. Hinton,et al. Deep Learning , 2015, Nature.

[48] Gang Wang,et al. Spatio-Temporal LSTM with Trust Gates for 3D Human Action Recognition , 2016, ECCV.

[49] Min Sun,et al. Anticipating Accidents in Dashcam Videos , 2016, ACCV.

[50] Juhan Nam,et al. Multimodal Deep Learning , 2011, ICML.