Paper information - Closing the gap towards end-to-end autonomous vehicle system

Designing a driving policy for autonomous vehicles is a difficult task. Recent studies have suggested end-to-end (E2E) training of a policy that predicts car actuator commands directly from raw sensory inputs. The approach is appealing because labeled data is easy to collect and handcrafted features are avoided. However, drawbacks in interpretability, safety enforcement, and learning efficiency limit its practical application. In this paper, we amend the basic E2E architecture to address these shortcomings while retaining the power of end-to-end learning. A key element of our proposed architecture is the formulation of the learning problem as learning a trajectory. We also apply a Gaussian mixture model loss to contend with multi-modal data, and adopt a financial risk measure, conditional value at risk (CVaR), to emphasize rare events. We analyze the effect of each concept and present driving performance in a highway scenario in the TORCS simulator. A video is available at this link: this https URL
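To make the two loss concepts named in the abstract more concrete, the following is a minimal sketch, not the paper's implementation. It assumes a one-dimensional target (for example, a single trajectory coordinate), a mixture whose weights, means, and standard deviations are already given, and an empirical CVaR computed as the mean of the worst fraction of per-sample losses. The function names `gmm_nll` and `cvar` and the parameter `alpha` are hypothetical, chosen only for illustration.

```python
import math


def gmm_nll(target, weights, means, sigmas):
    """Negative log-likelihood of a scalar target under a 1-D Gaussian mixture.

    Assumption: weights are positive and sum to 1, sigmas are positive.
    """
    log_terms = []
    for w, mu, s in zip(weights, means, sigmas):
        # Log density of a single Gaussian component at the target.
        log_pdf = (-0.5 * ((target - mu) / s) ** 2
                   - math.log(s)
                   - 0.5 * math.log(2.0 * math.pi))
        log_terms.append(math.log(w) + log_pdf)
    # Log-sum-exp over components for numerical stability.
    m = max(log_terms)
    return -(m + math.log(sum(math.exp(t - m) for t in log_terms)))


def cvar(losses, alpha):
    """Empirical CVaR: mean of the worst (1 - alpha) fraction of the losses.

    Assumption: alpha is in [0, 1); at least one loss is always kept.
    """
    # Rounding guards against floating-point noise such as 3.0000000000000004.
    n_tail = max(1, math.ceil(round((1.0 - alpha) * len(losses), 9)))
    worst = sorted(losses, reverse=True)[:n_tail]
    return sum(worst) / n_tail


if __name__ == "__main__":
    # Bimodal example: the expert steers either left (-1) or right (+1).
    # A single wide Gaussian centred at 0 explains the target worse
    # than a two-component mixture with one mode near each option.
    single = gmm_nll(1.0, [1.0], [0.0], [1.0])
    mixture = gmm_nll(1.0, [0.5, 0.5], [-1.0, 1.0], [0.2, 0.2])
    print("single-Gaussian NLL:", single)
    print("mixture NLL:", mixture)

    # Averaging only the worst 40% of losses emphasises the rare, large one.
    batch_losses = [1.0, 2.0, 3.0, 4.0, 10.0]
    print("mean loss:", sum(batch_losses) / len(batch_losses))
    print("CVaR at alpha=0.6:", cvar(batch_losses, 0.6))
```

In this sketch the mixture loss rewards placing probability mass on each plausible mode rather than on their average, and the CVaR objective replaces the batch mean with the mean of the tail, so rare high-loss samples dominate the training signal. How the paper combines these terms, and which tail level it uses, is not stated in the abstract.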
