Perception as prediction using general value functions in autonomous driving applications

We propose and demonstrate a framework called perception as prediction for autonomous driving that uses general value functions (GVFs) to learn predictions. Perception as prediction learns data-driven predictions relating to the impact of actions on the agent’s perception of the world. It also provides a data-driven approach to predict the impact of the anticipated behavior of other agents on the world without explicitly learning their policy or intentions. We demonstrate perception as prediction by learning to predict an agent’s front safety and rear safety with GVFs, which encapsulate anticipation of the behavior of the vehicle in front and in the rear, respectively. The safety predictions are learned through random interactions in a simulated environment containing other agents. We show that these predictions can be used to produce similar control behavior to an LQR-based controller in an adaptive cruise control problem as well as provide advanced warning when the vehicle behind is approaching dangerously. The predictions are compact policy-based predictions that support prediction of the long term impact on safety when following a given policy. We analyze two controllers that use the learned predictions in a racing simulator to understand the value of the predictions and demonstrate their use in the real-world on a Clearpath Jackal robot and an autonomous vehicle platform.

[1]  Seungwuk Moon,et al.  Design, tuning, and evaluation of a full-range adaptive cruise control system with collision avoidance , 2009 .

[2]  Sergey Levine,et al.  Composable Action-Conditioned Predictors: Flexible Off-Policy Learning for Robot Navigation , 2018, CoRL.

[3]  Vladlen Koltun,et al.  Learning to Act by Predicting the Future , 2016, ICLR.

[4]  Richard S. Sutton,et al.  Learning to predict by the methods of temporal differences , 1988, Machine Learning.

[5]  Matthias Althoff,et al.  Adaptive Cruise Control with Safety Guarantees for Autonomous Vehicles , 2017 .

[6]  Feng Gao,et al.  A comprehensive review of the development of adaptive cruise control systems , 2010 .

[7]  Adam M White,et al.  DEVELOPING A PREDICTIVE APPROACH TO KNOWLEDGE , 2015 .

[8]  Bart De Schutter,et al.  Adaptive Cruise Control for a SMART Car: A Comparison Benchmark for MPC-PWA Control Methods , 2008, IEEE Transactions on Control Systems Technology.

[9]  Farbod Fahimi,et al.  Online human training of a myoelectric prosthesis controller via actor-critic reinforcement learning , 2011, 2011 IEEE International Conference on Rehabilitation Robotics.

[10]  Richard S. Sutton,et al.  Multi-timescale nexting in a reinforcement learning robot , 2011, Adapt. Behav..

[11]  Patrick M. Pilarski,et al.  Intelligent laser welding through representation, prediction, and control learning: An architecture with deep neural networks and reinforcement learning , 2016 .

[12]  Jin Zhu,et al.  Architecture Design and Implementation of an Autonomous Vehicle , 2018, IEEE Access.

[13]  Erik Poll,et al.  A Standard Driven Software Architecture for Fully Autonomous Vehicles , 2018, 2018 IEEE International Conference on Software Architecture Companion (ICSA-C).

[14]  A. Clark Whatever next? Predictive brains, situated agents, and the future of cognitive science. , 2013, The Behavioral and brain sciences.

[15]  Patrick M. Pilarski,et al.  Horde: a scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction , 2011, AAMAS.

[16]  M Maarten Steinbuch,et al.  Design and implementation of parameterized adaptive cruise control : an explicit model predictive control approach , 2010 .

[17]  Henggang Cui,et al.  Short-term Motion Prediction of Traffic Actors for Autonomous Driving using Deep Convolutional Networks , 2018 .