Autonomous Vehicle Steering Based on Evaluative Feedback by Reinforcement Learning

Steering an autonomous vehicle requires the permanent adaptation of behavior in relation to the various situations the vehicle is in. This paper describes a research which implements such adaptation and optimization based on Reinforcement Learning (RL) which in detail purely learns from evaluative feedback in contrast to instructive feedback. Convergence of the learning process has been achieved at various experimental results revealing the impact of the different RL parameters. While using RL for autonomous steering is in itself already a novelty, additional attention has been given to new proposals for post-processing and interpreting the experimental data.

[1]  Dean A. Pomerleau,et al.  Vision guided lane transition , 1995, Proceedings of the Intelligent Vehicles '95. Symposium.

[2]  Klaus-Dieter Kuhnert,et al.  Pattern matching as the nucleus for either autonomous driving or driver assistance systems , 2002, Intelligent Vehicle Symposium, 2002. IEEE.

[3]  Klaus-Dieter Kuhnert,et al.  Robust adaptive control of nonholonomic mobile robot with parameter and nonparameter uncertainties , 2005, IEEE Transactions on Robotics.

[4]  Shumeet Baluja,et al.  Expectation-based selective attention for visual monitoring and control of a robot vehicle , 1997, Robotics Auton. Syst..

[5]  Richard S. Sutton,et al.  Introduction to Reinforcement Learning , 1998 .

[6]  Dean Pomerleau,et al.  Efficient Training of Artificial Neural Networks for Autonomous Navigation , 1991, Neural Computation.

[7]  Klaus-Dieter Kuhnert,et al.  Reinforcement Learning to Drive a Car by Pattern Matching , 2002, DAGM-Symposium.

[8]  Klaus-Dieter Kuhnert,et al.  A Vision System for Real Time Road and Object Recognition for Vehicle Guidance , 1987, Other Conferences.

[9]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[10]  Dariu Gavrila,et al.  The Issues , 2011 .

[11]  Reinhold Behringer,et al.  The seeing passenger car 'VaMoRs-P' , 1994, Proceedings of the Intelligent Vehicles '94 Symposium.

[12]  Klaus-Dieter Kuhnert,et al.  Über die lernende Regelung autonomer Fahrzeuge mit neuronalen Netzen , 2003, AMS.

[13]  D. Pomerleau,et al.  MANIAC : A Next Generation Neurally Based Autonomous Road Follower , 1993 .

[14]  E D Dickmanns,et al.  AUTONOMOUS HIGH SPEED ROAD VEHICLE GUIDANCE BY COMPUTER VISION , 1987 .