论文信息 - Interim report ( COMP 4801 ) AUTONOMOUS DRIFTING RC CAR WITH REINFORCEMENT LEARNING

Interim report ( COMP 4801 ) AUTONOMOUS DRIFTING RC CAR WITH REINFORCEMENT LEARNING

The advent of self-driving cars has pushed the boundaries on how safe passenger automobiles can be, but most modern self-driving car systems ignore the possibility of a car slipping resulting from inclement weather or driver error. Passengers and bystanders would benefit heavily if self-driving cars could handle slipping by learning to drift with the turn rather than against it (by applying the brakes, or turning away which is the instinctive action), preventing many fatalities [1]. Our project is aimed at studying the drifting (over steering of a car that results in the loss of traction of the rear wheels) of an autonomous remote controlled (RC) car. We use reinforcement learning techniques and algorithms to design a controller for an RC car that learns to drift without human intervention. Reinforcement learning is a branch of machine learning that primarily deals with learning a control agent from trial-and-error, much like how humans learn by interacting with the environment. Reinforcement learning has in recent years been used to learn all sorts of robotic controllers and even defeat the best human player at Go. It is an exciting realm of machine learning, and we decided on using it to teach an RC car to maintain a steady state circular drift. As for the technique employed, we use double dueling deep Q-networks and Q-learning as our primary algorithm. However, using reinforcement learning (RL) typically requires many interactions with the environment before learning anything useful. Since robotic systems are prone to wear with use, we implemented a simulator by modeling the car dynamics, where we run most iterations of the learning algorithm. In addition, since it is imperative to define the reward function appropriately to make sure that our agent learns the right behaviour in the shortest time possible, we also use potential based reward shaping to shape the rewards the agent receives.

D. Schnieders | S. Bhattacharjee | Kanak Dipak Kabara | Rachit Jain

[1] Jürgen Ackermann,et al. Robust control prevents car skidding , 1997 .

[2] S. Kawakami,et al. Proposal of driver assistance system for recovering vehicle stability from unstable states by automatic steering , 1999, Proceedings of the IEEE International Vehicle Electronics Conference (IVEC'99) (Cat. No.99EX257).

[3] Klaus Landesfeind,et al. Vehicle Stabilization by the Vehicle Dynamics Control System ESP , 2000 .

[4] Michael I. Jordan,et al. PEGASUS: A policy search method for large MDPs and POMDPs , 2000, UAI.

[5] Aleksander B. Hac,et al. IMPROVEMENTS IN VEHICLE HANDLING THROUGH INTEGRATED CONTROL OF CHASSIS SYSTEMS , 2002 .

[6] Andrew Y. Ng,et al. Shaping and policy search in reinforcement learning , 2003 .

[7] Ansgar Trächtler,et al. Integrated vehicle dynamics control using active brake, steering and suspension systems , 2004 .

[8] Andrew W. Moore,et al. Locally Weighted Learning , 1997, Artificial Intelligence Review.

[9] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[10] Zhang Lijun,et al. Integrated Chassis Control System for Improving Vehicle Stability , 2006, 2006 IEEE International Conference on Vehicular Electronics and Safety.

[11] Emilio Frazzoli,et al. On steady-state cornering equilibria for wheeled vehicles with drift , 2009, Proceedings of the 48h IEEE Conference on Decision and Control (CDC) held jointly with 2009 28th Chinese Control Conference.

[12] Carl E. Rasmussen,et al. Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[13] Emilio Frazzoli,et al. Steady-state drifting stabilization of RWD vehicles , 2011 .

[14] Alex Graves,et al. Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[15] J. Christian Gerdes,et al. A Controller Framework for Autonomous Drifting: Design, Stability, and Experimental Validation , 2014 .

[16] Carl E. Rasmussen,et al. Gaussian Processes for Data-Efficient Learning in Robotics and Control , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17] Shubhayu Saha,et al. Adverse weather conditions and fatal motor vehicle crashes in the United States, 1994-2012 , 2016, Environmental Health.

[18] David Silver,et al. Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.

[19] Tom Schaul,et al. Dueling Network Architectures for Deep Reinforcement Learning , 2015, ICML.

[20] Jonathan P. How,et al. Autonomous drifting using simulation-aided reinforcement learning , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[21] Wu-Sung Yao,et al. Design of a Drift Assist Control System Applied to Remote Control Car , 2016 .

[22] Fan Zhang,et al. Autonomous Drift Cornering with Mixed Open-loop and Closed-loop Control , 2017 .