Reinforcement learning: Introduction to theory and potential for transport applications

The aim of this paper is to develop insight into the potential of reinforcement learning (RL) agents and distributed reinforcement learning agents in the domain of transportation and traffic engineering and specifically in Intelligent Transport Systems (ITS). This paper provides a crystallized, comprehensive overview of the concept of RL and presents related successful applications in the field of traffic control and transportation engineering. It is divided into two parts: the first part provides a thorough overview of RL and its related methods and the second part reviews most recent applications of RL algorithms to the field of transportation engineering. Finally, it identifies many open research subjects in transportation in which the use of RL seems to be promising.Key words: reinforcement learning, machine learning, traffic control, artificial intelligence, intelligent transportation systems.

[1]  Kumpati S. Narendra,et al.  Learning automata - an introduction , 1989 .

[2]  John N. Tsitsiklis,et al.  Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.

[3]  Pablo L. Durango,et al.  Reinforcement learning models for transportation infrastructure management , 2002 .

[4]  Richard S. Sutton,et al.  Learning to predict by the methods of temporal differences , 1988, Machine Learning.

[5]  Leemon C. Baird,et al.  Residual Algorithms: Reinforcement Learning with Function Approximation , 1995, ICML.

[6]  Pat Langley,et al.  Learning distributed strategies for traffic control , 1998 .

[7]  Pat Langley,et al.  Learning Cooperative Lane Selection Strategies for Highways , 1998, AAAI/IAAI.

[8]  Wilfried Brauer,et al.  Fuzzy Model-Based Reinforcement Learning , 2002, Advances in Computational Intelligence and Learning.

[9]  Baher Abdulhai,et al.  REINFORCEMENT LEARNING FOR ITS : INTRODUCTION AND A CASE STUDY ON ADAPTIVE TRAFFIC SIGNAL CONTROL , 2001 .

[10]  Mark D. Pendrith Distributed reinforcement learning for a traffic engineering application , 2000, AGENTS '00.

[11]  Athanasios K. Ziliaskopoulos,et al.  Foundations of Dynamic Traffic Assignment: The Past, the Present and the Future , 2001 .

[12]  Ella Bingham Reinforcement learning in neurofuzzy traffic signal control , 2001, Eur. J. Oper. Res..

[13]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[14]  Baher Abdulhai,et al.  Reinforcement learning for true adaptive traffic signal control , 2003 .

[15]  Dana H. Ballard,et al.  An introduction to natural computation , 2000, Complex adaptive systems.

[16]  Gerald Tesauro,et al.  Temporal difference learning and TD-Gammon , 1995, CACM.

[17]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .