Swarm reinforcement learning for traffic signal control based on cooperative multi-agent framework

Congestion, accidents, pollution, and many other problems caused by urban traffic occur every day in most cities around the world. The growing number of traffic lights at intersections requires efficient control, so automatic systems are essential for handling this task well. Agent-based technologies and reinforcement learning are widely used for modelling and controlling intelligent transportation systems, where each agent represents a traffic signal controller. Each agent learns to achieve its goal over many episodes. For a complicated learning problem, acquiring the optimal policy may take considerable computation time. In this paper, we use a population-based method such as particle swarm optimization in order to rapidly find the global optimal solution of multimodal functions with a wide solution space. Agents learn not only from their own experience but also by exchanging information with one another. Simulation results show that swarm Q-learning outperforms simple Q-learning, yielding a lower average delay time and a higher flow rate.
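The combination of individual Q-learning with information exchange among agents can be illustrated with a minimal sketch. The abstract does not give the update equations, so everything below is an assumption: the function names `q_update` and `swarm_update`, the coefficients `w`, `c1`, and `c2`, and the choice to apply the textbook particle swarm velocity rule directly to tabular Q-values. It is not the authors' implementation.

```python
import numpy as np


def q_update(q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """One standard tabular Q-learning step, updating q in place.

    alpha (learning rate) and gamma (discount factor) are illustrative values.
    """
    td_target = reward + gamma * np.max(q[next_state])
    q[state, action] += alpha * (td_target - q[state, action])
    return q


def swarm_update(q, velocity, personal_best, global_best, rng,
                 w=0.5, c1=1.5, c2=1.5):
    """Assumed information-exchange step: a textbook PSO update on a Q-table.

    Each agent's Q-table is treated as a particle position. It is pulled
    toward the best table the agent has found itself (personal_best) and
    toward the best table found by any agent (global_best).
    """
    r1 = rng.random(q.shape)  # random weights in [0, 1) for the personal term
    r2 = rng.random(q.shape)  # random weights in [0, 1) for the global term
    velocity = (w * velocity
                + c1 * r1 * (personal_best - q)
                + c2 * r2 * (global_best - q))
    return q + velocity, velocity


# Usage: one learning step, then one exchange step, for a single agent.
rng = np.random.default_rng(0)
q = np.zeros((2, 2))            # 2 states x 2 actions (toy sizes)
velocity = np.zeros_like(q)
q_update(q, state=0, action=1, reward=1.0, next_state=1)
personal_best = q.copy()
global_best = q + 1.0           # pretend another agent holds a better table
q_after, velocity = swarm_update(q, velocity, personal_best, global_best, rng)
```

In this sketch each agent would alternate between its own `q_update` steps during an episode and a `swarm_update` step between episodes. How the personal and global bests are scored (for example by average delay) is left open here because the abstract does not specify it.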
