Paper information - Improved Multi-Agent Reinforcement Learning for Minimizing Traffic Waiting Time


This paper describes the use of a multi-agent reinforcement learning (MARL) algorithm that learns traffic patterns in order to minimize travel time or maximize safety and to optimize the traffic pattern (OTP). The model provides a description of, and a solution to, the traffic pattern optimization problem using multi-agent reinforcement learning algorithms. MARL uses a multi-agent structure in which vehicles and traffic signals act as agents. In this model the traffic area is divided into separate traffic zones. Each zone has its own distributed agent, and these agents pass information from one zone to another through the network. The optimization objectives include the number of vehicle stops, the average waiting time, and the maximum queue length at the next intersection (node). In addition, this research introduces priority control for buses and emergency vehicles into the model. The expected outcome of the algorithm is comparable to the performance of Q-learning and temporal-difference learning. The results show a significant reduction in waiting time compared with those algorithms, indicating that the approach works more efficiently than other traffic systems.

General Terms: Learning Algorithm, Artificial Intelligence, Agent-based learning.
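The abstract names the ingredients (one agent per zone, information passed between zones, a cost built from vehicle stops, waiting time, and the queue length at the next intersection, plus priority for buses and emergency vehicles) but gives no formulas. The sketch below illustrates how those pieces could fit together using plain tabular Q-learning, which the abstract mentions only as a comparison baseline; it is not the paper's algorithm. The class `ZoneAgent`, the function `reward`, the weights, the priority values, and the state and action labels are all hypothetical.

```python
from collections import defaultdict
import random

# Assumed weights: the abstract lists the objectives but not how they are combined.
W_STOPS, W_WAIT, W_QUEUE = 1.0, 1.0, 0.5
# Assumed priority multipliers for buses and emergency vehicles.
PRIORITY_WEIGHT = {"car": 1.0, "bus": 3.0, "emergency": 10.0}


def reward(stops, waits, max_queue_next):
    """Return a negative cost; waits is a list of (vehicle_type, waiting_seconds)."""
    if waits:
        weighted_wait = sum(PRIORITY_WEIGHT[v] * w for v, w in waits) / len(waits)
    else:
        weighted_wait = 0.0
    return -(W_STOPS * stops + W_WAIT * weighted_wait + W_QUEUE * max_queue_next)


class ZoneAgent:
    """Tabular Q-learning agent for one traffic zone (illustrative stand-in)."""

    def __init__(self, actions, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
        self.actions = list(actions)
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.q = defaultdict(float)  # maps (state, action) to an estimated value
        self.rng = random.Random(seed)

    def act(self, state):
        # Epsilon-greedy choice over the signal phases.
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.actions)
        return max(self.actions, key=lambda a: self.q[(state, a)])

    def update(self, state, action, r, next_state):
        # Standard Q-learning temporal-difference update.
        best_next = max(self.q[(next_state, a)] for a in self.actions)
        td_error = r + self.gamma * best_next - self.q[(state, action)]
        self.q[(state, action)] += self.alpha * td_error
        return td_error


# Usage: the state pairs the zone's own queue level with the level reported
# by a neighbouring zone, standing in for the inter-zone information exchange.
agent = ZoneAgent(actions=["NS_green", "EW_green"])
state = ("queue_high", "neighbor_low")
r = reward(stops=2, waits=[("car", 10.0), ("bus", 4.0)], max_queue_next=3)
agent.update(state, "NS_green", r, ("queue_low", "neighbor_low"))
```

Folding the neighbour's report into the state keeps each agent's table local to its zone, which is one simple way to read "distributed agents that pass information"; the paper may coordinate zones differently.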
