Recursive least-squares temporal difference learning for adaptive traffic signal control at intersection

[1]  Li Li,et al.  Traffic signal timing via deep reinforcement learning , 2016, IEEE/CAA Journal of Automatica Sinica.

[2]  Abdellah El Moudni,et al.  Forward search algorithm based on dynamic programming for real-time adaptive traffic signal control , 2015 .

[3]  Abbas Khosravi,et al.  A review on computational intelligence methods for controlling traffic signal timing , 2015, Expert Syst. Appl..

[4]  Baher Abdulhai,et al.  Design of Reinforcement Learning Parameters for Seamless Application of Adaptive Traffic Signal Control , 2014, J. Intell. Transp. Syst..

[5]  MengChu Zhou,et al.  Modular Design of Urban Traffic-Light Control Systems Based on Synchronized Timed Petri Nets , 2014, IEEE Transactions on Intelligent Transportation Systems.

[6]  Mohamed A. Khamis,et al.  Adaptive multi-objective reinforcement learning with hybrid exploration for traffic signal control based on cooperative multi-agent framework , 2014, Eng. Appl. Artif. Intell..

[7]  Xin Xu,et al.  Reinforcement learning algorithms with function approximation: Recent advances and applications , 2014, Inf. Sci..

[8]  Dongbin Zhao,et al.  Full-range adaptive cruise control based on supervised adaptive dynamic programming , 2014, Neurocomputing.

[9]  Baher Abdulhai,et al.  Multiagent Reinforcement Learning for Integrated Network of Adaptive Traffic Signal Controllers (MARLIN-ATSC): Methodology and Large-Scale Application on Downtown Toronto , 2013, IEEE Transactions on Intelligent Transportation Systems.

[10]  Mohamed A. Khamis,et al.  Enhanced multiagent multi-objective reinforcement learning for urban traffic light control , 2012, 2012 11th International Conference on Machine Learning and Applications.

[11]  Frank L. Lewis,et al.  Reinforcement learning and optimal adaptive control: An overview and implementation examples , 2012, Annu. Rev. Control..

[12]  José García-Nieto,et al.  Swarm intelligence for traffic light scheduling: Application to real urban areas , 2012, Eng. Appl. Artif. Intell..

[13]  H. He,et al.  Efficient Reinforcement Learning Using Recursive Least-Squares Methods , 2011, J. Artif. Intell. Res..

[14]  Shalabh Bhatnagar,et al.  Reinforcement Learning With Function Approximation for Traffic Signal Control , 2011, IEEE Transactions on Intelligent Transportation Systems.

[15]  T. Urbanik,et al.  Reinforcement learning-based multi-agent system for network traffic signal control , 2010 .

[16]  Peter T. Martin,et al.  Comparative Evaluation of Adaptive Traffic Control System Assessments Through Field and Microsimulation , 2010, J. Intell. Transp. Syst..

[17]  Chen Cai,et al.  Adaptive traffic signal control using approximate dynamic programming , 2009 .

[18]  Ana L. C. Bazzan,et al.  Opportunities for multiagent systems and multiagent reinforcement learning in traffic control , 2009, Autonomous Agents and Multi-Agent Systems.

[19]  Huaguang Zhang,et al.  Adaptive Dynamic Programming: An Introduction , 2009, IEEE Computational Intelligence Magazine.

[20]  Abdellah El Moudni,et al.  Discrete Methods for Urban Intersection Traffic Controlling , 2009, VTC Spring 2009 - IEEE 69th Vehicular Technology Conference.

[21]  Tao Li,et al.  Adaptive Dynamic Programming for Multi-intersections Traffic Signal Intelligent Control , 2008, 2008 11th International IEEE Conference on Intelligent Transportation Systems.

[22]  Jan van der Wal,et al.  AN MDP DECOMPOSITION APPROACH FOR TRAFFIC CONTROL AT ISOLATED SIGNALIZED INTERSECTIONS , 2008, Probability in the Engineering and Informational Sciences.

[23]  Bart De Schutter,et al.  A Comprehensive Survey of Multiagent Reinforcement Learning , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[24]  Warren B. Powell,et al.  Approximate Dynamic Programming - Solving the Curses of Dimensionality , 2007 .

[25]  Dipti Srinivasan,et al.  Neural Networks for Real-Time Traffic Signal Control , 2006, IEEE Transactions on Intelligent Transportation Systems.

[26]  Wilfred W. Recker,et al.  Stochastic adaptive control model for traffic signal systems , 2006 .

[27]  Yu-Fai Fung,et al.  Coordinated road-junction traffic control by dynamic programming , 2005, IEEE Trans. Intell. Transp. Syst..

[28]  Baher Abdulhai,et al.  Real-Time Optimization for Adaptive Traffic Signal Control Using Genetic Algorithms , 2005, J. Intell. Transp. Syst..

[29]  Baher Abdulhai,et al.  Reinforcement learning for true adaptive traffic signal control , 2003 .

[30]  Justin A. Boyan,et al.  Technical Update: Least-Squares Temporal Difference Learning , 2002, Machine Learning.

[31]  Pitu B. Mirchandani,et al.  A REAL-TIME TRAFFIC SIGNAL CONTROL SYSTEM: ARCHITECTURE, ALGORITHMS, AND ANALYSIS , 2001 .

[32]  Nathan H. Gartner,et al.  Implementation of the OPAC adaptive control strategy in a traffic signal network , 2001, ITSC 2001. 2001 IEEE Intelligent Transportation Systems. Proceedings (Cat. No.01TH8585).

[33]  Andrew W. Moore,et al.  Gradient Descent for General Reinforcement Learning , 1998, NIPS.

[34]  John N. Tsitsiklis,et al.  Analysis of temporal-difference learning with function approximation , 1996, NIPS 1996.

[35]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[36]  T. Söderström,et al.  Instrumental variable methods for system identification , 1983 .

[37]  Jean-Loup Farges,et al.  THE PRODYN REAL TIME TRAFFIC ALGORITHM , 1983 .

[38]  Aleksandar Stevanovic,et al.  Adaptive Traffic Control Systems: Guidelines for Development of Functional Requirements , 2015 .

[39]  Ben Waterson,et al.  An automated signalized junction controller that learns strategies by temporal difference reinforcement learning , 2013, Eng. Appl. Artif. Intell..

[40]  Andrew G. Barto,et al.  Linear Least-Squares Algorithms for Temporal Difference Learning , 2005, Machine Learning.

[41]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[42]  Dirk Ormoneit,et al.  Kernel-Based Reinforcement Learning , 2004, Machine Learning.

[43]  Myungsoon Chang,et al.  Realizing Benefits of Adaptive Signal Control at an Isolated Intersection , 2002 .

[44]  Philip J Tarnoff,et al.  EVALUATION OF OPTIMIZED POLICIES FOR ADAPTIVE CONTROL STRATEGY , 1991 .

[45]  P R Lowrie,et al.  The Sydney coordinated adaptive traffic system - principles, methodology, algorithms , 1982 .

[46]  R D Bretherton,et al.  SCOOT-a Traffic Responsive Method of Coordinating Signals , 1981 .