论文信息 - Recursive least-squares temporal difference learning for adaptive traffic signal control at intersection - 字舞流文

Recursive least-squares temporal difference learning for adaptive traffic signal control at intersection

A. E. Moudni | M. Dridi | Biao Yin

[1] Li Li,et al. Traffic signal timing via deep reinforcement learning , 2016, IEEE/CAA Journal of Automatica Sinica.

[2] Abdellah El Moudni,et al. Forward search algorithm based on dynamic programming for real-time adaptive traffic signal control , 2015 .

[3] Abbas Khosravi,et al. A review on computational intelligence methods for controlling traffic signal timing , 2015, Expert Syst. Appl..

[4] Baher Abdulhai,et al. Design of Reinforcement Learning Parameters for Seamless Application of Adaptive Traffic Signal Control , 2014, J. Intell. Transp. Syst..

[5] MengChu Zhou,et al. Modular Design of Urban Traffic-Light Control Systems Based on Synchronized Timed Petri Nets , 2014, IEEE Transactions on Intelligent Transportation Systems.

[6] Mohamed A. Khamis,et al. Adaptive multi-objective reinforcement learning with hybrid exploration for traffic signal control based on cooperative multi-agent framework , 2014, Eng. Appl. Artif. Intell..

[7] Xin Xu,et al. Reinforcement learning algorithms with function approximation: Recent advances and applications , 2014, Inf. Sci..

[8] Dongbin Zhao,et al. Full-range adaptive cruise control based on supervised adaptive dynamic programming , 2014, Neurocomputing.

[9] Baher Abdulhai,et al. Multiagent Reinforcement Learning for Integrated Network of Adaptive Traffic Signal Controllers (MARLIN-ATSC): Methodology and Large-Scale Application on Downtown Toronto , 2013, IEEE Transactions on Intelligent Transportation Systems.

[10] Mohamed A. Khamis,et al. Enhanced multiagent multi-objective reinforcement learning for urban traffic light control , 2012, 2012 11th International Conference on Machine Learning and Applications.

[11] Frank L. Lewis,et al. Reinforcement learning and optimal adaptive control: An overview and implementation examples , 2012, Annu. Rev. Control..

[12] José García-Nieto,et al. Swarm intelligence for traffic light scheduling: Application to real urban areas , 2012, Eng. Appl. Artif. Intell..

[13] H. He,et al. Efficient Reinforcement Learning Using Recursive Least-Squares Methods , 2011, J. Artif. Intell. Res..

[14] Shalabh Bhatnagar,et al. Reinforcement Learning With Function Approximation for Traffic Signal Control , 2011, IEEE Transactions on Intelligent Transportation Systems.

[15] T. Urbanik,et al. Reinforcement learning-based multi-agent system for network traffic signal control , 2010 .

[16] Peter T. Martin,et al. Comparative Evaluation of Adaptive Traffic Control System Assessments Through Field and Microsimulation , 2010, J. Intell. Transp. Syst..

[17] Chen Cai,et al. Adaptive traffic signal control using approximate dynamic programming , 2009 .

[18] Ana L. C. Bazzan,et al. Opportunities for multiagent systems and multiagent reinforcement learning in traffic control , 2009, Autonomous Agents and Multi-Agent Systems.

[19] Huaguang Zhang,et al. Adaptive Dynamic Programming: An Introduction , 2009, IEEE Computational Intelligence Magazine.

[20] Abdellah El Moudni,et al. Discrete Methods for Urban Intersection Traffic Controlling , 2009, VTC Spring 2009 - IEEE 69th Vehicular Technology Conference.

[21] Tao Li,et al. Adaptive Dynamic Programming for Multi-intersections Traffic Signal Intelligent Control , 2008, 2008 11th International IEEE Conference on Intelligent Transportation Systems.

[22] Jan van der Wal,et al. AN MDP DECOMPOSITION APPROACH FOR TRAFFIC CONTROL AT ISOLATED SIGNALIZED INTERSECTIONS , 2008, Probability in the Engineering and Informational Sciences.

[23] Bart De Schutter,et al. A Comprehensive Survey of Multiagent Reinforcement Learning , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[24] Warren B. Powell,et al. Approximate Dynamic Programming - Solving the Curses of Dimensionality , 2007 .

[25] Dipti Srinivasan,et al. Neural Networks for Real-Time Traffic Signal Control , 2006, IEEE Transactions on Intelligent Transportation Systems.

[26] Wilfred W. Recker,et al. Stochastic adaptive control model for traffic signal systems , 2006 .

[27] Yu-Fai Fung,et al. Coordinated road-junction traffic control by dynamic programming , 2005, IEEE Trans. Intell. Transp. Syst..

[28] Baher Abdulhai,et al. Real-Time Optimization for Adaptive Traffic Signal Control Using Genetic Algorithms , 2005, J. Intell. Transp. Syst..

[29] Baher Abdulhai,et al. Reinforcement learning for true adaptive traffic signal control , 2003 .

[30] Justin A. Boyan,et al. Technical Update: Least-Squares Temporal Difference Learning , 2002, Machine Learning.

[31] Pitu B. Mirchandani,et al. A REAL-TIME TRAFFIC SIGNAL CONTROL SYSTEM: ARCHITECTURE, ALGORITHMS, AND ANALYSIS , 2001 .

[32] Nathan H. Gartner,et al. Implementation of the OPAC adaptive control strategy in a traffic signal network , 2001, ITSC 2001. 2001 IEEE Intelligent Transportation Systems. Proceedings (Cat. No.01TH8585).

[33] Andrew W. Moore,et al. Gradient Descent for General Reinforcement Learning , 1998, NIPS.

[34] John N. Tsitsiklis,et al. Analysis of temporal-difference learning with function approximation , 1996, NIPS 1996.

[35] Dimitri P. Bertsekas,et al. Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[36] T. Söderström,et al. Instrumental variable methods for system identification , 1983 .

[37] Jean-Loup Farges,et al. THE PRODYN REAL TIME TRAFFIC ALGORITHM , 1983 .

[38] Aleksandar Stevanovic,et al. Adaptive Traffic Control Systems: Guidelines for Development of Functional Requirements , 2015 .

[39] Ben Waterson,et al. An automated signalized junction controller that learns strategies by temporal difference reinforcement learning , 2013, Eng. Appl. Artif. Intell..

[40] Andrew G. Barto,et al. Linear Least-Squares Algorithms for Temporal Difference Learning , 2005, Machine Learning.

[41] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[42] Dirk Ormoneit,et al. Kernel-Based Reinforcement Learning , 2004, Machine Learning.

[43] Myungsoon Chang,et al. Realizing Benefits of Adaptive Signal Control at an Isolated Intersection , 2002 .

[44] Philip J Tarnoff,et al. EVALUATION OF OPTIMIZED POLICIES FOR ADAPTIVE CONTROL STRATEGY , 1991 .

[45] P R Lowrie,et al. The Sydney coordinated adaptive traffic system - principles, methodology, algorithms , 1982 .

[46] R D Bretherton,et al. SCOOT-a Traffic Responsive Method of Coordinating Signals , 1981 .