论文信息 - A Residual Gradient Fuzzy Reinforcement Learning Algorithm for Differential Games - 字舞流文

A Residual Gradient Fuzzy Reinforcement Learning Algorithm for Differential Games

Mostafa D. Awheda | H. Schwartz

[1] Dongbin Zhao,et al. Online reinforcement learning control by Bayesian inference , 2016 .

[2] Dongbin Zhao,et al. Using reinforcement learning techniques to solve continuous-time non-linear optimal tracking problem without system dynamics , 2016 .

[3] Howard M. Schwartz,et al. A Decentralized Fuzzy Learning Algorithm for Pursuit-Evasion Differential Games with Superior Evaders , 2016, J. Intell. Robotic Syst..

[4] Huaguang Zhang,et al. Optimal tracking control for completely unknown nonlinear discrete-time Markov jump systems using data-based reinforcement learning method , 2016, Neurocomputing.

[5] Howard M. Schwartz,et al. A fuzzy reinforcement learning algorithm using a predictor for pursuit-evasion games , 2016, 2016 Annual IEEE Systems Conference (SysCon).

[6] Daniel Sánchez,et al. Fuzzy frameworks for mining data associations: fuzzy association rules and beyond , 2016, WIREs Data Mining Knowl. Discov..

[7] Shaocheng Tong,et al. Fuzzy Approximation-Based Adaptive Backstepping Optimal Control for a Class of Nonlinear Discrete-Time Systems With Dead-Zone , 2016, IEEE Transactions on Fuzzy Systems.

[8] Tingwen Huang,et al. Reinforcement learning solution for HJB equation arising in constrained optimal control problem , 2015, Neural Networks.

[9] Howard M. Schwartz,et al. The residual gradient FACL algorithm for differential games , 2015, 2015 IEEE 28th Canadian Conference on Electrical and Computer Engineering (CCECE).

[10] Han-Xiong Li,et al. Adaptive Optimal Control of Highly Dissipative Nonlinear Spatially Distributed Processes With Neuro-Dynamic Programming , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[11] Howard M. Schwartz,et al. Multi-Agent Machine Learning: A Reinforcement Approach , 2014 .

[12] Han-Xiong Li,et al. Data-based Suboptimal Neuro-control Design with Reinforcement Learning for Dissipative Spatially Distributed Processes , 2014 .

[13] Tingwen Huang,et al. Off-Policy Reinforcement Learning for $ H_\infty $ Control Design , 2013, IEEE Transactions on Cybernetics.

[14] Rushikesh Kamalapurkar,et al. Concurrent learning-based approximate optimal regulation , 2013, 52nd IEEE Conference on Decision and Control.

[15] Huai-Ning Wu,et al. Neural Network Based Online Simultaneous Policy Update Algorithm for Solving the HJI Equation in Nonlinear $H_{\infty}$ Control , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[16] Frank L. Lewis,et al. Optimal Adaptive Control and Differential Games by Reinforcement Learning Principles , 2012 .

[17] Howard M. Schwartz,et al. Q(λ)‐learning adaptive fuzzy logic controllers for pursuit–evasion differential games , 2011 .

[18] Uzay Kaymak,et al. Systems Control With Generalized Probabilistic Fuzzy-Reinforcement Learning , 2011, IEEE Transactions on Fuzzy Systems.

[19] Sidney Nascimento Givigi,et al. A Reinforcement Learning Adaptive Fuzzy Controller for Differential Games , 2010, J. Intell. Robotic Syst..

[20] Howard M. Schwartz,et al. Hybrid intelligent systems applied to the pursuit-evasion game , 2009, 2009 IEEE International Conference on Systems, Man and Cybernetics.

[21] Andrea Bonarini,et al. Reinforcement distribution in fuzzy Q-learning , 2009, Fuzzy Sets Syst..

[22] Senén Barro,et al. Autonomous and fast robot learning through motivation , 2007, Robotics Auton. Syst..

[23] Xuesong Wang,et al. A fuzzy Actor-Critic reinforcement learning network , 2007, Inf. Sci..

[24] Salim Labiod,et al. Adaptive fuzzy control of a class of SISO nonaffine nonlinear systems , 2007, Fuzzy Sets Syst..

[25] Senén Barro,et al. Design of a fuzzy controller in mobile robotics using genetic algorithms , 2007, Appl. Soft Comput..

[26] Steven M. LaValle,et al. Planning algorithms , 2006 .

[27] Ferhat Daldaban,et al. Phase inductance estimation for switched reluctance motor using adaptive neuro-fuzzy inference system , 2006 .

[28] Chi-Kwong Li,et al. An approach to tune fuzzy controllers based on reinforcement learning for autonomous vehicle control , 2005, IEEE Transactions on Intelligent Transportation Systems.

[29] Hugh F. Durrant-Whyte,et al. A time-optimal control strategy for pursuit-evasion games problems , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[30] Toshiyuki Kondo,et al. A reinforcement learning with evolutionary state recruitment strategy for autonomous mobile robots control , 2003, Robotics Auton. Syst..

[31] Artur Merke,et al. TD(0) Converges Provably Faster than the Residual Gradient Algorithm , 2003, ICML.

[32] N. H. C. Yung,et al. A fuzzy controller with supervised learning assisted reinforcement learning algorithm for obstacle avoidance , 2003, IEEE Trans. Syst. Man Cybern. Part B.

[33] Leslie Pack Kaelbling,et al. Effective reinforcement learning for mobile robots , 2002, Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292).

[34] Manuela M. Veloso,et al. Multiagent learning using a variable learning rate , 2002, Artif. Intell..

[35] S. Micera,et al. Adaptive fuzzy control of electrically stimulated muscles for arm movements , 1999, Medical & Biological Engineering & Computing.

[36] Ebrahim H. Mamdani,et al. An Experiment in Linguistic Synthesis with a Fuzzy Logic Controller , 1999, Int. J. Man Mach. Stud..

[37] John W. Sheppard,et al. Colearning in Differential Games , 1998, Machine Learning.

[38] Lionel Jouffe,et al. Fuzzy inference system learning by reinforcement methods , 1998, IEEE Trans. Syst. Man Cybern. Part C.

[39] Robert Babuska,et al. Adaptive fuzzy control of satellite attitude by reinforcement learning , 1998, IEEE Trans. Fuzzy Syst..

[40] E. Mizutani,et al. Neuro-Fuzzy and Soft Computing-A Computational Approach to Learning and Machine Intelligence [Book Review] , 1997, IEEE Transactions on Automatic Control.

[41] John N. Tsitsiklis,et al. Analysis of temporal-difference learning with function approximation , 1996, NIPS 1996.

[42] Li-Xin Wang,et al. A Course In Fuzzy Systems and Control , 1996 .

[43] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[44] Leemon C. Baird,et al. Residual Algorithms: Reinforcement Learning with Function Approximation , 1995, ICML.

[45] Terrence J. Sejnowski,et al. TD(λ) Converges with Probability 1 , 1994, Machine Learning.

[46] Peter Dayan,et al. The convergence of TD(λ) for general λ , 1992, Machine Learning.

[47] M. Sugeno,et al. Structure identification of fuzzy model , 1988 .

[48] Richard S. Sutton,et al. Learning to predict by the methods of temporal differences , 1988, Machine Learning.

[49] Richard S. Sutton,et al. Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[50] R. Kruse,et al. Fuzzy Control , 2015, Handbook of Computational Intelligence.

[51] Frank L. Lewis,et al. Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems , 2014, Autom..

[52] Howard M. Schwartz,et al. Self-learning fuzzy logic controllers for pursuit-evasion differential games , 2011, Robotics Auton. Syst..

[53] H.K. Lam,et al. Fuzzy controller with stability and performance rules for nonlinear systems , 2007, Fuzzy Sets Syst..

[54] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[55] P. Varaiya,et al. Differential Games , 1994 .

[56] Hani Hagras,et al. Learning and adaptation of an intelligent mobile robot navigator operating in unstructured environment based on a novel online Fuzzy-Genetic system , 2004, Fuzzy Sets Syst..

[57] B. Silvano Zanutto,et al. Learning Obstacle Avoidance with an Operant Behavior Model , 2004, Artificial Life.

[58] Geoffrey J. Gordon. Reinforcement Learning with Function Approximation Converges to a Region , 2000, NIPS.

[59] Andrew W. Moore,et al. Generalization in Reinforcement Learning: Safely Approximating the Value Function , 1994, NIPS.

[60] Michael I. Jordan,et al. On the Convergence of Stochastic Iterative Dynamic Programming Algorithms , 1994, Neural Computation.

[61] P. Dayan,et al. TD ( X ) Converges with Probability 1 , 1994 .

[62] Michio Sugeno,et al. Fuzzy identification of systems and its applications to modeling and control , 1985, IEEE Transactions on Systems, Man, and Cybernetics.