A Residual Gradient Fuzzy Reinforcement Learning Algorithm for Differential Games

[1]  Dongbin Zhao,et al.  Online reinforcement learning control by Bayesian inference , 2016 .

[2]  Dongbin Zhao,et al.  Using reinforcement learning techniques to solve continuous-time non-linear optimal tracking problem without system dynamics , 2016 .

[3]  Howard M. Schwartz,et al.  A Decentralized Fuzzy Learning Algorithm for Pursuit-Evasion Differential Games with Superior Evaders , 2016, J. Intell. Robotic Syst..

[4]  Huaguang Zhang,et al.  Optimal tracking control for completely unknown nonlinear discrete-time Markov jump systems using data-based reinforcement learning method , 2016, Neurocomputing.

[5]  Howard M. Schwartz,et al.  A fuzzy reinforcement learning algorithm using a predictor for pursuit-evasion games , 2016, 2016 Annual IEEE Systems Conference (SysCon).

[6]  Daniel Sánchez,et al.  Fuzzy frameworks for mining data associations: fuzzy association rules and beyond , 2016, WIREs Data Mining Knowl. Discov..

[7]  Shaocheng Tong,et al.  Fuzzy Approximation-Based Adaptive Backstepping Optimal Control for a Class of Nonlinear Discrete-Time Systems With Dead-Zone , 2016, IEEE Transactions on Fuzzy Systems.

[8]  Tingwen Huang,et al.  Reinforcement learning solution for HJB equation arising in constrained optimal control problem , 2015, Neural Networks.

[9]  Howard M. Schwartz,et al.  The residual gradient FACL algorithm for differential games , 2015, 2015 IEEE 28th Canadian Conference on Electrical and Computer Engineering (CCECE).

[10]  Han-Xiong Li,et al.  Adaptive Optimal Control of Highly Dissipative Nonlinear Spatially Distributed Processes With Neuro-Dynamic Programming , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[11]  Howard M. Schwartz,et al.  Multi-Agent Machine Learning: A Reinforcement Approach , 2014 .

[12]  Han-Xiong Li,et al.  Data-based Suboptimal Neuro-control Design with Reinforcement Learning for Dissipative Spatially Distributed Processes , 2014 .

[13]  Tingwen Huang,et al.  Off-Policy Reinforcement Learning for $ H_\infty $ Control Design , 2013, IEEE Transactions on Cybernetics.

[14]  Rushikesh Kamalapurkar,et al.  Concurrent learning-based approximate optimal regulation , 2013, 52nd IEEE Conference on Decision and Control.

[15]  Huai-Ning Wu,et al.  Neural Network Based Online Simultaneous Policy Update Algorithm for Solving the HJI Equation in Nonlinear $H_{\infty}$ Control , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[16]  Frank L. Lewis,et al.  Optimal Adaptive Control and Differential Games by Reinforcement Learning Principles , 2012 .

[17]  Howard M. Schwartz,et al.  Q(λ)‐learning adaptive fuzzy logic controllers for pursuit–evasion differential games , 2011 .

[18]  Uzay Kaymak,et al.  Systems Control With Generalized Probabilistic Fuzzy-Reinforcement Learning , 2011, IEEE Transactions on Fuzzy Systems.

[19]  Sidney Nascimento Givigi,et al.  A Reinforcement Learning Adaptive Fuzzy Controller for Differential Games , 2010, J. Intell. Robotic Syst..

[20]  Howard M. Schwartz,et al.  Hybrid intelligent systems applied to the pursuit-evasion game , 2009, 2009 IEEE International Conference on Systems, Man and Cybernetics.

[21]  Andrea Bonarini,et al.  Reinforcement distribution in fuzzy Q-learning , 2009, Fuzzy Sets Syst..

[22]  Senén Barro,et al.  Autonomous and fast robot learning through motivation , 2007, Robotics Auton. Syst..

[23]  Xuesong Wang,et al.  A fuzzy Actor-Critic reinforcement learning network , 2007, Inf. Sci..

[24]  Salim Labiod,et al.  Adaptive fuzzy control of a class of SISO nonaffine nonlinear systems , 2007, Fuzzy Sets Syst..

[25]  Senén Barro,et al.  Design of a fuzzy controller in mobile robotics using genetic algorithms , 2007, Appl. Soft Comput..

[26]  Steven M. LaValle,et al.  Planning algorithms , 2006 .

[27]  Ferhat Daldaban,et al.  Phase inductance estimation for switched reluctance motor using adaptive neuro-fuzzy inference system , 2006 .

[28]  Chi-Kwong Li,et al.  An approach to tune fuzzy controllers based on reinforcement learning for autonomous vehicle control , 2005, IEEE Transactions on Intelligent Transportation Systems.

[29]  Hugh F. Durrant-Whyte,et al.  A time-optimal control strategy for pursuit-evasion games problems , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[30]  Toshiyuki Kondo,et al.  A reinforcement learning with evolutionary state recruitment strategy for autonomous mobile robots control , 2003, Robotics Auton. Syst..

[31]  Artur Merke,et al.  TD(0) Converges Provably Faster than the Residual Gradient Algorithm , 2003, ICML.

[32]  N. H. C. Yung,et al.  A fuzzy controller with supervised learning assisted reinforcement learning algorithm for obstacle avoidance , 2003, IEEE Trans. Syst. Man Cybern. Part B.

[33]  Leslie Pack Kaelbling,et al.  Effective reinforcement learning for mobile robots , 2002, Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292).

[34]  Manuela M. Veloso,et al.  Multiagent learning using a variable learning rate , 2002, Artif. Intell..

[35]  S. Micera,et al.  Adaptive fuzzy control of electrically stimulated muscles for arm movements , 1999, Medical & Biological Engineering & Computing.

[36]  Ebrahim H. Mamdani,et al.  An Experiment in Linguistic Synthesis with a Fuzzy Logic Controller , 1999, Int. J. Man Mach. Stud..

[37]  John W. Sheppard,et al.  Colearning in Differential Games , 1998, Machine Learning.

[38]  Lionel Jouffe,et al.  Fuzzy inference system learning by reinforcement methods , 1998, IEEE Trans. Syst. Man Cybern. Part C.

[39]  Robert Babuska,et al.  Adaptive fuzzy control of satellite attitude by reinforcement learning , 1998, IEEE Trans. Fuzzy Syst..

[40]  E. Mizutani,et al.  Neuro-Fuzzy and Soft Computing-A Computational Approach to Learning and Machine Intelligence [Book Review] , 1997, IEEE Transactions on Automatic Control.

[41]  John N. Tsitsiklis,et al.  Analysis of temporal-difference learning with function approximation , 1996, NIPS 1996.

[42]  Li-Xin Wang,et al.  A Course In Fuzzy Systems and Control , 1996 .

[43]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[44]  Leemon C. Baird,et al.  Residual Algorithms: Reinforcement Learning with Function Approximation , 1995, ICML.

[45]  Terrence J. Sejnowski,et al.  TD(λ) Converges with Probability 1 , 1994, Machine Learning.

[46]  Peter Dayan,et al.  The convergence of TD(λ) for general λ , 1992, Machine Learning.

[47]  M. Sugeno,et al.  Structure identification of fuzzy model , 1988 .

[48]  Richard S. Sutton,et al.  Learning to predict by the methods of temporal differences , 1988, Machine Learning.

[49]  Richard S. Sutton,et al.  Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[50]  R. Kruse,et al.  Fuzzy Control , 2015, Handbook of Computational Intelligence.

[51]  Frank L. Lewis,et al.  Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems , 2014, Autom..

[52]  Howard M. Schwartz,et al.  Self-learning fuzzy logic controllers for pursuit-evasion differential games , 2011, Robotics Auton. Syst..

[53]  H.K. Lam,et al.  Fuzzy controller with stability and performance rules for nonlinear systems , 2007, Fuzzy Sets Syst..

[54]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[55]  P. Varaiya,et al.  Differential Games , 1994 .

[56]  Hani Hagras,et al.  Learning and adaptation of an intelligent mobile robot navigator operating in unstructured environment based on a novel online Fuzzy-Genetic system , 2004, Fuzzy Sets Syst..

[57]  B. Silvano Zanutto,et al.  Learning Obstacle Avoidance with an Operant Behavior Model , 2004, Artificial Life.

[58]  Geoffrey J. Gordon Reinforcement Learning with Function Approximation Converges to a Region , 2000, NIPS.

[59]  Andrew W. Moore,et al.  Generalization in Reinforcement Learning: Safely Approximating the Value Function , 1994, NIPS.

[60]  Michael I. Jordan,et al.  On the Convergence of Stochastic Iterative Dynamic Programming Algorithms , 1994, Neural Computation.

[61]  P. Dayan,et al.  TD ( X ) Converges with Probability 1 , 1994 .

[62]  Michio Sugeno,et al.  Fuzzy identification of systems and its applications to modeling and control , 1985, IEEE Transactions on Systems, Man, and Cybernetics.