Integral Reinforcement Learning for online computation of feedback Nash strategies of nonzero-sum differential games
暂无分享,去创建一个
[1] Hiroaki Mukaidani. Newton's method for solving cross-coupled sign-indefinite algebraic Riccati equations for weakly coupled large-scale systems , 2007, Appl. Math. Comput..
[2] H. Abou-Kandil,et al. Necessary conditions for constant solutions of coupled Riccati equations in Nash games , 1993 .
[3] Hisham Abou-Kandil,et al. On global existence of solutions to coupled matrix Riccati equations in closed-loop Nash games , 1996, IEEE Trans. Autom. Control..
[4] John N. Tsitsiklis,et al. Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.
[5] T. Başar,et al. Dynamic Noncooperative Game Theory , 1982 .
[6] Hiroaki Mukaidani. Numerical computation of sign-indefinite linear quadratic differential games for weakly coupled large-scale systems , 2007, Int. J. Control.
[7] Y. Ho,et al. Nonzero-sum differential games , 1969 .
[8] Frank L. Lewis,et al. Adaptive optimal control for continuous-time linear systems based on policy iteration , 2009, Autom..
[9] Joe Brewer,et al. Kronecker products and matrix calculus in system theory , 1978 .
[10] Lyle Noakes,et al. Continuous-Time Adaptive Critics , 2007, IEEE Transactions on Neural Networks.
[11] M. Jungers,et al. Solving Coupled Algebraic Riccati Equations from Closed-Loop Nash Strategy in Discrete Time, by Lack of Trust Approach , 2008 .
[12] Jacob Engwerda,et al. LQ Dynamic Optimization and Differential Games , 2005 .
[13] Huaguang Zhang,et al. A New Approach to Solve a Class of Continuous-Time Nonlinear Quadratic Zero-Sum Game Using ADP , 2008, 2008 IEEE International Conference on Networking, Sensing and Control.
[14] Richard S. Sutton,et al. Learning to predict by the methods of temporal differences , 1988, Machine Learning.
[15] T.-Y. Li,et al. Lyapunov Iterations for Solving Coupled Algebraic Riccati Equations of Nash Differential Games and Algebraic Riccati Equations of Zero-Sum Games , 1995 .
[16] C. Watkins. Learning from delayed rewards , 1989 .
[17] Randal W. Beard,et al. Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation , 1997, Autom..
[18] Yacine Chitour,et al. A new algorithm for solving coupled algebraic Riccati equations , 2005, International Conference on Computational Intelligence for Modelling, Control and Automation and International Conference on Intelligent Agents, Web Technologies and Internet Commerce (CIMCA-IAWTIC'06).
[19] D. Kleinman. On an iterative technique for Riccati equation computations , 1968 .