暂无分享,去创建一个
[1] Sebastian Ruder,et al. An overview of gradient descent optimization algorithms , 2016, Vestnik komp'iuternykh i informatsionnykh tekhnologii.
[2] Frank L. Lewis,et al. Online actor critic algorithm to solve the continuous-time infinite horizon optimal control problem , 2009, 2009 International Joint Conference on Neural Networks.
[3] Frank L. Lewis,et al. Aircraft Control and Simulation , 1992 .
[4] Yang Xiong,et al. Adaptive Dynamic Programming with Applications in Optimal Control , 2017 .
[5] Francesco Borrelli,et al. Kinematic and dynamic vehicle models for autonomous driving control design , 2015, 2015 IEEE Intelligent Vehicles Symposium (IV).
[6] A. M. Lyapunov. The general problem of the stability of motion , 1992 .
[7] S. Lyashevskiy. Constrained optimization and control of nonlinear systems: new results in optimal control , 1996, Proceedings of 35th IEEE Conference on Decision and Control.
[8] Pieter Abbeel,et al. Benchmarking Deep Reinforcement Learning for Continuous Control , 2016, ICML.
[9] Frank L. Lewis,et al. Discrete-Time Nonlinear HJB Solution Using Approximate Dynamic Programming: Convergence Proof , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).
[10] Warren B. Powell,et al. Approximate Dynamic Programming - Solving the Curses of Dimensionality , 2007 .
[11] Stefan Schaal,et al. Natural Actor-Critic , 2003, Neurocomputing.
[12] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[13] Derong Liu,et al. Online approximate optimal control for affine non-linear systems with unknown internal dynamics using adaptive dynamic programming , 2014 .
[14] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.
[15] Draguna Vrabie,et al. Adaptive optimal controllers based on Generalized Policy Iteration in a continuous-time framework , 2009, 2009 17th Mediterranean Conference on Control and Automation.
[16] Frank L. Lewis,et al. Adaptive optimal control for continuous-time linear systems based on policy iteration , 2009, Autom..
[17] Frank L. Lewis,et al. Adaptive Optimal Control of Unknown Constrained-Input Systems Using Policy Iteration and Neural Networks , 2013, IEEE Transactions on Neural Networks and Learning Systems.
[18] Ronald A. Howard,et al. Dynamic Programming and Markov Processes , 1960 .
[19] Dimitri P. Bertsekas,et al. Dynamic Programming and Optimal Control, Two Volume Set , 1995 .
[20] Frank L. Lewis,et al. Optimal Control , 1986 .
[21] Frank L. Lewis,et al. Off-Policy Reinforcement Learning for Synchronization in Multiagent Graphical Games , 2017, IEEE Transactions on Neural Networks and Learning Systems.
[22] Pieter Abbeel,et al. Apprenticeship learning via inverse reinforcement learning , 2004, ICML.
[23] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.
[24] Liwei Wang,et al. Gradient Descent Finds Global Minima of Deep Neural Networks , 2018, ICML.
[25] Yanan Li,et al. Driver-automation indirect shared control of highly automated vehicles with intention-aware authority transition , 2017, 2017 IEEE Intelligent Vehicles Symposium (IV).
[26] P. Werbos,et al. Beyond Regression : "New Tools for Prediction and Analysis in the Behavioral Sciences , 1974 .
[27] Kurt Hornik,et al. Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks , 1990, Neural Networks.
[28] Frank L. Lewis,et al. Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach , 2005, Autom..
[29] Sergey Levine,et al. Guided Policy Search , 2013, ICML.
[30] Huaguang Zhang,et al. Adaptive Dynamic Programming: An Introduction , 2009, IEEE Computational Intelligence Magazine.
[31] Warren E. Dixon,et al. Model-based reinforcement learning for approximate optimal regulation , 2016, Autom..
[32] Kenji Doya,et al. Reinforcement Learning in Continuous Time and Space , 2000, Neural Computation.
[33] Guigang Zhang,et al. Deep Learning , 2016, Int. J. Semantic Comput..
[34] John N. Tsitsiklis,et al. Neuro-dynamic programming: an overview , 1995, Proceedings of 1995 34th IEEE Conference on Decision and Control.
[35] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[36] Paul J. Werbos,et al. Approximate dynamic programming for real-time control and neural modeling , 1992 .
[37] Yuanzhi Li,et al. A Convergence Theory for Deep Learning via Over-Parameterization , 2018, ICML.
[38] Zhong-Ping Jiang,et al. Global Adaptive Dynamic Programming for Continuous-Time Nonlinear Systems , 2013, IEEE Transactions on Automatic Control.
[39] Robert Gardner,et al. Introduction To Real Analysis , 1994 .
[40] R. Bellman,et al. Dynamic Programming and Markov Processes , 1960 .
[41] Frank L. Lewis,et al. Multi-agent differential graphical games , 2011, Proceedings of the 30th Chinese Control Conference.
[42] K. Vamvoudakis. Event-triggered optimal adaptive control algorithm for continuous-time nonlinear systems , 2014, IEEE/CAA Journal of Automatica Sinica.
[43] Benjamin Recht,et al. A Tour of Reinforcement Learning: The View from Continuous Control , 2018, Annu. Rev. Control. Robotics Auton. Syst..
[44] Haibo He,et al. Event-Triggered Adaptive Dynamic Programming for Continuous-Time Systems With Control Constraints , 2017, IEEE Transactions on Neural Networks and Learning Systems.
[45] A. Laub,et al. On the numerical solution of the discrete-time algebraic Riccati equation , 1980 .
[46] Guy Lever,et al. Deterministic Policy Gradient Algorithms , 2014, ICML.