Neural Network-Based Solutions for Stochastic Optimal Control Using Path Integrals
暂无分享,去创建一个
Karthikeyan Rajagopal | Jerome R. Busemeyer | Sivasubramanya Nadar Balakrishnan | S. Balakrishnan | J. Busemeyer | K. Rajagopal
[1] Paul J. Werbos,et al. Approximate dynamic programming for real-time control and neural modeling , 1992 .
[2] Warren E. Dixon,et al. Model-based reinforcement learning for approximate optimal regulation , 2016, Autom..
[3] Frank L. Lewis,et al. Guest Editorial: Special Issue on Adaptive Dynamic Programming and Reinforcement Learning in Feedback Control , 2008, IEEE Trans. Syst. Man Cybern. Part B.
[4] D. Kleinman. On an iterative technique for Riccati equation computations , 1968 .
[5] Jinghao Zhu,et al. On stochastic Riccati equations for the stochastic LQR problem , 2005, Syst. Control. Lett..
[6] G. Saridis,et al. On Successive Approximation of Optimal Control of Stochastic Dynamic Systems , 2005 .
[7] S. Shreve,et al. Stochastic differential equations , 1955, Mathematical Proceedings of the Cambridge Philosophical Society.
[8] Sean P. Meyn,et al. Q-learning and Pontryagin's Minimum Principle , 2009, Proceedings of the 48h IEEE Conference on Decision and Control (CDC) held jointly with 2009 28th Chinese Control Conference.
[9] Jae Young Lee,et al. Approximate dynamic programming for continuous-time linear quadratic regulator problems: relaxation of known input-coupling matrix assumption , 2012 .
[10] Randa Herzallah. Probabilistic DHP adaptive critic for nonlinear stochastic control systems , 2013, Neural Networks.
[11] Frank L. Lewis,et al. Online actor critic algorithm to solve the continuous-time infinite horizon optimal control problem , 2009, 2009 International Joint Conference on Neural Networks.
[12] S. N. Balakrishnan,et al. Adaptive Critic-Based Neural Networks for Agile Missile Control , 1998 .
[13] S. N. Balakrishnan,et al. Adaptive-critic based neural networks for aircraft optimal control , 1996 .
[14] G. Sohie,et al. Generalization of the matrix inversion lemma , 1986, Proceedings of the IEEE.
[15] W. K. Hastings,et al. Monte Carlo Sampling Methods Using Markov Chains and Their Applications , 1970 .
[16] X. Zhou,et al. Stochastic Controls: Hamiltonian Systems and HJB Equations , 1999 .
[17] Hilbert J. Kappen,et al. Graphical Model Inference in Optimal Control of Stochastic Multi-Agent Systems , 2008, J. Artif. Intell. Res..
[18] K. Morris,et al. Iterative Solution of Algebraic Riccati Equations for Damped Systems , 2006, Proceedings of the 45th IEEE Conference on Decision and Control.
[19] Frank L. Lewis,et al. Continuous-Time Q-Learning for Infinite-Horizon Discounted Cost Linear Quadratic Regulator Problems , 2015, IEEE Transactions on Cybernetics.
[20] Paul J. Werbos,et al. Backpropagation Through Time: What It Does and How to Do It , 1990, Proc. IEEE.
[21] John N. Tsitsiklis,et al. Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.
[22] George M. Siouris,et al. Applied Optimal Control: Optimization, Estimation, and Control , 1979, IEEE Transactions on Systems, Man, and Cybernetics.
[23] H. Kappen. Path integrals and symmetry breaking for optimal control theory , 2005, physics/0505066.
[24] Dimitri P. Bertsekas,et al. Dynamic Programming and Optimal Control, Two Volume Set , 1995 .
[25] Weiwei Li,et al. An Iterative Optimal Control and Estimation Design for Nonlinear Stochastic System , 2006, Proceedings of the 45th IEEE Conference on Decision and Control.
[26] M. B. McFarland,et al. Robust adaptive control using single-hidden-layer feedforward neural networks , 1999, Proceedings of the 1999 American Control Conference (Cat. No. 99CH36251).
[27] Warren B. Powell,et al. Handbook of Learning and Approximate Dynamic Programming , 2006, IEEE Transactions on Automatic Control.
[28] Frank L. Lewis,et al. Adaptive optimal control for continuous-time linear systems based on policy iteration , 2009, Autom..
[29] Metropolis Methods for Quantum Monte Carlo Simulations , 2003, physics/0306182.
[30] J. Janssen,et al. Deterministic and Stochastic Optimal Control , 2013 .
[31] J. Mount. Importance Sampling , 2005 .
[32] Radhakant Padhi,et al. A single network adaptive critic (SNAC) architecture for optimal control synthesis for a class of nonlinear systems , 2006, Neural Networks.
[33] Chris Watkins,et al. Learning from delayed rewards , 1989 .
[34] Hao Xu,et al. Stochastic Optimal Controller Design for Uncertain Nonlinear Networked Control System via Neuro Dynamic Programming , 2013, IEEE Transactions on Neural Networks and Learning Systems.
[35] Evangelos A. Theodorou,et al. Iterative path integral stochastic optimal control: Theory and applications to motor control , 2011 .