Policy iteration based feedback control
暂无分享,去创建一个
Xi Chen | Xi-Ren Cao | Kan-Jian Zhang | Yan-Kai Xu | Xi-Ren Cao | X. Chen | Kanjian Zhang | Yan-Kai Xu
[1] Richard S. Sutton,et al. Reinforcement Learning , 1992, Handbook of Machine Learning.
[2] R. P. Marques,et al. Discrete-Time Markov Jump Linear Systems , 2004, IEEE Transactions on Automatic Control.
[3] Sean R Eddy,et al. What is dynamic programming? , 2004, Nature Biotechnology.
[4] John N. Tsitsiklis,et al. Feature-based methods for large scale dynamic programming , 2004, Machine Learning.
[5] Xi-Ren Cao,et al. From Perturbation Analysis to Markov Decision Processes and Reinforcement Learning , 2003, Discret. Event Dyn. Syst..
[6] Xi-Ren Cao,et al. A unified approach to Markov decision problems and performance sensitivity analysis , 2000, at - Automatisierungstechnik.
[7] H. Kushner. Numerical Methods for Stochastic Control Problems in Continuous Time , 2000 .
[8] O. Hernández-Lerma,et al. Discrete-time Markov control processes , 1999 .
[9] O. Hernández-Lerma,et al. Further topics on discrete-time Markov control processes , 1999 .
[10] Andrew G. Barto,et al. Reinforcement learning , 1998 .
[11] Xi-Ren Cao,et al. Algorithms for sensitivity analysis of Markov systems through potentials and perturbation realization , 1998, IEEE Trans. Control. Syst. Technol..
[12] W. Fleming. Book Review: Discrete-time Markov control processes: Basic optimality criteria , 1997 .
[13] P. Dupuis,et al. Numerical Methods in Stochastic Control. , 1996 .
[14] M. Fragoso,et al. Discrete-time LQ-optimal control problems for infinite Markov jump parameter systems , 1995, IEEE Trans. Autom. Control..
[15] Dimitri P. Bertsekas,et al. Dynamic Programming and Optimal Control, Two Volume Set , 1995 .
[16] Dimitri P. Bertsekas,et al. Dynamic Programming and Optimal Control, Two Volume Set , 1995 .
[17] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .
[18] R. Durrett. Probability: Theory and Examples , 1993 .
[19] Dimitri P. Bertsekas,et al. Dynamic Programming and Optimal Control, Vol. II , 1976 .