Paper information - Policy iteration based feedback control - 字舞流文

Policy iteration based feedback control

Xi Chen | Xi-Ren Cao | Kan-Jian Zhang | Yan-Kai Xu

[1] Richard S. Sutton, et al. Reinforcement Learning, 1992, Handbook of Machine Learning.

[2] R. P. Marques, et al. Discrete-Time Markov Jump Linear Systems, 2004, IEEE Transactions on Automatic Control.

[3] Sean R. Eddy, et al. What is dynamic programming?, 2004, Nature Biotechnology.

[4] John N. Tsitsiklis, et al. Feature-based methods for large scale dynamic programming, 2004, Machine Learning.

[5] Xi-Ren Cao, et al. From Perturbation Analysis to Markov Decision Processes and Reinforcement Learning, 2003, Discret. Event Dyn. Syst.

[6] Xi-Ren Cao, et al. A unified approach to Markov decision problems and performance sensitivity analysis, 2000, at - Automatisierungstechnik.

[7] H. Kushner. Numerical Methods for Stochastic Control Problems in Continuous Time, 2000.

[8] O. Hernández-Lerma, et al. Discrete-time Markov control processes, 1999.

[9] O. Hernández-Lerma, et al. Further topics on discrete-time Markov control processes, 1999.

[10] Andrew G. Barto, et al. Reinforcement learning, 1998.

[11] Xi-Ren Cao, et al. Algorithms for sensitivity analysis of Markov systems through potentials and perturbation realization, 1998, IEEE Trans. Control. Syst. Technol.

[12] W. Fleming. Book Review: Discrete-time Markov control processes: Basic optimality criteria, 1997.

[13] P. Dupuis, et al. Numerical Methods in Stochastic Control, 1996.

[14] M. Fragoso, et al. Discrete-time LQ-optimal control problems for infinite Markov jump parameter systems, 1995, IEEE Trans. Autom. Control.

[15] Dimitri P. Bertsekas, et al. Dynamic Programming and Optimal Control, Two Volume Set, 1995.

[16] Martin L. Puterman, et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming, 1994.

[17] R. Durrett. Probability: Theory and Examples, 1993.

[18] Dimitri P. Bertsekas, et al. Dynamic Programming and Optimal Control, Vol. II, 1976.