Learning-based model predictive control for Markov decision processes
暂无分享,去创建一个
Bart De Schutter | Marco Wiering | Rudy R. Negenborn | Hans Hellendoorn | B. Schutter | M. Wiering | H. Hellendoorn | R. Negenborn
[1] Richard S. Sutton,et al. Learning to predict by the methods of temporal differences , 1988, Machine Learning.
[2] Sean R Eddy,et al. What is dynamic programming? , 2004, Nature Biotechnology.
[3] Jay H. Lee,et al. Model predictive control: past, present and future , 1999 .
[4] Richard S. Sutton,et al. Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.
[5] Alberto Bemporad,et al. The explicit linear quadratic regulator for constrained systems , 2003, Autom..
[6] Marco Wiering,et al. Multi-Agent Reinforcement Learning for Traffic Light control , 2000 .
[7] Marco Wiering,et al. Explorations in efficient reinforcement learning , 1999 .
[8] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..
[9] Jan M. Maciejowski,et al. Predictive control : with constraints , 2002 .
[10] Michael Kearns,et al. Bias-Variance Error Bounds for Temporal Difference Updates , 2000, COLT.
[11] A. Jadbabaie,et al. Stabilizing receding horizon control of nonlinear systems: a control Lyapunov function approach , 1999, Proceedings of the 1999 American Control Conference (Cat. No. 99CH36251).
[12] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .
[13] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .