Value function approximation and model predictive control
暂无分享,去创建一个
Yuval Tassa | Emanuel Todorov | Tom Erez | M. Johnson | Mingyuan Zhong | T. Erez | Yuval Tassa | E. Todorov | Mingyuan Zhong | M. Johnson | Mikala C. Johnson | M. Johnson | Tom Erez | Mingyuan Zhong
[1] Christopher G. Atkeson,et al. Using Local Trajectory Optimizers to Speed Up Global Optimization in Dynamic Programming , 1993, NIPS.
[2] Andrew G. Barto,et al. Learning to Act Using Real-Time Dynamic Programming , 1995, Artif. Intell..
[3] F. Allgöwer,et al. A quasi-infinite horizon nonlinear model predictive control scheme with guaranteed stability , 1997 .
[4] Jun Morimoto,et al. Nonparametric Representation of Policies and Value Functions: A Trajectory-Based Approach , 2002, NIPS.
[5] Arno Linnemann,et al. Toward infinite-horizon optimality in nonlinear model predictive control , 2002, IEEE Trans. Autom. Control..
[6] Stefan Schaal,et al. Incremental Online Learning in High Dimensions , 2005, Neural Computation.
[7] E. Todorov,et al. A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems , 2005, Proceedings of the 2005, American Control Conference, 2005..
[8] David Silver,et al. Combining online and offline knowledge in UCT , 2007, ICML '07.
[9] William D. Smart,et al. Receding Horizon Differential Dynamic Programming , 2007, NIPS.
[10] Michael Fink,et al. Online Learning of Search Heuristics , 2007, AISTATS.
[11] Emanuel Todorov,et al. Efficient computation of optimal actions , 2009, Proceedings of the National Academy of Sciences.
[12] Emanuel Todorov,et al. Eigenfunction approximation methods for linearly-solvable optimal control problems , 2009, 2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning.
[13] Hans Joachim Ferreau,et al. Efficient Numerical Methods for Nonlinear MPC and Moving Horizon Estimation , 2009 .
[14] E. Todorov,et al. Aggregation Methods for Lineary-Solvable Markov Decision Process , 2011 .
[15] Yuval Tassa,et al. Infinite-Horizon Model Predictive Control for Periodic Tasks with Contacts , 2011, Robotics: Science and Systems.
[16] E. Todorov,et al. Moving least-squares approximations for linearly-solvable stochastic optimal control problems , 2011 .
[17] Yuval Tassa,et al. Synthesis and stabilization of complex behaviors through online trajectory optimization , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[18] Yuval Tassa,et al. MuJoCo: A physics engine for model-based control , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.