Improving Gradient Estimation by Incorporating Sensor Data
暂无分享,去创建一个
[1] Peter Stone,et al. Machine Learning for Fast Quadrupedal Locomotion , 2004, AAAI.
[2] Gm Gero Walter,et al. Bayesian linear regression , 2009 .
[3] BRIAN A. Garner,et al. A Kinematic Model of the Upper Limb Based on the Visible Human Project (VHP) Image Dataset. , 1999, Computer methods in biomechanics and biomedical engineering.
[4] S. Shankar Sastry,et al. Autonomous Helicopter Flight via Reinforcement Learning , 2003, NIPS.
[5] Lex Weaver,et al. The Optimal Reward Baseline for Gradient-Based Reinforcement Learning , 2001, UAI.
[6] Daniel M. Wolpert,et al. Making smooth moves , 2022 .
[7] Peter L. Bartlett,et al. Infinite-Horizon Policy-Gradient Estimation , 2001, J. Artif. Intell. Res..
[8] Leonid Peshkin,et al. Learning from Scarce Experience , 2002, ICML.
[9] Zoubin Ghahramani,et al. Perspectives and problems in motor learning , 2001, Trends in Cognitive Sciences.
[10] Stefan Schaal,et al. Natural Actor-Critic , 2003, Neurocomputing.
[11] Peter L. Bartlett,et al. Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning , 2001, J. Mach. Learn. Res..
[12] Michael I. Jordan,et al. Optimal feedback control as a theory of motor coordination , 2002, Nature Neuroscience.
[13] Noah J. Cowan,et al. Efficient Gradient Estimation for Motor Control Learning , 2002, UAI.
[14] Michael I. Jordan,et al. PEGASUS: A policy search method for large MDPs and POMDPs , 2000, UAI.
[15] Christian R. Shelton,et al. Policy Improvement for POMDPs Using Normalized Importance Sampling , 2001, UAI.