A Hybrid Learning Strategy for Real Hardware of Swing-Up Pendulum
[1] Kazunobu Yoshida, et al. Swing-up control of an inverted pendulum by energy-based methods, 1999, Proceedings of the 1999 American Control Conference (Cat. No. 99CH36251).
[2] Kenji Doya, et al. Efficient Nonlinear Control with Actor-Tutor Architecture, 1996, NIPS.
[3] Richard S. Sutton, et al. Reinforcement Learning: An Introduction, 1998, IEEE Trans. Neural Networks.
[4] Katsuhisa Furuta, et al. Swinging up a pendulum by energy control, 1996, Autom.
[5] Masami Iwase, et al. Time Optimal Swing-Up Control of Single Pendulum, 2001.
[6] Shingo Nakamura, et al. Crossing the reality gap for a swing-up pendulum, 2006.
[7] Richard S. Sutton, et al. Neuronlike adaptive elements that can solve difficult learning control problems, 1983, IEEE Transactions on Systems, Man, and Cybernetics.
[8] Mitsuo Kawato, et al. Multiple Model-Based Reinforcement Learning, 2002, Neural Computation.
[9] Keiichiro Hoashi, et al. Humanoid Robots in Waseda University—Hadaly-2 and WABIAN, 2002, Auton. Robots.
[10] Ryo Saegusa, et al. Nonlinear principal component analysis to preserve the order of principal components, 2003, Neurocomputing.
[11] M. Bugeja, et al. Non-linear swing-up and stabilizing control of an inverted pendulum system, 2003, The IEEE Region 8 EUROCON 2003. Computer as a Tool.
[12] Shuji Hashimoto, et al. A learning strategy using simulator for real hardware of swing-up pendulum, 2006.