Multiple model-based reinforcement learning for nonlinear control
暂无分享,去创建一个
Mitsuo Kawato | Kenji Doya | Kazuyuki Samejima | Ken'Ichi Katagiri | K. Doya | M. Kawato | K. Samejima | Ken-ichi Katagiri
[1] Gerald Tesauro,et al. Practical Issues in Temporal Difference Learning , 1992, Mach. Learn..
[2] Long Ji Lin,et al. Self-improving reactive agents based on reinforcement learning, planning and teaching , 1992, Machine Learning.
[3] Richard S. Sutton,et al. A Menu of Designs for Reinforcement Learning Over Time , 1995 .
[4] Jun Morimoto,et al. Hierarchical Reinforcement Learning of Low-Dimensional Subgoals and High-Dimensional Trajectories , 1998, ICONIP.
[5] BRENDAN O. MCGONIGLE,et al. Long-term retention of single and multistate prismatic adaptation by humans , 1978, Nature.
[6] Geoffrey E. Hinton,et al. Adaptive Mixtures of Local Experts , 1991, Neural Computation.
[7] D. Wolpert,et al. Internal models in the cerebellum , 1998, Trends in Cognitive Sciences.
[8] Andrew G. Barto,et al. Improving Elevator Performance Using Reinforcement Learning , 1995, NIPS.
[9] Klaus-Robert Müller,et al. Annealed Competition of Experts for a Segmentation and Classification of Switching Dynamics , 1996, Neural Computation.
[10] Andrew G. Barto,et al. Reinforcement learning , 1998 .
[11] Kenji Doya,et al. Reinforcement Learning in Continuous Time and Space , 2000, Neural Computation.
[12] John Moody,et al. Fast Learning in Networks of Locally-Tuned Processing Units , 1989, Neural Computation.
[13] Kumpati S. Narendra,et al. Adaptation and learning using multiple models, switching, and tuning , 1995 .
[14] Jun Morimoto,et al. Conference on Intelligent Robots and Systems Reinforcement Le,arning of Dynamic Motor Sequence: Learning to Stand Up , 2022 .
[15] Mitsuo Kawato,et al. Multiple Paired Forward-Inverse Models for Human Motor Learning and Control , 1998, NIPS.
[16] Geoffrey E. Hinton,et al. Feudal Reinforcement Learning , 1992, NIPS.
[17] D M Wolpert,et al. Multiple paired forward and inverse models for motor control , 1998, Neural Networks.
[18] Christopher G. Atkeson,et al. Constructive Incremental Learning from Only Local Information , 1998, Neural Computation.
[19] Stefano Nolfi,et al. Learning to perceive the world as articulated: an approach for hierarchical learning in sensory-motor systems , 1998, Neural Networks.
[20] W. Fleming,et al. Controlled Markov processes and viscosity solutions , 1992 .
[21] Chen K. Tham,et al. Reinforcement learning of multiple tasks using a hierarchical CMAC architecture , 1995, Robotics Auton. Syst..
[22] Naonori Ueda,et al. Deterministic Annealing Variant of the EM Algorithm , 1994, NIPS.
[23] Richard S. Sutton,et al. Planning by Incremental Dynamic Programming , 1991, ML.