论文信息 - Hyper-cubic Function Approximation for Reinforcement Learning Based on Autonomous-Decentralized Algorithm - 字舞流文

Hyper-cubic Function Approximation for Reinforcement Learning Based on Autonomous-Decentralized Algorithm

Hideo Yuasa | Shigeyuki Hosoe | Yuichi Kobayashi | S. Hosoe | H. Yuasa | Yuichi Kobayashi

[1] Thomas Martinetz,et al. Topology representing networks , 1994, Neural Networks.

[2] Kenji Doya,et al. Reinforcement Learning in Continuous Time and Space , 2000, Neural Computation.

[3] H. Yuasa,et al. Self-organizing system theory by use of reaction-diffusion equation on a graph with boundary , 1999, IEEE SMC'99 Conference Proceedings. 1999 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.99CH37028).

[4] Christopher G. Atkeson,et al. Constructive Incremental Learning from Only Local Information , 1998, Neural Computation.

[5] Jun Morimoto,et al. Conference on Intelligent Robots and Systems Reinforcement Le,arning of Dynamic Motor Sequence: Learning to Stand Up , 2022 .

[6] Andrew W. Moore,et al. Locally Weighted Learning for Control , 1997, Artificial Intelligence Review.

[7] Gerald Tesauro,et al. Temporal difference learning and TD-Gammon , 1995, CACM.

[8] Minoru Asada,et al. Action-based sensor space categorization for robot learning , 1996, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS '96.

[9] James S. Albus,et al. New Approach to Manipulator Control: The Cerebellar Model Articulation Controller (CMAC)1 , 1975 .

[10] John N. Tsitsiklis,et al. Analysis of temporal-difference learning with function approximation , 1996, NIPS 1996.

[11] Andrew W. Moore,et al. Generalization in Reinforcement Learning: Safely Approximating the Value Function , 1994, NIPS.

[12] Hiroshi Ishiguro,et al. Robot oriented state space construction , 1996, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS '96.

[13] Minoru Asada,et al. Reasonable performance in less learning time by real robot based on incremental state space segmentation , 1996, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS '96.

[14] Xin Wang,et al. Stabilizing Value Function Approximation with the BFBP Algorithm , 2001, NIPS.

[15] Andrew W. Moore,et al. Variable Resolution Discretization in Optimal Control , 2002, Machine Learning.

[16] Andrew G. Barto,et al. Learning to Act Using Real-Time Dynamic Programming , 1995, Artif. Intell..

[17] Takashi Omori,et al. Adaptive State Space Formation in Reinforcement Learning , 1998, ICONIP.

[18] A. Harry Klopf,et al. Reinforcement Learning Applied to a Differential Game , 1995, Adapt. Behav..