Distributed Reinforcement Learning using Bi-directional Decision Making for Controlling Multi-Stage Flow Systems
Shigenobu Kobayashi | Kei Aoki | Hajime Kimura | Akihiro Nagaiwa
[1] Richard S. Sutton, et al. Generalization in Reinforcement Learning: Successful Examples Using Sparse Coarse Coding, 1995, NIPS.
[2] Csaba Szepesvári, et al. Multi-criteria Reinforcement Learning, 1998, ICML.
[3] Craig Boutilier, et al. The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems, 1998, AAAI/IAAI.
[4] Thomas G. Dietterich. The MAXQ Method for Hierarchical Reinforcement Learning, 1998, ICML.
[5] Craig Boutilier, et al. Sequential Optimality and Coordination in Multiagent Systems, 1999, IJCAI.
[6] Kagan Tumer, et al. General principles of learning-based multi-agent systems, 1999, AGENTS '99.
[7] Andrew W. Moore, et al. Distributed Value Functions, 1999, ICML.
[8] Sakamoto Yoshiyuki, et al. Quasi-Optimization of Water Distribution Scheduling Based on GA, 2000.
[9] Peter Geibel, et al. Reinforcement Learning with Bounded Risk, 2001, ICML.
[10] Michail G. Lagoudakis, et al. Coordinated Reinforcement Learning, 2002, ICML.
[11] Shigenobu Kobayashi, et al. Adaptive Control of Sewerage Systems using Distributed Reinforcement Learning, 2003.
[12] Peter Dayan, et al. Q-learning, 1992, Machine Learning.
[13] Peter Dayan, et al. Technical Note: Q-Learning, 2004, Machine Learning.