Paper information - Distributed Reinforcement Learning using Bi-directional Decision Making for Controlling Multi-Stage Flow Systems

Demand for autonomous control of large-scale real-world systems has grown in recent years. Distributed reinforcement learning is attracting particular attention for the control of physical flow systems such as lifeline systems. In this paper, we introduce the Multi-Stage Flow System (MSFS) model as a new problem class. MSFS is a framework that can describe a variety of physical flow systems. Its features also make it effective for problems that conventional methods find difficult, such as balancing the system's objective against its constraints and handling constraints under uncertainty. We propose a new bi-directional decision-making algorithm that uses feasible action sets based on a least commitment strategy. We apply our method to the control of real sewerage systems. The simulation results show that only our method satisfies the permissible levels while attaining performance within the acceptance level.
