Paper Information - Hierarchical Reinforcement Learning and Parallel Computing Applied to the k-server Problem

Hierarchical Reinforcement Learning and Parallel Computing Applied to the k-server Problem

This paper proposes an algorithm based on Hierarchical Reinforcement Learning (HRL) and Parallel Computing to solve an online computing problem, the K-Server Problem (KSP). The size of the storage structure that reinforcement learning uses to obtain the optimal policy grows exponentially with the number of states and actions, which limits its use to smaller problems because of the curse of dimensionality. The problem is modeled as a multi-step decision process computed in parallel: the Q-learning algorithm is applied to obtain optimal policies on a reduced number of nodes produced by a clustering process. The results show that the proposed method is applicable to large real-world problems.
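To make the storage issue concrete, the following is a minimal sketch of flat tabular Q-learning on a tiny k-server instance. It is not the paper's hierarchical or parallel algorithm; it only illustrates the baseline whose table size motivates clustering. The function names, the hyperparameters, the uniform request model, and the restriction of requests to uncovered nodes are all assumptions made for illustration.

```python
import math
import random


def table_entries(n, k):
    """Q-table entries under the assumptions of this sketch:
    C(n, k) server configurations x (n - k) uncovered requests x k actions."""
    return math.comb(n, k) * (n - k) * k


def q_learning_k_server(dist, k, episodes=500, steps=50,
                        alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning that minimizes the discounted movement cost.

    State  : (sorted tuple of server nodes, requested node)
    Action : index of the server that moves to the requested node
    Cost   : distance travelled by that server
    """
    rng = random.Random(seed)
    n = len(dist)
    q = {}  # (config, request) -> list of k estimated costs

    def row(config, request):
        return q.setdefault((config, request), [0.0] * k)

    def sample_request(config):
        # Assumption: requests arrive uniformly at nodes without a server.
        return rng.choice([v for v in range(n) if v not in config])

    for _ in range(episodes):
        config = tuple(range(k))
        request = sample_request(config)
        for _ in range(steps):
            values = row(config, request)
            # Epsilon-greedy choice; greedy means the lowest estimated cost.
            if rng.random() < epsilon:
                a = rng.randrange(k)
            else:
                a = min(range(k), key=values.__getitem__)
            cost = dist[config[a]][request]
            nxt = tuple(sorted(config[:a] + (request,) + config[a + 1:]))
            nxt_request = sample_request(nxt)
            # Q-learning update with a min over next actions (costs, not rewards).
            target = cost + gamma * min(row(nxt, nxt_request))
            values[a] += alpha * (target - values[a])
            config, request = nxt, nxt_request
    return q


# Five nodes on a line, two servers.
line = [[abs(i - j) for j in range(5)] for i in range(5)]
q_table = q_learning_k_server(line, k=2)
print(len(q_table), "visited states;", table_entries(5, 2), "table entries at most")
```

Under these assumptions the table already needs `table_entries(50, 5)` = 476,721,000 entries for 50 nodes and 5 servers, which is the kind of growth that running Q-learning on a reduced set of clustered nodes is meant to avoid.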

Adrião Duarte Dória Neto | Jorge Dantas de Melo | Mademerson Leandro da Costa | Carlos Padilha
