论文信息 - Q-learning using fuzzified states and weighted actions and its application to omni-direnctional mobile robot control

Q-learning using fuzzified states and weighted actions and its application to omni-direnctional mobile robot control

The conventional Q-learning algorithm is described by a finite number of discretized states and discretized actions. When the system is represented in continuous domain, this may cause an abrupt transition of action as the state rapidly changes. To avoid this abrupt transition of action, the learning system requires fine-tuned states. However, the learning time significantly increases and the system becomes computationally expensive as the number of states increases. To solve this problem, this paper proposes a novel Q-learning algorithm, which uses fuzzified states and weighted actions to update its state-action value. By applying the concept of fuzzy set to the states of Q-learning and using the weighted actions, the agent efficiently responds to the rapid changes of the states. The proposed algorithm is applied to omni-directional mobile robot and the results demonstrate the effectiveness of the proposed approach.

Jong-Hwan Kim | In-Won Park | Dong-Hyun Lee

[1] P. Glorennec,et al. Fuzzy Q-learning , 1997, Proceedings of 6th International Fuzzy Systems Conference.

[2] Bart De Schutter,et al. Continuous-State Reinforcement Learning with Fuzzy Approximation , 2007, Adaptive Agents and Multi-Agents Systems.

[3] Nicholas Bambos,et al. A fuzzy reinforcement learning approach to power control in wireless transmitters , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[4] P. Y. Glorennec,et al. Fuzzy Q-learning and dynamical fuzzy Q-learning , 1994, Proceedings of 1994 IEEE 3rd International Fuzzy Systems Conference.

[5] H. R. Berenji,et al. Fuzzy Q-learning: a new approach for fuzzy dynamic programming , 1994, Proceedings of 1994 IEEE 3rd International Fuzzy Systems Conference.

[6] Hamid R. Berenji,et al. A convergent actor-critic-based FRL algorithm with application to power management of wireless transmitters , 2003, IEEE Trans. Fuzzy Syst..

[7] Bart De Schutter,et al. Fuzzy Approximation for Convergent Model-Based Reinforcement Learning , 2007, 2007 IEEE International Fuzzy Systems Conference.

[8] H. R. Berenji,et al. Fuzzy Q-learning for generalization of reinforcement learning , 1996, Proceedings of IEEE 5th International Fuzzy Systems.

[9] Yong Duan,et al. Fuzzy reinforcement learning and its application in robot navigation , 2005, 2005 International Conference on Machine Learning and Cybernetics.