论文信息 - Grey Reinforcement Learning for Incomplete Information Processing

Grey Reinforcement Learning for Incomplete Information Processing

New representation and computation mechanisms are key approaches for learning problems with incomplete information or in large probabilistic environments. In this paper, traditional reinforcement learning (RL) methods are combined with grey theory and a novel grey reinforcement learning (GRL) framework is proposed to solve complex problems with incomplete information. Typical example of mobile robot navigation is given out to evaluate the performance and practicability of GRL. Related issues are also briefly discussed.

[1] Sridhar Mahadevan,et al. Recent Advances in Hierarchical Reinforcement Learning , 2003, Discret. Event Dyn. Syst..

[2] Thomas G. Dietterich. Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition , 1999, J. Artif. Intell. Res..

[3] Doina Precup,et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..

[4] John N. Tsitsiklis,et al. Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.

[5] Peter Dayan,et al. Q-learning , 1992, Machine Learning.

[6] Zonghai Chen,et al. Quantum Reinforcement Learning , 2005, ICNC.

[7] Peter Dayan,et al. Technical Note: Q-Learning , 2004, Machine Learning.

[8] Zonghai Chen,et al. An Autonomous Mobile Robot Based on Quantum Algorithm , 2005, CIS.

[9] Benjamin J. Kaipers,et al. Qualitative Simulation , 1989, Artif. Intell..

[10] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[11] Toshiyuki Kondo,et al. A reinforcement learning with evolutionary state recruitment strategy for autonomous mobile robots control , 2003, Robotics Auton. Syst..

[12] Ashwin Ram,et al. Experiments with Reinforcement Learning in Problems with Continuous State and Action Spaces , 1997, Adapt. Behav..

[13] Sridhar Mahadevan,et al. Hierarchical learning and planning in partially observable markov decision processes , 2002 .

[14] Hyung Suck Cho,et al. A sensor-based navigation for a mobile robot using fuzzy logic and reinforcement learning , 1995, IEEE Trans. Syst. Man Cybern..

[15] Richard S. Sutton,et al. Learning to predict by the methods of temporal differences , 1988, Machine Learning.

[16] Leslie Pack Kaelbling,et al. Effective reinforcement learning for mobile robots , 2002, Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292).

[17] Benjamin Kuipers,et al. Qualitative and Quantitative Simulation: Bridging the Gap , 1997, Artif. Intell..

[18] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .

[19] Jürgen Schmidhuber,et al. HQ-Learning , 1997, Adapt. Behav..

[20] Andrew James Smith,et al. Applications of the self-organising map to reinforcement learning , 2002, Neural Networks.

[21] Stuart J. Russell,et al. Reinforcement Learning with Hierarchies of Machines , 1997, NIPS.

[22] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[23] Aristidis Likas. Reinforcement Learning Using the Stochastic Fuzzy Min–Max Neural Network , 2004, Neural Processing Letters.

[24] P. Glorennec,et al. Fuzzy Q-learning , 1997, Proceedings of 6th International Fuzzy Systems Conference.