Learning a Cost-Sensitive Internal Representation for Reinforcement Learning

Standard reinforcement learning methods assume that an agent can distinctly identify each state before choosing an action. In reality, a robot agent has only limited sensing capability, and identifying each state through extensive sensing can be time-consuming. This paper describes an approach that learns active perception strategies within reinforcement learning and explicitly accounts for sensing costs. The approach simultaneously learns a task-dependent internal representation and a decision policy in a finite, deterministic environment. It not only maximizes the long-term discounted reward per action but also reduces the average sensing cost per state. Initial experimental results in a simulated robot navigation domain are encouraging.
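To make the reward/sensing trade-off concrete, the sketch below shows one simple way such a trade-off can be expressed: tabular Q-learning in which a fixed per-observation sensing cost is folded into the reward signal. This is an illustrative sketch only, not the paper's algorithm; the environment interface (`reset`/`step`), the constant `SENSING_COST`, and all hyperparameters are hypothetical choices for the example.

```python
# Minimal sketch (assumed, not the paper's method): tabular Q-learning
# where every state observation is charged a fixed sensing cost, so the
# learned policy balances task reward against the cost of sensing.
import random

ACTIONS = ["up", "down", "left", "right"]
GAMMA, ALPHA, EPSILON = 0.9, 0.1, 0.1   # assumed hyperparameters
SENSING_COST = 0.05                     # assumed cost per state identification


def q_learning(env, episodes=500):
    """env is a hypothetical interface: reset() -> state,
    step(action) -> (next_state, reward, done)."""
    q = {}  # (state, action) -> estimated value
    for _ in range(episodes):
        state = env.reset()
        done = False
        while not done:
            # Epsilon-greedy action selection over the current Q-estimates.
            if random.random() < EPSILON:
                action = random.choice(ACTIONS)
            else:
                action = max(ACTIONS, key=lambda a: q.get((state, a), 0.0))
            next_state, reward, done = env.step(action)
            # Charge the sensing cost for identifying next_state, so the
            # agent prefers behavior that keeps average sensing cost low.
            effective_reward = reward - SENSING_COST
            best_next = max(q.get((next_state, a), 0.0) for a in ACTIONS)
            target = effective_reward + (0.0 if done else GAMMA * best_next)
            old = q.get((state, action), 0.0)
            q[(state, action)] = old + ALPHA * (target - old)
            state = next_state
    return q
```

In this sketch the sensing cost is a constant, whereas the paper's approach learns which sensing operations are worth their cost as part of a task-dependent internal representation; the sketch only illustrates the objective of discounting reward by sensing expense.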