Pinball: Planning and Learning in a Dynamic Real-Time Environment
暂无分享,去创建一个
[1] Hendrik Van Brussel,et al. A self-learning automaton with variable resolution for high precision assembly by industrial robots , 1982 .
[2] Jonas Karlsson,et al. Learning Multiple Goal Behavior via Task Decomposition and Dynamic Policy Merging , 1993 .
[3] Chris Watkins,et al. Learning from delayed rewards , 1989 .
[4] Alan D. Christiansen,et al. Learning reliable manipulation strategies without initial physical models , 1990, Proceedings., IEEE International Conference on Robotics and Automation.
[5] R. Sutton,et al. Connectionist Learning for Control: An Overview , 1989 .
[6] Tom M. Mitchell,et al. Generalization as Search , 2002 .
[7] David W. Aha,et al. Noise-Tolerant Instance-Based Learning Algorithms , 1989, IJCAI.
[8] Sridhar Mahadevan,et al. Rapid Task Learning for Real Robots , 1993 .
[9] Peter Cheeseman,et al. On the Representation and Estimation of Spatial Uncertainty , 1986 .