Learning reliable manipulation strategies without initial physical models

Abstract Christiansen, A.D., Mason, M.T. and Mitchell, T.M., Learning reliable manipulation strategies without initial physical models, Robitics and Autonomous Systems, 8 (1991) 7–18. We describe a robot, possessing limited sensory and effectory capabilties but no initial model of the effects of its actions on the world, that acquires such a model through exploration, practice, and observation. By acquiring an increasingly correct model of its actions, it generates increasingly successful plans to achieve its goals. In an apparently non-deterministic world, achieving reliability requires the identification of reliable actions and a preference for using such actions. Furthermore, by selecting its training carefully, the robot can significantly improve its learning rate.

[1]  W. Thomas Miller,et al.  Sensor-based control of robotic manipulators using a general learning algorithm , 1987, IEEE J. Robotics Autom..

[2]  Matthew T. Mason,et al.  An exploration of sensorless manipulation , 1986, Proceedings. 1986 IEEE International Conference on Robotics and Automation.

[3]  Barak A. Pearlmutter,et al.  Using a neural network to learn the dynamics of the CMU Direct-Drive Arm II , 1988 .

[4]  M. Degroot Optimal Statistical Decisions , 1970 .

[5]  Matthew T. Mason,et al.  An exploration of sensorless manipulation , 1986, IEEE J. Robotics Autom..

[6]  Russell H. Taylor,et al.  Sensor-based manipulation planning as a game with nature , 1988 .

[7]  James L. Crowley Dynamic world modeling for an intelligent mobile robot using a rotating ultra-sonic ranging device , 1985, Proceedings. 1985 IEEE International Conference on Robotics and Automation.

[8]  David J. Reinkensmeyer,et al.  Task-level robot learning , 1988, Proceedings. 1988 IEEE International Conference on Robotics and Automation.

[9]  Leslie Pack Kaelbling,et al.  A Formal Framework for Learning in Embedded Systems , 1989, ML.

[10]  Alan D. Christiansen,et al.  Experiments in Robot Learning , 1989, ML.

[11]  Jeann-Jacques E. Slotine,et al.  Adaptive trajectory control of manipulators , 1989 .

[12]  Jakub Segen,et al.  Automatic discovery of robotic grasp configurations , 1988, Proceedings. 1988 IEEE International Conference on Robotics and Automation.

[13]  Christopher G. Atkeson,et al.  Task-level robot learning: juggling a tennis ball more accurately , 1989, Proceedings, 1989 International Conference on Robotics and Automation.

[14]  Jean-Paul Laumond,et al.  Model Structuring and Concept Recognition: Two Aspects of Learning for a Mobile Robot , 1983, IJCAI.