Learning reliable manipulation strategies without initial physical models

This paper describes a robot with limited sensing and actuation capabilities and no initial model of how its actions affect the world. The robot acquires such a model through exploration, practice, and observation. As its model of its actions becomes more accurate, it generates increasingly successful plans to achieve its goals. In an apparently nondeterministic world, achieving reliability requires identifying reliable actions and preferring them. Furthermore, by selecting its training actions carefully, the robot can significantly improve its learning rate.
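
The idea of identifying reliable actions and preferring them can be illustrated with a minimal sketch. This is not the paper's method: it assumes discrete states, actions, and outcomes, and it estimates reliability as the observed fraction of trials in which an action produced the desired outcome. The class name `OutcomeModel` and all method, state, and action names are hypothetical.

```python
from collections import Counter, defaultdict


class OutcomeModel:
    """Empirical outcome counts per (state, action) pair (hypothetical sketch)."""

    def __init__(self):
        # Maps (state, action) to a Counter of the outcomes observed so far.
        self.counts = defaultdict(Counter)

    def observe(self, state, action, outcome):
        """Record one observed result of taking `action` in `state`."""
        self.counts[(state, action)][outcome] += 1

    def reliability(self, state, action, goal):
        """Fraction of observed trials that ended in `goal`; 0.0 if untried."""
        seen = self.counts.get((state, action), Counter())
        total = sum(seen.values())
        return seen[goal] / total if total else 0.0

    def most_reliable_action(self, state, actions, goal):
        """Pick the action with the highest estimated reliability for `goal`."""
        return max(actions, key=lambda a: self.reliability(state, a, goal))


if __name__ == "__main__":
    model = OutcomeModel()
    # Assumed toy data: "push" succeeds 2 of 3 times, "tilt" 3 of 3 times.
    for outcome in ["at_goal", "at_goal", "slipped"]:
        model.observe("start", "push", outcome)
    for outcome in ["at_goal", "at_goal", "at_goal"]:
        model.observe("start", "tilt", outcome)
    print(model.most_reliable_action("start", ["push", "tilt"], "at_goal"))
```

Scoring untried actions as 0.0 makes this sketch purely exploitative. A learner that also chooses its training actions carefully, as the abstract describes, would need a separate rule for deciding which actions to try next.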
