Learning prospective pick and place behavior

When interacting with an object, the possible choices of grasping and manipulation operations are often limited by pick-and-place constraints. Traditional planning methods are analytical in nature and require geometric models of parts, fixtures and motions to identify and avoid the constraints. These methods can easily become computationally expensive and are often brittle under model or sensory uncertainty. In contrast, infants do not construct complete models of the objects that they manipulate, but instead appear to incrementally construct models based on interaction with the objects themselves. We propose that robotic pick-and-place operations can be formulated as prospective behavior and that an intelligent agent can use interaction with the environment to learn strategies which accommodate the constraints based on expected future success. We present experiments demonstrating this technique, and compare the strategies utilized by the agent to the behaviors observed in young children when presented with a similar task.

[1]  Tomás Lozano-Pérez,et al.  Automatic Planning of Manipulator Transfer Movements , 1981, IEEE Transactions on Systems, Man, and Cybernetics.

[2]  C. Watkins Learning from delayed rewards , 1989 .

[3]  Tomás Lozano-Pérez,et al.  Planning two-fingered grasps for pick-and-place operations on polyhedra , 1990, Proceedings., IEEE International Conference on Robotics and Automation.

[4]  Roderic A. Grupen,et al.  The applications of harmonic functions to robotics , 1993, J. Field Robotics.

[5]  Roderic A. Grupen,et al.  A control basis for multilegged walking , 1996, Proceedings of IEEE International Conference on Robotics and Automation.

[6]  R. A. Grupen,et al.  TITLE A Hybrid Discrete Event Dynamic Systems Approach to Robot Control , 1996 .

[7]  J. A. Coelho,et al.  A Control Basis for Learning Multifingered Grasps , 1997 .

[8]  J. A. Coelho,et al.  A control basis for learning multifingered grasps , 1997, J. Field Robotics.

[9]  Justus H. Piater,et al.  Toward learning visual discrimination strategies , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[10]  M. E. McCarty,et al.  Problem solving in infancy: the emergence of an action plan. , 1999, Developmental psychology.

[11]  Justus H. Piater,et al.  Feature learning for recognition with Bayesian networks , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[12]  Roderic A. Grupen,et al.  Multifingered grasping: grasp reflexes and control context , 2001 .

[13]  Justus H. Piater,et al.  Developing haptic and visual perceptual categories for reaching and grasping with a humanoid robot , 2001, Robotics Auton. Syst..

[14]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.