Learning cooperative assembly with the graph representation of a state-action space

In this paper, we present a method for two robot manipulators to learn cooperative assembly tasks. A learning algorithm based on trial end error is used to find a sequence for each robot to assemble the goal aggregate. It is shown that a distributed learning method based on a Markov decision process is able to learn the sequences for the involved robots. A novel state-action graph is used to store the reinforcement values of the learning process. The approach is designed in a way that not only exact matches but also similar aggregates are accepted by the system.

[1]  Jianwei Zhang,et al.  Learning cooperative grasping with the graph representation of a state-action space , 2002, Robotics Auton. Syst..

[2]  Ron Sun,et al.  Partitioning in reinforcement learning , 1999, IJCNN'99. International Joint Conference on Neural Networks. Proceedings (Cat. No.99CH36339).

[3]  Steven J. Bradtke,et al.  Reinforcement Learning Applied to Linear Quadratic Regulation , 1992, NIPS.

[4]  Arthur C. Sanderson,et al.  Task sequence planning for robotic assembly , 1989 .

[5]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[6]  Aristides A. G. Requicha,et al.  Representations for assemblies , 1991 .

[7]  Andrew W. Moore,et al.  Variable Resolution Dynamic Programming , 1991, ML Workshop.

[8]  Kuo-Chung Tai,et al.  The Tree-to-Tree Correction Problem , 1979, JACM.

[9]  Craig Boutilier,et al.  Decision-Theoretic Planning: Structural Assumptions and Computational Leverage , 1999, J. Artif. Intell. Res..

[10]  Franz Kummert,et al.  Learning assembly sequence plans using functional models , 1999, Proceedings of the 1999 IEEE International Symposium on Assembly and Task Planning (ISATP'99) (Cat. No.99TH8470).

[11]  Lusheng Wang,et al.  Alignment of trees: an alternative to tree edit , 1995 .

[12]  Michael I. Jordan,et al.  Reinforcement Learning with Soft State Aggregation , 1994, NIPS.

[13]  Samuel Pierre,et al.  An artificial intelligence approach for generating assembly sequences in CAD/CAM , 1996, Artif. Intell. Eng..