Paper information - Learning cooperative assembly with the graph representation of a state-action space

Learning cooperative assembly with the graph representation of a state-action space

In this paper, we present a method by which two robot manipulators learn cooperative assembly tasks. A learning algorithm based on trial and error finds, for each robot, a sequence of actions that assembles the goal aggregate. We show that a distributed learning method based on a Markov decision process can learn the sequences for the robots involved. A novel state-action graph stores the reinforcement values of the learning process. The approach is designed so that the system accepts not only exact matches of the goal aggregate but also similar aggregates.
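The idea of keeping reinforcement values on a state-action graph can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not state the update rule, so a standard one-step Q-learning update is assumed here, and the class name `StateActionGraph`, the parameters `alpha` and `gamma`, and the part and action names are all hypothetical. States are modelled as sets of already-assembled parts, and each edge is an action that carries a learned value.

```python
from collections import defaultdict


class StateActionGraph:
    """Directed graph: nodes are states, edges are actions carrying a learned value."""

    def __init__(self, alpha=0.5, gamma=0.9):
        self.alpha = alpha  # learning rate (assumed, not from the paper)
        self.gamma = gamma  # discount factor (assumed, not from the paper)
        # edges[state][action] = [value, successor state]
        self.edges = defaultdict(dict)

    def value(self, state, action):
        """Stored value of an edge, 0.0 if the edge has not been explored."""
        edge = self.edges[state].get(action)
        return edge[0] if edge else 0.0

    def best_value(self, state):
        """Highest edge value leaving `state`, 0.0 for a node with no edges."""
        return max((v for v, _ in self.edges[state].values()), default=0.0)

    def update(self, state, action, reward, next_state):
        """One-step Q-learning update on the edge (state, action)."""
        old = self.value(state, action)
        target = reward + self.gamma * self.best_value(next_state)
        new = old + self.alpha * (target - old)
        self.edges[state][action] = [new, next_state]
        return new


# Hypothetical two-part assembly: states are sets of parts already joined.
graph = StateActionGraph(alpha=0.5, gamma=0.9)
empty = frozenset()
one = frozenset({"bolt"})
goal = frozenset({"bolt", "nut"})

# Replay the same two-step episode twice; only the final step is rewarded.
for _ in range(2):
    graph.update(empty, "place_bolt", 0.0, one)
    graph.update(one, "screw_nut", 1.0, goal)
```

After the second pass, the reward obtained at the goal has propagated back along the edges, so the first action also carries a positive value. In a distributed setting each robot could hold such a graph of its own, though how the paper divides the graph between the two manipulators is not stated in the abstract.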

Jianwei Zhang | Markus Ferch | Matthias Höchsmann
