Active Learning for Vision-Based Robot Grasping

Reliable vision-based grasping has proved elusive outside of controlled environments. One approach towards building more flexible and domain-independent robot grasping systems is to employ learning to adapt the robot's perceptual and motor system to the task. However, one pitfall in robot perceptual and motor learning is that the cost of gathering the learning set may be unacceptably high. Active learning algorithms address this shortcoming by intelligently selecting actions so as to decrease the number of examples necessary to achieve good performance and also avoid separate training and execution phases, leading to higher autonomy. We describe the IE-ID3 algorithm, which extends the Interval Estimation (IE) active learning approach from discrete to real-valued learning domains by combining IE with a classification tree learning algorithm (ID-3). We present a robot system which rapidly learns to select the grasp approach directions using IE-ID3 given simplified superquadric shape approximations of objects. Initial results on a small set of objects show that a robot with a laser scanner system can rapidly learn to pick up new objects, and simulation studies show the superiority of the active learning approach for a simulated grasping task using larger sets of objects. Extensions of the approach and future areas of research incorporating more sophisticated perceptual and action representation are discussed

[1]  G. Schlesinger Der mechanische Aufbau der künstlichen Glieder , 1919 .

[2]  J. Napier The prehensile movements of the human hand. , 1956, The Journal of bone and joint surgery. British volume.

[3]  George E. P. Box,et al.  Evolutionary Operation: A Statistical Method for Process Improvement , 1969 .

[4]  J. Ross Quinlan,et al.  Learning Efficient Classification Procedures and Their Application to Chess End Games , 1983 .

[5]  D. Rumelhart Learning internal representations by back-propagating errors , 1986 .

[6]  Jakub Segen,et al.  Automatic discovery of robotic grasp configurations , 1988, Proceedings. 1988 IEEE International Conference on Robotics and Automation.

[7]  Joe Jackson,et al.  Knowledge-based prehension: capturing human dexterity , 1988, Proceedings. 1988 IEEE International Conference on Robotics and Automation.

[8]  David J. Reinkensmeyer,et al.  Task-level robot learning , 1988, Proceedings. 1988 IEEE International Conference on Robotics and Automation.

[9]  David A. Cohn,et al.  Training Connectionist Networks with Queries and Selective Sampling , 1989, NIPS.

[10]  Mark R. Cutkosky,et al.  On grasp choice, grasp models, and the design of hands for manufacturing tasks , 1989, IEEE Trans. Robotics Autom..

[11]  Ming Tan,et al.  CSL: a cost-sensitive learning system for sensing and grasping objects , 1990, Proceedings., IEEE International Conference on Robotics and Automation.

[12]  S. Lehman The Neural and Behavioural Organization of Goal‐Directed Movements , 1990, Neurology.

[13]  Sharon A. Stansfield,et al.  Knowledge-based robotic grasping , 1990, Proceedings., IEEE International Conference on Robotics and Automation.

[14]  Ruzena Bajcsy,et al.  Recovery of Parametric Models from Range Images: The Case for Superquadrics with Global Deformations , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  Richard S. Sutton,et al.  Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming , 1990, ML.

[16]  Imin Kao,et al.  Grasping, manipulation, and control with tactile sensing , 1990, Proceedings., IEEE International Conference on Robotics and Automation.

[17]  Andrew W. Moore,et al.  Acquisition of Dynamic Control Knowledge for a Robotic Manipulator , 1990, ML.

[18]  Ruzena Bajcsy,et al.  Segmentation via manipulation , 1991, IEEE Trans. Robotics Autom..

[19]  Gianni Vercelli,et al.  Shape analysis and hand preshaping for grasping , 1991, Proceedings IROS '91:IEEE/RSJ International Workshop on Intelligent Robots and Systems '91.

[20]  Andrew W. Moore,et al.  Fast, Robust Adaptive Control by Learning only Forward Models , 1991, NIPS.

[21]  Sebastian Thrun,et al.  Active Exploration in Dynamic Environments , 1991, NIPS.

[22]  Sebastian Thrun,et al.  The role of exploration in learning control , 1992 .

[23]  Ruzena Bajcsy,et al.  Robotic sensorimotor learning in continuous domains , 1992, Proceedings 1992 IEEE International Conference on Robotics and Automation.

[24]  Sharon Stansfield Connectionist and neural net implementations of a robotic grasp generator , 1992, Defense, Security, and Sensing.

[25]  Marcos Salganicoff Learning and forgetting for perception-action: a projection pursuit and density adaptive approach , 1992 .

[26]  Leslie Pack Kaelbling,et al.  Learning in embedded systems , 1993 .

[27]  Marcos Salganicoff,et al.  Density-Adaptive Learning and Forgetting , 1993, ICML.

[28]  Katsushi Ikeuchi,et al.  Toward automatic robot instruction from perception-recognizing a grasp from observation , 1993, IEEE Trans. Robotics Autom..

[29]  Dana H. Ballard,et al.  Recognizing teleoperated manipulations , 1993, [1993] Proceedings IEEE International Conference on Robotics and Automation.

[30]  Simon Kasif,et al.  OC1: A Randomized Induction of Oblique Decision Trees , 1993, AAAI.

[31]  Andrew Blake,et al.  Visually guided grasping in 3D , 1994, Proceedings of the 1994 IEEE International Conference on Robotics and Automation.

[32]  Roderic A. Grupen,et al.  Learning admittance mappings for force-guided assembly , 1994, Proceedings of the 1994 IEEE International Conference on Robotics and Automation.

[33]  Christian Laugier,et al.  Achieving Dextrous Grasping by Integrating Planning and Vision-Based Sensing , 1995, Int. J. Robotics Res..

[34]  Ben J. A. Kröse,et al.  Learning from delayed rewards , 1995, Robotics Auton. Syst..

[35]  金田 重郎,et al.  C4.5: Programs for Machine Learning (書評) , 1995 .

[36]  Marcos Salganicoff,et al.  Active Exploration and Learning in real-Valued Spaces using Multi-Armed Bandit Allocation Indices , 1995, ICML.

[37]  Shimon Edelman,et al.  Learning to grasp using visual information , 1996, Proceedings of IEEE International Conference on Robotics and Automation.

[38]  Ming Tan,et al.  Cost-sensitive learning of classification knowledge and its applications in robotics , 2004, Machine Learning.

[39]  Richard S. Sutton,et al.  Learning to predict by the methods of temporal differences , 1988, Machine Learning.