Viewpoint optimization for aiding grasp synthesis algorithms using reinforcement learning

ABSTRACT Grasp synthesis for unknown objects is a challenging problem as the algorithms are expected to cope with missing object shape information. This missing information is a function of the vision sensor viewpoint. The majority of the grasp synthesis algorithms in literature synthesize a grasp by using one single image of the target object and making assumptions on the missing shape information. On the contrary, this paper proposes the use of robot's depth sensor actively: we propose an active vision methodology that optimizes the viewpoint of the sensor for increasing the quality of the synthesized grasp over time. By this way, we aim to relax the assumptions on the sensor's viewpoint and boost the success rates of the grasp synthesis algorithms. A reinforcement learning technique is employed to obtain a viewpoint optimization policy, and a training process and automated training data generation procedure are presented. The methodology is applied to a simple force-moment balance-based grasp synthesis algorithm, and a thousand simulations with five objects are conducted with random initial poses in which the grasp synthesis algorithm was not able to obtain a good grasp with the initial viewpoint. In 94% of these cases, the policy achieved to find a successful grasp. GRAPHICAL ABSTRACT

[1]  H.G. Cai,et al.  Grasping unknown objects based on 3d model reconstruction , 2005, Proceedings, 2005 IEEE/ASME International Conference on Advanced Intelligent Mechatronics..

[2]  Subhashis Banerjee,et al.  Active recognition through next view planning: a survey , 2004, Pattern Recognit..

[3]  Lucas Paletta,et al.  Active object recognition by view integration and reinforcement learning , 2000, Robotics Auton. Syst..

[4]  Dieter Fox,et al.  Autonomous generation of complete 3D object models using next best view manipulation planning , 2011, 2011 IEEE International Conference on Robotics and Automation.

[5]  Stefano Caselli,et al.  Perception and Grasping of Object Parts from Active Robot Exploration , 2014, J. Intell. Robotic Syst..

[6]  Markus Vincze,et al.  Empty the basket - a shape based learning approach for grasping piles of unknown objects , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[7]  Kensuke Harada,et al.  Experiments on Learning Based Industrial Bin-picking with Iterative Visual Recognition , 2018, Ind. Robot.

[8]  Guido C. H. E. de Croon,et al.  Comparing active vision models , 2009, Image Vis. Comput..

[9]  Michael Suppa,et al.  Efficient next-best-scan planning for autonomous 3D surface reconstruction of unknown objects , 2013, Journal of Real-Time Image Processing.

[10]  Lucas Paletta,et al.  Appearance-based active object recognition , 2000, Image Vis. Comput..

[11]  Leslie Pack Kaelbling,et al.  Integrated task and motion planning in belief space , 2013, Int. J. Robotics Res..

[12]  Jun Li,et al.  Active Recognition and Manipulation for Mobile Robot Bin Picking , 2014, Technology Transfer Experiments from the ECHORD Project.

[13]  Julien Marzat,et al.  Learning Viewpoint Planning in Active Recognition on a Small Sampling Budget: A Kriging Approach , 2010, 2010 Ninth International Conference on Machine Learning and Applications.

[14]  Quoc V. Le,et al.  Learning to grasp objects with multiple contact points , 2010, 2010 IEEE International Conference on Robotics and Automation.

[15]  Ashutosh Saxena,et al.  Robotic Grasping of Novel Objects using Vision , 2008, Int. J. Robotics Res..

[16]  Gearing Up and Accelerating Cross-fertilization between Academic and Industrial Robotics Research in Europe: - Technology Transfer Experiments from the ECHORD Project , 2014, Springer Tracts in Advanced Robotics.

[17]  Marek Sewer Kopicki,et al.  Active vision for dexterous grasping of novel objects , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[18]  Pieter Abbeel,et al.  Active exploration using trajectory optimization for robotic grasping in the presence of occlusions , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[19]  Yuichi Motai,et al.  Hand–Eye Calibration Applied to Viewpoint Selection for Robotic Vision , 2008, IEEE Transactions on Industrial Electronics.

[20]  Jing Xiao,et al.  Efficient and effective grasping of novel objects through learning and adapting a knowledge base , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[21]  Kimitoshi Yamazaki,et al.  A grasp planning for picking up an unknown object for a mobile manipulator , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[22]  Jürgen Beyerer,et al.  Bayesian active object recognition via Gaussian process regression , 2012, 2012 15th International Conference on Information Fusion.

[23]  Maxim Likhachev,et al.  Planning for grasp selection of partially occluded objects , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[24]  Joachim Denzler,et al.  Information Theoretic Sensor Data Selection for Active Object Recognition and State Estimation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[25]  Laura Fernández Gallardo,et al.  Detection of parametrized 3-D primitives from stereo for robotic grasping , 2011, 2011 15th International Conference on Advanced Robotics (ICAR).

[26]  Shengyong Chen,et al.  Active vision in robotic systems: A survey of recent developments , 2011, Int. J. Robotics Res..

[27]  脇元 修一,et al.  IEEE International Conference on Robotics and Automation (ICRA) におけるフルードパワー技術の研究動向 , 2011 .

[28]  Claire Dune,et al.  Active rough shape estimation of unknown objects , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[29]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[30]  Ashutosh Saxena,et al.  Learning to Grasp Novel Objects Using Vision , 2006, ISER.

[31]  Dorin Comaniciu,et al.  Conditional feature sensitivity: a unifying view on active recognition and feature selection , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[32]  Gary M. Bone,et al.  Automated modeling and robotic grasping of unknown three-dimensional objects , 2008, 2008 IEEE International Conference on Robotics and Automation.

[33]  Danica Kragic,et al.  Mind the gap - robotic grasping under incomplete observation , 2011, 2011 IEEE International Conference on Robotics and Automation.

[34]  Tal Arbel,et al.  A fast discriminant approach to active object recognition and pose estimation , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[35]  Joachim Denzler,et al.  A Framework for Actively Selecting Viewpoints in Object Recognition , 2009, Int. J. Pattern Recognit. Artif. Intell..

[36]  Oussama Khatib,et al.  Grasping with application to an autonomous checkout robot , 2011, 2011 IEEE International Conference on Robotics and Automation.

[37]  Katsunari Shibata,et al.  Active perception and recognition learning system based on Actor-Q architecture , 2002, Systems and Computers in Japan.

[38]  Lawson L. S. Wong,et al.  Learning Grasp Strategies with Partial Shape Information , 2008, AAAI.

[39]  Ashutosh Saxena,et al.  Efficient grasping from RGBD images: Learning using a new rectangle representation , 2011, 2011 IEEE International Conference on Robotics and Automation.