Visual-Guided Robot Arm Using Self-Supervised Deep Convolutional Neural Networks

Perception-based learning approaches to robotic grasping have shown significant promise. This is further reinforced by using supervised deep learning in robotic arm. However, to properly train deep networks and prevent overfitting, massive datasets of labelled samples must be available. Creating such datasets by human labelling is an exhaustive task since most objects can be grasped at multiple points and in several orientations. Accordingly, this work employs a self-supervised learning technique in which the training dataset is labelled by the robot itself. Above all, we propose a cascaded network that reduces the time of the grasping task by eliminating ungraspable samples from the inference process. In addition to grasping task which performs pose estimation, we enlarge the network to perform an auxiliary task, object classification in which data labelling can be done easily by human. Notably, our network is capable of estimating 18 grasping poses and classifying 4 objects simultaneously. The experimental results show that the proposed network achieves an accuracy of 94.8% in estimating the grasping pose and 100% in classifying the object category, in 0.65 seconds.

[1]  Gary M. Bone,et al.  Automated modeling and robotic grasping of unknown three-dimensional objects , 2008, 2008 IEEE International Conference on Robotics and Automation.

[2]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[3]  Douglas Chai,et al.  Review of Deep Learning Methods in Robotic Grasp Detection , 2018, Multimodal Technol. Interact..

[4]  Léon Bottou,et al.  Large-Scale Machine Learning with Stochastic Gradient Descent , 2010, COMPSTAT.

[5]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[6]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[7]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[8]  Zhengyou Zhang,et al.  A Flexible New Technique for Camera Calibration , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Honglak Lee,et al.  Deep learning for detecting robotic grasps , 2013, Int. J. Robotics Res..

[10]  Manabu Hashimoto,et al.  Model-less Estimation Method for Robot Grasping Parameters Using 3D Shape Primitive Approximation , 2018, 2018 IEEE 14th International Conference on Automation Science and Engineering (CASE).

[11]  Christopher Kanan,et al.  Robotic grasp detection using deep convolutional neural networks , 2016, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[12]  Larry S. Davis,et al.  Non-parametric Model for Background Subtraction , 2000, ECCV.

[13]  Lawson L. S. Wong,et al.  Learning Grasp Strategies with Partial Shape Information , 2008, AAAI.

[14]  Ashutosh Saxena,et al.  Efficient grasping from RGBD images: Learning using a new rectangle representation , 2011, 2011 IEEE International Conference on Robotics and Automation.

[15]  Fei-Fei Li,et al.  ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Sukhan Lee,et al.  A robot-camera hand/eye self-calibration system using a planar target , 2013, IEEE ISR 2013.

[17]  Abhinav Gupta,et al.  Supersizing self-supervision: Learning to grasp from 50K tries and 700 robot hours , 2015, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[18]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[19]  Joseph Redmon,et al.  Real-time grasp detection using convolutional neural networks , 2014, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[20]  Mark Sandler,et al.  MobileNetV2: Inverted Residuals and Linear Bottlenecks , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[21]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).