Improved object pose estimation via deep pre-touch sensing

For certain manipulation tasks, object pose estimation from head-mounted cameras may not be sufficiently accurate. This is at least in part due to our inability to perfectly calibrate the coordinate frames of today's high degree of freedom robot arms that link the head to the end-effectors. We present a novel framework combining pre-touch sensing and deep learning to more accurately estimate pose in an efficient manner. The use of pre-touch sensing allows our method to localize the object directly with respect to the robot's end effector, thereby avoiding error caused by miscalibration of the arms. Instead of requiring the robot to scan the entire object with its pre-touch sensor, we use a deep neural network to detect object regions that contain distinctive geometric features. By focusing pre-touch sensing on these regions, the robot can more efficiently gather the information necessary to adjust its original pose estimate. Our region detection network was trained using a new dataset containing objects of widely varying geometries and has been labeled in a scalable fashion that is free from human bias. This dataset is applicable to any task that involves a pre-touch sensor gathering geometric information, and has been made publicly available. We evaluate our framework by having the robot re-estimate the pose of a number of objects of varying geometries. Compared to two simpler region proposal methods, we find that our deep neural network performs significantly better. In addition, we find that after a sequence of scans, objects can typically be localized to within 0.5 cm of their true position. We also observe that the original pose estimate can often be significantly improved after collecting a single quick scan.

[1]  Joshua R. Smith,et al.  Pre-touch sensing for sequential manipulation , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[2]  Shigeki Sugano,et al.  Tactile object recognition using deep learning and dropout , 2014, 2014 IEEE-RAS International Conference on Humanoid Robots.

[3]  Oussama Khatib,et al.  Virtual whiskers — Highly responsive robot collision avoidance , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[4]  Kevin Huang,et al.  Sensor-aided teleoperated grasping of transparent objects , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[5]  Honglak Lee,et al.  Deep learning for detecting robotic grasps , 2013, Int. J. Robotics Res..

[6]  Michael Beetz,et al.  Improving robot manipulation through fingertip perception , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[7]  Hubert Zangl,et al.  Responsive fingers — capacitive sensing during object manipulation , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[8]  Ashutosh Saxena,et al.  Reactive grasping using optical proximity sensors , 2009, 2009 IEEE International Conference on Robotics and Automation.

[9]  John Kenneth Salisbury,et al.  Using Near-Field Stereo Vision for Robotic Grasping in Cluttered Environments , 2010, ISER.

[10]  Ran Liu,et al.  Outlier Robust ICP for Minimizing Fractional RMSD , 2006, Sixth International Conference on 3-D Digital Imaging and Modeling (3DIM 2007).

[11]  Di Guo,et al.  Transmissive optical pretouch sensing for robotic grasping , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[12]  Sven Behnke,et al.  Registration with the Point Cloud Library: A Modular Framework for Aligning in 3-D , 2015, IEEE Robotics & Automation Magazine.

[13]  Ashutosh Saxena,et al.  Efficient grasping from RGBD images: Learning using a new rectangle representation , 2011, 2011 IEEE International Conference on Robotics and Automation.

[14]  Joshua R. Smith,et al.  Seashell effect pretouch sensing for robotic grasping , 2012, 2012 IEEE International Conference on Robotics and Automation.

[15]  Radu Bogdan Rusu,et al.  3D is here: Point Cloud Library (PCL) , 2011, 2011 IEEE International Conference on Robotics and Automation.

[16]  Sergey Levine,et al.  Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection , 2016, Int. J. Robotics Res..

[17]  Joseph Redmon,et al.  Real-time grasp detection using convolutional neural networks , 2014, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[18]  Sergey Levine,et al.  Deep spatial autoencoders for visuomotor learning , 2015, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[19]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Andrew Zisserman,et al.  Return of the Devil in the Details: Delving Deep into Convolutional Nets , 2014, BMVC.

[21]  Abhinav Gupta,et al.  Supersizing self-supervision: Learning to grasp from 50K tries and 700 robot hours , 2015, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[22]  Joshua R. Smith,et al.  Electric field imaging pretouch for robotic graspers , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[23]  Joshua R. Smith,et al.  Electric Field Servoing for robotic manipulation , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[24]  Yi Li,et al.  Robot Learning Manipulation Action Plans by "Watching" Unconstrained Videos from the World Wide Web , 2015, AAAI.

[25]  Pieter Abbeel,et al.  Active exploration using trajectory optimization for robotic grasping in the presence of occlusions , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[26]  Oussama Khatib,et al.  Global Localization of Objects via Touch , 2011, IEEE Transactions on Robotics.

[27]  Wolfram Burgard,et al.  Point feature extraction on 3D range scans taking into account object boundaries , 2011, 2011 IEEE International Conference on Robotics and Automation.

[28]  Kurt Konolige,et al.  Calibrating a Multi-arm Multi-sensor Robot: A Bundle Adjustment Approach , 2010, ISER.