Kit-Net: Self-Supervised Learning to Kit Novel 3D Objects into Novel 3D Cavities

In industrial part kitting, 3D objects are inserted into cavities for transportation or subsequent assembly. Kitting is a critical step because it can decrease downstream processing and handling times and enable lower storage and shipping costs. We present Kit-Net, a framework for kitting previously unseen 3D objects into cavities given depth images of both the target cavity and an object held by a gripper in an unknown initial orientation. Kit-Net uses self-supervised deep learning and data augmentation to train a convolutional neural network (CNN) to robustly estimate 3D rotations between objects and matching concave or convex cavities using a large training dataset of simulated depth image pairs. Kit-Net then uses the trained CNN to implement a controller to orient and position novel objects for insertion into novel prismatic and conformal 3D cavities. Experiments in simulation suggest that Kit-Net can orient objects to have a 98.9% average intersection volume between the object mesh and that of the target cavity. Physical experiments with industrial objects succeed in 18% of trials using a baseline method and in 63% of trials with Kit-Net. Video, code, and data are available at https://github.com/BerkeleyAutomation/Kit-Net.
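
The abstract does not specify how the controller uses the CNN's rotation estimates, so the following is only a minimal sketch of one plausible closed-loop scheme, not Kit-Net's actual implementation. It assumes a hypothetical callable `estimate_rotation(current, target)` that stands in for the trained CNN and returns a unit quaternion taking the object's current orientation to the target orientation. The function names, the `step` fraction, and the stopping tolerance are all illustrative assumptions.

```python
import math

def quat_mul(a, b):
    """Hamilton product of two quaternions in (w, x, y, z) order."""
    w1, x1, y1, z1 = a
    w2, x2, y2, z2 = b
    return (w1*w2 - x1*x2 - y1*y2 - z1*z2,
            w1*x2 + x1*w2 + y1*z2 - z1*y2,
            w1*y2 - x1*z2 + y1*w2 + z1*x2,
            w1*z2 + x1*y2 - y1*x2 + z1*w2)

def quat_conj(q):
    """Conjugate, which is the inverse for a unit quaternion."""
    return (q[0], -q[1], -q[2], -q[3])

def quat_normalize(q):
    n = math.sqrt(sum(c * c for c in q))
    return tuple(c / n for c in q)

def quat_angle(q):
    """Rotation angle of a unit quaternion in radians, in [0, pi]."""
    return 2.0 * math.acos(min(1.0, abs(q[0])))

def quat_from_axis_angle(axis, angle):
    n = math.sqrt(sum(c * c for c in axis))
    s = math.sin(angle / 2.0) / n
    return (math.cos(angle / 2.0), axis[0] * s, axis[1] * s, axis[2] * s)

def scale_rotation(q, fraction):
    """Rotation about the same axis as q with its angle scaled by fraction."""
    if q[0] < 0.0:                      # pick the shorter of the two arcs
        q = tuple(-c for c in q)
    s = math.sqrt(q[1]**2 + q[2]**2 + q[3]**2)
    if s < 1e-12:                       # already (numerically) the identity
        return (1.0, 0.0, 0.0, 0.0)
    angle = 2.0 * math.atan2(s, q[0])
    return quat_from_axis_angle(q[1:], fraction * angle)

def orient(current, target, estimate_rotation,
           step=0.8, tol_deg=1.0, max_iters=20):
    """Apply partial rotation updates until the estimate is below tol_deg.

    estimate_rotation is a hypothetical stand-in for the trained CNN.
    Returns the final orientation and the number of updates applied.
    """
    updates = 0
    for _ in range(max_iters):
        delta = estimate_rotation(current, target)
        if math.degrees(quat_angle(delta)) < tol_deg:
            break
        # Taking only a fraction of the estimate hedges against estimator error.
        current = quat_normalize(quat_mul(scale_rotation(delta, step), current))
        updates += 1
    return current, updates
```

In this sketch, a noiseless oracle such as `lambda cur, tgt: quat_mul(tgt, quat_conj(cur))` can replace the estimator for testing. In a real system the estimate would come from depth images of the object and the cavity, and each update would be executed by the robot before the next image is captured.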
