Sparse Template-Based 6-D Pose Estimation of Metal Parts Using a Monocular Camera

The six-dimensional (6-D) pose estimation of smooth metal parts is a common and important task in intelligent manufacturing. Computer-aided design (CAD)-based monocular vision methods offer more advantages than those offered by other methods. However, they are subject to several drawbacks such as high complexity, low robustness, and unsatisfactory accuracy, which hinder their application in industry. In this paper, a new approach with corresponding practical algorithms is proposed to solve these problems. The proposed approach uses high-level geometric features and the correlation of straight contours, to represent the part. Moreover, it exploits the matched special location points on the geometric features, which are the endpoints of the straight contours, to accurately estimate the 6-D pose. Practical algorithms based on the modification of the existing line-feature descriptors are proposed to implement the approach. The experimental results revealed that the proposed approach and algorithms can achieve higher accuracy and robustness with fewer templates.

[1]  Paul J. Besl,et al.  A Method for Registration of 3-D Shapes , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[3]  Tetsuro Yabuta,et al.  High-accuracy multiviewpoint stereo measurement using the maximum-likelihood method , 1997, IEEE Trans. Ind. Electron..

[4]  C. Steger OCCLUSION , CLUTTER , AND ILLUMINATION INVARIANT OBJECT RECOGNITION , 2002 .

[5]  Manabu Hashimoto,et al.  Fast 6D Pose Estimation from a Monocular Image Using Hierarchical Pose Trees , 2016, ECCV.

[6]  Alessio Del Bue,et al.  Fast 6D pose estimation for texture-less objects from a single RGB image , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[7]  V. Lepetit,et al.  EPnP: An Accurate O(n) Solution to the PnP Problem , 2009, International Journal of Computer Vision.

[8]  Dima Damen,et al.  Real-time Learning and Detection of 3D Texture-less Objects: A Scalable Approach , 2012, BMVC 2012.

[9]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[10]  De Xu,et al.  Ceiling-Based Visual Positioning for an Indoor Mobile Robot With Monocular Vision , 2009, IEEE Transactions on Industrial Electronics.

[11]  Tinne Tuytelaars,et al.  Is 2D Information Enough For Viewpoint Estimation? , 2014, BMVC.

[12]  Holly E. Rushmeier,et al.  The 3D Model Acquisition Pipeline , 2002, Comput. Graph. Forum.

[13]  Markus Ulrich,et al.  Combining Scale-Space and Similarity-Based Aspect Graphs for Fast 3D Object Recognition , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Haryong Song,et al.  Robust Vision-Based Relative-Localization Approach Using an RGB-Depth Camera and LiDAR Sensor Fusion , 2016, IEEE Transactions on Industrial Electronics.

[15]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[16]  Zhiguo Jiang,et al.  Vision-Based Pose Estimation for Textureless Space Objects by Contour Points Matching , 2018, IEEE Transactions on Aerospace and Electronic Systems.

[17]  Vincent Lepetit,et al.  A Novel Representation of Parts for Accurate 3D Object Detection and Tracking in Monocular Images , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[18]  Takeo Kanade,et al.  Shape and motion from image streams under orthography: a factorization method , 1992, International Journal of Computer Vision.

[19]  Vincent Lepetit,et al.  Dominant orientation templates for real-time detection of texture-less objects , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[20]  Roberto Cipolla,et al.  PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[21]  Rafael Grompone von Gioi,et al.  LSD: a Line Segment Detector , 2012, Image Process. Line.

[22]  Gérard G. Medioni,et al.  Object modelling by registration of multiple range images , 1992, Image Vis. Comput..

[23]  Philip David,et al.  Object recognition in high clutter images using line features , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[24]  Vincent Lepetit,et al.  Gradient Response Maps for Real-Time Detection of Textureless Objects , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Federico Tombari,et al.  BOLD Features to Detect Texture-less Objects , 2013, 2013 IEEE International Conference on Computer Vision.

[26]  Leonidas J. Guibas,et al.  Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[27]  Jae-Bok Song,et al.  Monocular Vision-Based SLAM in Indoor Environment Using Corner, Lamp, and Door Features From Upward-Looking Camera , 2011, IEEE Transactions on Industrial Electronics.

[28]  Sheng-Jen Tsai,et al.  Design and Implementation of a Binocular-Vision System for Locating Footholds of a Multi-Legged Walking Robot , 1985, IEEE Transactions on Industrial Electronics.

[29]  Ping Zhang,et al.  Human–Manipulator Interface Based on Multisensory Process via Kalman Filters , 2014, IEEE Transactions on Industrial Electronics.

[30]  Libing Jiang,et al.  Single-Image 3D Pose Estimation for Texture-Less Object via Symmetric Prior , 2018, IEICE Trans. Inf. Syst..

[31]  Gérard G. Medioni,et al.  Object modeling by registration of multiple range images , 1991, Proceedings. 1991 IEEE International Conference on Robotics and Automation.

[32]  Antonio Torralba,et al.  FPM: Fine Pose Parts-Based Model with 3D CAD Models , 2014, ECCV.