Optimal Model-Based 6-D Object Pose Estimation With Structured-Light Depth Sensors

Structured-light (SL) depth sensors are widely used because of their simplicity in design and ability to process depth data with minimal computational expense. Certain SL light coding methods can, however, lead to a loss of information, as well as inhomogeneous depth errors that depend on the composition and properties of the scene. This results in a reduction of potential accuracy for model-based pose estimation methods that operate on the depth images or subsequently transformed three-dimensional point clouds, such as the popular class of point set registration (PSR) methods. We therefore formulate an asymptotically optimal maximum likelihood estimation (MLE) method that operates directly on the raw SL infrared (IR) images. The proposed SLIR-MLE method maximizes the likelihood of the measured IR image over the pose region given the object model, sensor model, and calibrated speckle and thermal noise distributions. We also formulate a method to compute the Fisher information contained in the IR image and resulting Cramér–Rao bound (CRB) of any unbiased pose estimator for unique SL sensor measurement data. SLIR-MLE is shown to nearly achieve the calculated CRB for the Kinect sensor by operating on the more informative raw IR images. Furthermore, our method is shown to outperform two cutting edge PSR methods by an order of magnitude in the respective mean square errors.

[1]  Peter A. Beling,et al.  Efficacy of statistical model-based pose estimation of rigid objects with corresponding CAD models using commodity depth sensors , 2014, IECON 2014 - 40th Annual Conference of the IEEE Industrial Electronics Society.

[2]  Richard Szeliski,et al.  Computer Vision - Algorithms and Applications , 2011, Texts in Computer Science.

[3]  周炯,et al.  Kind of static method and system for object reconstruction , 2014 .

[4]  E. L. Lehmann,et al.  Theory of point estimation , 1950 .

[5]  Harry L. Van Trees,et al.  Detection, Estimation, and Modulation Theory, Part I , 1968 .

[6]  Shahram Izadi,et al.  Modeling Kinect Sensor Noise for Improved 3D Reconstruction and Tracking , 2012, 2012 Second International Conference on 3D Imaging, Modeling, Processing, Visualization & Transmission.

[7]  Christian Eitzinger,et al.  Photometric stereo sensor for robot-assisted industrial quality inspection of coated composite material surfaces , 2015, International Conference on Quality Control by Artificial Vision.

[8]  Javier V. Gómez,et al.  Indoor Furniture and Room Recognition for a Robot Using Internet-Derived Models and Object Context , 2012, 2012 10th International Conference on Frontiers of Information Technology.

[9]  K. Harding Industrial metrology: Engineering precision , 2008 .

[10]  Qinfen Zheng,et al.  Model-based target recognition in pulsed ladar imagery , 2001, IEEE Trans. Image Process..

[11]  Achim J. Lilienthal,et al.  Application based 3D sensor evaluation: A case study in 3D object pose estimation for automated unloading of containers , 2013, 2013 European Conference on Mobile Robots.

[12]  Ling Shao,et al.  Enhanced Computer Vision With Microsoft Kinect Sensor: A Review , 2013, IEEE Transactions on Cybernetics.

[13]  마크 올레이닉 Methods and systems for food preparation in a robotic cooking kitchen , 2015 .

[14]  Shuzhi Sam Ge,et al.  Constrained Multilegged Robot System Modeling and Fuzzy Control With Uncertain Kinematics and Dynamics Incorporating Foot Force Optimization , 2016, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[15]  R. M. Fewster,et al.  Information on parameters of interest decreases under transformations , 2013, J. Multivar. Anal..

[16]  Markus Vincze,et al.  3DNet: Large-scale object class recognition from CAD models , 2012, 2012 IEEE International Conference on Robotics and Automation.

[17]  Rick Quax,et al.  Nonparametric estimation of Fisher information from real data. , 2015, Physical review. E.

[18]  In-So Kweon,et al.  Exploiting Shading Cues in Kinect IR Images for Geometry Refinement , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Joseph A. O'Sullivan,et al.  Quantitative statistical assessment of conditional models for synthetic aperture radar , 2004, IEEE Transactions on Image Processing.

[20]  Margrit Betke,et al.  Necessary conditions to attain performance bounds on structure and motion estimates of rigid objects , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[21]  Venu Madhav Govindu,et al.  On Averaging Multiview Relations for 3D Scan Registration , 2014, IEEE Transactions on Image Processing.

[22]  J. Spall Monte Carlo Computation of the Fisher Information Matrix in Nonstandard Settings , 2005 .

[23]  Peter A. Beling,et al.  Simulating Kinect Infrared and Depth Images , 2016, IEEE Transactions on Cybernetics.

[24]  Autocorrelation analysis of spectral dependency of surface roughness speckle patterns , 2009, 2009 International Symposium on Optomechatronic Technologies.

[25]  Paul G. Maropoulos,et al.  Recent developments in large-scale dimensional metrology , 2009 .

[26]  Andreas Kolb,et al.  Kinect range sensing: Structured-light versus Time-of-Flight Kinect , 2015, Comput. Vis. Image Underst..

[27]  Chaoli Wang,et al.  Information Theory in Scientific Visualization , 2011, Entropy.

[28]  Guido M. Cortelazzo,et al.  Time-of-Flight Cameras and Microsoft Kinect™ , 2012, Springer Briefs in Electrical and Computer Engineering.

[29]  Derek D. Lichti,et al.  A method for automated registration of unorganised point clouds , 2008 .

[30]  Silvio Savarese,et al.  Automatic Targetless Extrinsic Calibration of a 3D Lidar and Camera by Maximizing Mutual Information , 2012, AAAI.

[31]  Christian Riess,et al.  A Comparative Error Analysis of Current Time-of-Flight Sensors , 2016, IEEE Transactions on Computational Imaging.

[32]  Margrit Betke,et al.  Information-conserving object recognition , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[33]  S. Kay Fundamentals of statistical signal processing: estimation theory , 1993 .

[34]  Jianxiong Xiao,et al.  Sliding Shapes for 3D Object Detection in Depth Images , 2014, ECCV.

[35]  Aamir Saeed Malik,et al.  Noise Robustness Analysis of Point Cloud Descriptors , 2013, ACIVS.

[36]  Josef A. Nossek,et al.  A Lower Bound for the Fisher Information Measure , 2013, IEEE Signal Processing Letters.

[37]  Vahid Montazerhodjat Photon-limited time of flight depth acquisition : new parametric model and its analysis , 2013 .

[38]  Peter A. Beling,et al.  Statistical Analysis-Based Error Models for the Microsoft Kinect™ Depth Sensor , 2014, Sensors.

[39]  Gary R. Bradski,et al.  Recognition and Pose Estimation of Rigid Transparent Objects with a Kinect Sensor , 2012, Robotics: Science and Systems.

[40]  Alan L. Yuille,et al.  Non-Rigid Point Set Registration by Preserving Global and Local Structures , 2016, IEEE Transactions on Image Processing.

[41]  Andriy Myronenko,et al.  Point Set Registration: Coherent Point Drift , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.