Start from minimum labeling: Learning of 3D object models and point labeling from a large and complex environment

A large category model base can provide object-level knowledge for various perception tasks of the intelligent vehicle system. The automatic and efficient construction of such a model base is highly desirable but challenging. This paper presents a novel semi-supervised approach to discover possible prototype models of 3D object structures from the point cloud of a large and complex environment, given a limited number of seeds in an object category. Our method incrementally trains the models while simultaneously collecting object samples. Considering the bias problem of model learning caused by bias accumulation in a sample collection, we propose to gradually differentiate the standard category model into several sub-category models to represent different intra-category structural styles. Thus, new sub-categories are discovered and modeled, old models are improved, and redundant models for similar structures are deleted iteratively during the learning process. This multiple-model strategy provides several interactive options for the category boundary to deal with the bias problem. Experimental results demonstrate the effectiveness and high efficiency of our approach to model mining from “big point cloud data”.

[1]  Ryosuke Shibasaki,et al.  SLAM in a dynamic large outdoor environment using a laser scanner , 2008, 2008 IEEE International Conference on Robotics and Automation.

[2]  Paul Newman,et al.  Parsing Outdoor Scenes from Streamed 3D Laser Data Using Online Clustering and Incremental Belief Updates , 2012, AAAI.

[3]  Martial Hebert,et al.  Contextual classification with functional Max-Margin Markov Networks , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Markus Vincze,et al.  A Global Hypotheses Verification Method for 3D Object Recognition , 2012, ECCV.

[5]  Wolfram Burgard,et al.  Unsupervised learning of 3D object models from partial views , 2009, 2009 IEEE International Conference on Robotics and Automation.

[6]  Dieter Fox,et al.  A large-scale hierarchical multi-view RGB-D object dataset , 2011, 2011 IEEE International Conference on Robotics and Automation.

[7]  Hongbin Zha,et al.  Scene understanding in a large dynamic environment through a laser-based sensing , 2010, 2010 IEEE International Conference on Robotics and Automation.

[8]  Martial Hebert,et al.  Onboard contextual classification of 3-D point clouds with learned high-order Markov Random Fields , 2009, 2009 IEEE International Conference on Robotics and Automation.

[9]  Bernt Schiele,et al.  3D Object Detection with Multiple Kinects , 2012, ECCV Workshops.

[10]  Xuan Song,et al.  Category Modeling from Just a Single Labeling: Use Depth Information to Guide the Learning of 2D Models , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Ben Taskar,et al.  Discriminative learning of Markov random fields for segmentation of 3D scan data , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[12]  Thorsten Joachims,et al.  Contextually guided semantic labeling and search for three-dimensional point clouds , 2013, Int. J. Robotics Res..

[13]  Dieter Fox,et al.  Toward object discovery and modeling via 3-D scene comparison , 2011, 2011 IEEE International Conference on Robotics and Automation.

[14]  Bin Yu,et al.  Model Selection and the Principle of Minimum Description Length , 2001 .

[15]  Dieter Fox,et al.  RGB-D object discovery via multi-scene analysis , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[16]  Martin Buss,et al.  Realtime segmentation of range data using continuous nearest neighbors , 2009, 2009 IEEE International Conference on Robotics and Automation.

[17]  Martial Hebert,et al.  Natural terrain classification using three‐dimensional ladar data for ground robot mobility , 2006, J. Field Robotics.

[18]  Dieter Fox,et al.  Detection-based object labeling in 3D scenes , 2012, 2012 IEEE International Conference on Robotics and Automation.

[19]  Fei-Fei Li,et al.  OPTIMOL: Automatic Online Picture Collection via Incremental Model Learning , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[20]  Dieter Fox,et al.  RGB-(D) scene labeling: Features and algorithms , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[21]  Bernt Schiele,et al.  Monocular 3D Scene Modeling and Inference: Understanding Multi-Object Traffic Scenes , 2010, ECCV.

[22]  Justus H. Piater,et al.  A Probabilistic Framework for 3D Visual Object Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Siddhartha S. Srinivasa,et al.  Structure discovery in multi-modal data: A region-based approach , 2011, 2011 IEEE International Conference on Robotics and Automation.

[24]  Dieter Fox,et al.  Sparse distance learning for object recognition combining RGB and depth information , 2011, 2011 IEEE International Conference on Robotics and Automation.

[25]  Paul Newman,et al.  A generative framework for fast urban labeling using spatial and temporal context , 2009, Auton. Robots.

[26]  Paul Newman,et al.  Semantic categorization of outdoor scenes with uncertainty estimates using multi-class gaussian process classification , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[27]  Wolfram Burgard,et al.  Unsupervised learning of compact 3D models based on the detection of recurrent structures , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[28]  Xuan Song,et al.  Learning Graph Matching: Oriented to Category Modeling from Cluttered Scenes , 2013, 2013 IEEE International Conference on Computer Vision.

[29]  Wolfram Burgard,et al.  Robust 3D scan point classification using associative Markov networks , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[30]  Dirk Wollherr,et al.  A clustering method for efficient segmentation of 3D laser data , 2008, 2008 IEEE International Conference on Robotics and Automation.

[31]  Xuan Song,et al.  Unsupervised 3D category discovery and point labeling from a large urban environment , 2013, 2013 IEEE International Conference on Robotics and Automation.

[32]  Derek Hoiem,et al.  Indoor Segmentation and Support Inference from RGBD Images , 2012, ECCV.

[33]  Dimitris N. Metaxas,et al.  D - Clutter: Building object model library from unsupervised segmentation of cluttered scenes , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[34]  Thorsten Joachims,et al.  Semantic Labeling of 3D Point Clouds for Indoor Scenes , 2011, NIPS.

[35]  Roland Siegwart,et al.  Unsupervised discovery of repetitive objects , 2010, 2010 IEEE International Conference on Robotics and Automation.

[36]  Vladimir G. Kim,et al.  Shape-based recognition of 3D point clouds in urban environments , 2009, 2009 IEEE 12th International Conference on Computer Vision.