3-D scene analysis via sequenced predictions over points and regions

We address the problem of understanding scenes from 3-D laser scans via per-point assignment of semantic labels. In order to mitigate the difficulties of using a graphical model for modeling the contextual relationships among the 3-D points, we instead propose a multi-stage inference procedure to capture these relationships. More specifically, we train this procedure to use point cloud statistics and learn relational information (e.g., tree-trunks are below vegetation) over fine (point-wise) and coarse (region-wise) scales. We evaluate our approach on three different datasets, that were obtained from different sensors, and demonstrate improved performance.

[1]  David H. Wolpert,et al.  Stacked generalization , 1992, Neural Networks.

[2]  Martial Hebert,et al.  Efficient multiple model recognition in cluttered 3-D scenes , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[3]  Mi-Suen Lee,et al.  A Computational Framework for Segmentation and Grouping , 2000 .

[4]  Miguel Á. Carreira-Perpiñán,et al.  Multiscale conditional random fields for image labeling , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[5]  Vladimir Kolmogorov,et al.  An Experimental Comparison of Min-Cut/Max-Flow Algorithms for Energy Minimization in Vision , 2004, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Ben Taskar,et al.  Discriminative learning of Markov random fields for segmentation of 3D scan data , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[7]  William W. Cohen,et al.  Stacked Sequential Learning , 2005, IJCAI.

[8]  Gérard G. Medioni,et al.  A voting-based computational framework for visual motion analysis and interpretation , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Martial Hebert,et al.  Natural terrain classification using three‐dimensional ladar data for ground robot mobility , 2006, J. Field Robotics.

[10]  Fernando Pereira,et al.  Structured Learning with Approximate Inference , 2007, NIPS.

[11]  Martial Hebert,et al.  Data Structures for Efficient Dynamic Processing in 3-D , 2007, Int. J. Robotics Res..

[12]  Sergei Vassilvitskii,et al.  k-means++: the advantages of careful seeding , 2007, SODA '07.

[13]  Hal Daumé,et al.  Frustratingly Easy Domain Adaptation , 2007, ACL.

[14]  Kostas Daniilidis,et al.  Object Detection from Large-Scale 3D Datasets Using Bottom-Up and Top-Down Descriptors , 2008, ECCV.

[15]  Nico Blodow,et al.  Towards 3D Point cloud based object maps for household environments , 2008, Robotics Auton. Syst..

[16]  Dieter Fox,et al.  3D laser scan classification using web data and domain adaptation , 2009, Robotics: Science and Systems.

[17]  Martial Hebert,et al.  Contextual classification with functional Max-Margin Markov Networks , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[18]  James J. Little,et al.  A Hybrid Conditional Random Field for Estimating the Underlying Ground Surface From Airborne LiDAR Data , 2009, IEEE Transactions on Geoscience and Remote Sensing.

[19]  Daniel Huber,et al.  Using Context to Create Semantic 3D Models of Indoor Environments , 2010, BMVC.

[20]  Martial Hebert,et al.  Stacked Hierarchical Labeling , 2010, ECCV.

[21]  I. Stamos,et al.  Sequential Classification in Point Clouds of Urban Scenes , 2010 .

[22]  Surya P. N. Singh,et al.  A Pipeline for the Segmentation and Classification of 3D Point Clouds , 2010, ISER.

[23]  O. Barinova,et al.  NON-ASSOCIATIVE MARKOV NETWORKS FOR 3D POINT CLOUD CLASSIFICATION , 2010 .

[24]  Armin B. Cremers,et al.  Learning to hash logistic regression for fast 3D scan point classification , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[25]  Ioannis Stamos,et al.  Online Algorithms for Classification of Urban Objects in 3D Point Clouds , 2012, 2012 Second International Conference on 3D Imaging, Modeling, Processing, Visualization & Transmission.