Paper information - Dense 3D semantic mapping of indoor scenes from RGB-D images

Dense semantic segmentation of 3D point clouds is a challenging task. Many approaches address 2D semantic segmentation and obtain impressive results, and the availability of cheap RGB-D sensors has driven considerable progress in indoor semantic segmentation. Still, it remains unclear how best to approach 3D semantic segmentation. We propose a novel 2D-3D label transfer based on Bayesian updates and dense pairwise 3D Conditional Random Fields. This approach allows us to use 2D semantic segmentations to create a consistent 3D semantic reconstruction of indoor scenes. To this end, we also propose a fast 2D semantic segmentation approach based on Randomized Decision Forests. Furthermore, we show that it is not necessary to compute a semantic segmentation for every frame in a sequence in order to create accurate semantic 3D reconstructions. We evaluate our approach on both NYU Depth datasets and show that we can obtain a significant speed-up compared to other methods.
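The abstract names Bayesian updates as the mechanism for 2D-3D label transfer but gives no formula. The following is a minimal sketch of a generic recursive Bayesian update, not the authors' implementation. It assumes that each 3D point stores a class distribution, that every labeled frame contributes an independent per-class probability for the pixel projecting onto that point, and that the belief starts out uniform. The names `bayes_update` and `fuse_frames` are hypothetical and chosen only for illustration.

```python
from typing import List, Sequence


def bayes_update(prior: Sequence[float], likelihood: Sequence[float]) -> List[float]:
    """Multiply the stored belief by a new per-class observation and renormalize."""
    if len(prior) != len(likelihood):
        raise ValueError("prior and likelihood must cover the same classes")
    unnormalized = [p * l for p, l in zip(prior, likelihood)]
    total = sum(unnormalized)
    if total == 0.0:
        # Assumption: if the observation rules out every class the prior allows,
        # ignore it and keep the prior instead of dividing by zero.
        return list(prior)
    return [u / total for u in unnormalized]


def fuse_frames(num_classes: int, observations: Sequence[Sequence[float]]) -> List[float]:
    """Fold a sequence of per-frame class probabilities into one belief for a 3D point."""
    belief = [1.0 / num_classes] * num_classes  # assumed uniform initial belief
    for obs in observations:
        belief = bayes_update(belief, obs)
    return belief


if __name__ == "__main__":
    # Three frames observe the same 3D point; the class probabilities are made up.
    frames = [
        [0.6, 0.3, 0.1],
        [0.5, 0.4, 0.1],
        [0.2, 0.7, 0.1],
    ]
    belief = fuse_frames(3, frames)
    label = max(range(len(belief)), key=belief.__getitem__)
    print(label, belief)
```

Because the update only multiplies and renormalizes, it can be applied to whichever frames happen to carry a 2D segmentation. This is one way to read the abstract's claim that not every frame in a sequence needs to be labeled. The dense pairwise 3D CRF mentioned in the abstract would act on the fused beliefs as a separate smoothing step and is not shown here.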
