论文信息 - Semantic 3D Mapping from Deep Image Segmentation

Semantic 3D Mapping from Deep Image Segmentation

The perception and identification of visual stimuli from the environment is a fundamental capacity of autonomous mobile robots. Current deep learning techniques make it possible to identify and segment objects of interest in an image. This paper presents a novel algorithm to segment the object’s space from a deep segmentation of an image taken by a 3D camera. The proposed approach solves the boundary pixel problem that appears when a direct mapping from segmented pixels to their correspondence in the point cloud is used. We validate our approach by comparing baseline approaches using real images taken by a 3D camera, showing that our method outperforms their results in terms of accuracy and reliability. As an application of the proposed algorithm, we present a semantic mapping approach for a mobile robot’s indoor environments.

Francisco Martín | José Miguel Guerrero | Jonatan Ginés | Fernando García González | Manuel Fernández

[1] Stefan Leutenegger,et al. SemanticFusion: Dense 3D semantic mapping with convolutional neural networks , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[2] Wolfram Burgard,et al. OctoMap: an efficient probabilistic 3D mapping framework based on octrees , 2013, Autonomous Robots.

[3] Jon Louis Bentley,et al. K-d trees for semidynamic point sets , 1990, SCG '90.

[4] Ali Farhadi,et al. YOLOv3: An Incremental Improvement , 2018, ArXiv.

[5] Ulrich Neumann,et al. Depth-aware CNN for RGB-D Segmentation , 2018, ECCV.

[6] Roberto Cipolla,et al. Bayesian SegNet: Model Uncertainty in Deep Convolutional Encoder-Decoder Architectures for Scene Understanding , 2015, BMVC.

[7] Hideo Saito,et al. Efficient Object-Oriented Semantic Mapping With Object Detector , 2019, IEEE Access.

[8] Timm Linder,et al. Accurate detection and 3D localization of humans using a novel YOLO-based RGB-D fusion approach and synthetic training data , 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[9] Thomas A. Funkhouser,et al. Semantic Scene Completion from a Single Depth Image , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10] Hideo Saito,et al. Fast and Accurate Semantic Mapping through Geometric-based Incremental Segmentation , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[11] Donald Meagher,et al. Geometric modeling using octree encoding , 1982, Comput. Graph. Image Process..

[12] Leonidas J. Guibas,et al. PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13] Roland Siegwart,et al. Volumetric Instance-Aware Semantic Mapping and 3D Object Discovery , 2019, IEEE Robotics and Automation Letters.

[14] Olaf Kähler,et al. InfiniTAM v3: A Framework for Large-Scale 3D Reconstruction with Loop Closure , 2017, ArXiv.

[15] Toon Goedemé,et al. Exploring RGB+Depth Fusion for Real-Time Object Detection , 2019, Sensors.

[16] Cyrill Stachniss,et al. Bonnet: An Open-Source Training and Deployment Framework for Semantic Segmentation in Robotics using CNNs , 2018, 2019 International Conference on Robotics and Automation (ICRA).