Solving Occlusion in Terrain Mapping with Neural Networks

Accurate and complete terrain maps enhance the awareness of autonomous robots and enable safe and optimal path planning. Rocks and topography often create occlusions and lead to missing elevation information in the Digital Elevation Map (DEM). Currently, mostly traditional inpainting techniques based on diffusion or patch-matching are used by autonomous mobile robots to fill-in incomplete DEMs. These methods cannot leverage the high-level terrain characteristics and the geometric constraints of line of sight we humans use intuitively to predict occluded areas. We propose to use neural networks to reconstruct the occluded areas in DEMs. We introduce a self-supervised learning approach capable of training on real-world data without a need for ground-truth information. We accomplish this by adding artificial occlusion to the incomplete elevation maps constructed on a real robot by performing ray casting. We first evaluate a supervised learning approach on synthetic data for which we have the full ground-truth available and subsequently move to several realworld datasets. These real-world datasets were recorded during autonomous exploration of both structured and unstructured terrain with a legged robot, and additionally in a planetary scenario on Lunar analogue terrain. We state a significant improvement compared to the Telea and Navier-Stokes baseline methods both on synthetic terrain and for the real-world datasets. Our neural network is able to run in real-time on both CPU and GPU with suitable sampling rates for autonomous ground robots.

[1]  Ting-Chun Wang,et al.  Image Inpainting for Irregular Holes Using Partial Convolutions , 2018, ECCV.

[2]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[3]  Rafael Bidarra,et al.  A Survey of Procedural Methods for Terrain Modelling , 2009 .

[4]  Gianfranco Visentin,et al.  HDPR: A Mobile Testbed Testbed for Current and Future Rover Technologies , 2016 .

[5]  Thomas S. Huang,et al.  Generative Image Inpainting with Contextual Attention , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[6]  Meng Wang,et al.  Semantic Image Inpainting with Progressive Generative Networks , 2018, ACM Multimedia.

[7]  Yanbing Xue,et al.  Iterative color-depth MST cost aggregation for stereo matching , 2016, 2016 IEEE International Conference on Multimedia and Expo (ICME).

[8]  Yanan Fan,et al.  Cosmo VAE: Variational Autoencoder for CMB Image Inpainting , 2020, 2020 International Joint Conference on Neural Networks (IJCNN).

[9]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[10]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[11]  Chao Yang,et al.  Contextual-Based Image Inpainting: Infer, Match, and Translate , 2017, ECCV.

[12]  Angela Dai,et al.  SG-NN: Sparse Generative Neural Networks for Self-Supervised Scene Completion of RGB-D Scans , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Tieyong Zeng,et al.  Explicit Coherence Enhancing Filter With Spatial Adaptive Elliptical Kernel , 2012, IEEE Signal Processing Letters.

[14]  Adam Finkelstein,et al.  PatchMatch: a randomized correspondence algorithm for structural image editing , 2009, SIGGRAPH 2009.

[15]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[16]  Linwei Yue,et al.  Void Filling of Digital Elevation Models with a Terrain Texture Learning Model Based on Generative Adversarial Networks , 2019, Remote. Sens..

[17]  Bo Du,et al.  Progressive Reconstruction of Visual Structure for Image Inpainting , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[18]  Evelyn Lunasin,et al.  The Navier–Stokes–Voight model for image inpainting , 2009, 0901.4548.

[19]  Sebastian Thrun,et al.  Lifelong robot learning , 1993, Robotics Auton. Syst..

[20]  Marc Pollefeys,et al.  Real-time stereo matching failure prediction and resolution using orthogonal stereo setups , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[21]  Gianfranco Visentin,et al.  Habitability on Early Mars and the Search for Biosignatures with the ExoMars Rover , 2017, Astrobiology.

[22]  Omar ElHarrouss,et al.  Image Inpainting: A Review , 2019, Neural Processing Letters.

[23]  Leon A. Gatys,et al.  A Neural Algorithm of Artistic Style , 2015, ArXiv.

[24]  Marco Hutter,et al.  Probabilistic Terrain Mapping for Mobile Robots With Uncertain Localization , 2018, IEEE Robotics and Automation Letters.

[25]  Gözde B. Ünal,et al.  Patch-Based Image Inpainting with Generative Adversarial Networks , 2018, ArXiv.

[26]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[27]  Winston H. Hsu,et al.  Free-Form Video Inpainting With 3D Gated Convolution and Temporal PatchGAN , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[28]  Bo Dai,et al.  Self-Supervised Scene De-Occlusion , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  Ning Wang,et al.  Multistage attention network for image inpainting , 2020, Pattern Recognit..

[30]  Peter Fankhauser,et al.  ANYmal - a highly mobile and dynamic quadrupedal robot , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[31]  Andrew Y. Ng,et al.  Stereo vision and terrain modeling for quadruped robots , 2009, 2009 IEEE International Conference on Robotics and Automation.

[32]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[33]  Kostas Alexis,et al.  Graph‐based subterranean exploration path planning using aerial and legged robots , 2020, J. Field Robotics.

[34]  Beatriz L. Boada,et al.  Traversable Region Modeling for Outdoor Navigation , 2005, J. Intell. Robotic Syst..

[35]  R. Siegwart,et al.  ROBOT-CENTRIC ELEVATION MAPPING WITH UNCERTAINTY ESTIMATES , 2014 .

[36]  Peter Fankhauser,et al.  ANYmal - toward legged robots for harsh environments , 2017, Adv. Robotics.

[37]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[38]  Alexandru Telea,et al.  An Image Inpainting Technique Based on the Fast Marching Method , 2004, J. Graphics, GPU, & Game Tools.

[39]  Guillermo Sapiro,et al.  Navier-stokes, fluid dynamics, and image and video inpainting , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[40]  Ares Lagae,et al.  A Survey of Procedural Noise Functions , 2010, Comput. Graph. Forum.

[41]  Shahram Izadi,et al.  Modeling Kinect Sensor Noise for Improved 3D Reconstruction and Tracking , 2012, 2012 Second International Conference on 3D Imaging, Modeling, Processing, Visualization & Transmission.

[42]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[43]  Levin Gerdes,et al.  SLAM for autonomous planetary rovers with global localization , 2020, J. Field Robotics.