TerrainNet: Visual Modeling of Complex Terrain for High-speed, Off-road Navigation

Effective use of camera-based vision systems is essential for robust performance in autonomous off-road driving, particularly in the high-speed regime. Despite success in structured, on-road settings, current end-to-end approaches for scene prediction have yet to be successfully adapted for complex outdoor terrain. To this end, we present TerrainNet, a vision-based terrain perception system that predicts semantic and geometric terrain properties for aggressive off-road navigation. The approach relies on several key insights and practical considerations for achieving reliable terrain modeling. The network uses a multi-headed output representation to capture the fine- and coarse-grained terrain features necessary for estimating traversability. Accurate depth estimation is achieved using self-supervised depth completion with multi-view RGB and stereo inputs. Real-time inference is achieved using efficient, learned image feature projections. Furthermore, the model is trained on a large-scale, real-world off-road dataset collected across diverse outdoor environments. We show how TerrainNet can also be used for costmap prediction and provide a detailed framework for integration into a planning module. We demonstrate the performance of TerrainNet through extensive comparison to current state-of-the-art baselines for camera-only scene prediction. Finally, we showcase the effectiveness of integrating TerrainNet within a complete autonomous-driving stack by conducting a real-world vehicle test in a challenging off-road scenario.
