DeepTIO: A Deep Thermal-Inertial Odometry With Visual Hallucination

Visual odometry shows excellent performance in a wide range of environments. However, in visually-denied scenarios (e.g. heavy smoke or darkness), pose estimates degrade or even fail. Thermal cameras are commonly used for perception and inspection when the environment has low visibility. However, their use in odometry estimation is hampered by the lack of robust visual features. In part, this is as a result of the sensor measuring the ambient temperature profile rather than scene appearance and geometry. To overcome this issue, we propose a Deep Neural Network model for thermal-inertial odometry (DeepTIO) by incorporating a visual hallucination network to provide the thermal network with complementary information. The hallucination network is taught to predict fake visual features from thermal images by using Huber loss. We also employ selective fusion to attentively fuse the features from three different modalities, i.e thermal, hallucination, and inertial features. Extensive experiments are performed in hand-held and mobile robot data in benign and smoke-filled environments, showing the efficacy of the proposed model.

[1]  Bo Yang,et al.  DeepPCO: End-to-End Point Cloud Odometry through Deep Parallel Neural Network , 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[2]  Yasin Almalioglu,et al.  Distilling Knowledge From a Deep Pose Regressor Network , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[3]  Dongbing Gu,et al.  SGANVO: Unsupervised Deep Visual Odometry and Depth Estimation With Stacked Generative Adversarial Networks , 2019, IEEE Robotics and Automation Letters.

[4]  Sen Wang,et al.  Learning Monocular Visual Odometry through Geometry-Aware Curriculum Learning , 2019, 2019 International Conference on Robotics and Automation (ICRA).

[5]  Fredrik Gustafsson,et al.  Zero-Velocity Detection—A Bayesian Approach to Adaptive Thresholding , 2019, IEEE Sensors Letters.

[6]  Wei Wu,et al.  Selective Sensor Fusion for Neural Visual-Inertial Odometry , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Christos Papachristos,et al.  Keyframe-based Direct Thermal–Inertial Odometry , 2019, 2019 International Conference on Robotics and Automation (ICRA).

[8]  Christos Papachristos,et al.  Visual-Thermal Landmarks and Inertial Fusion for Navigation in Degraded Visual Environments , 2019, 2019 IEEE Aerospace Conference.

[9]  W. Ritter,et al.  Seeing Through Fog Without Seeing Fog: Deep Multimodal Sensor Fusion in Unseen Adverse Weather , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  W. Ritter,et al.  Seeing Through Fog Without Seeing Fog: Deep Sensor Fusion in the Absence of Labeled Training Data , 2019, ArXiv.

[11]  Yasin Almalioglu,et al.  GANVO: Unsupervised Deep Monocular Visual Odometry and Depth Estimation with Generative Adversarial Networks , 2018, 2019 International Conference on Robotics and Automation (ICRA).

[12]  Jörg Stückler,et al.  Deep Virtual Stereo Odometry: Leveraging Deep Depth Prediction for Monocular Direct Sparse Odometry , 2018, ECCV.

[13]  Christos Papachristos,et al.  Thermal-Inertial Localization for Autonomous Navigation of Aerial Robots through Obscurants , 2018, 2018 International Conference on Unmanned Aircraft Systems (ICUAS).

[14]  Wolfram Burgard,et al.  VLocNet++: Deep Multitask Learning for Semantic Visual Localization and Odometry , 2018, IEEE Robotics and Automation Letters.

[15]  Ian D. Reid,et al.  Unsupervised Learning of Monocular Depth Estimation and Visual Odometry with Deep Feature Reconstruction , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[16]  Wolfram Burgard,et al.  Deep Auxiliary Learning for Visual Localization and Odometry , 2018, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[17]  Muhamad Risqi U. Saputra,et al.  Visual SLAM and Structure from Motion in Dynamic Environments , 2018, ACM Comput. Surv..

[18]  Agathoniki Trigoni,et al.  IONet: Learning to Cure the Curse of Drift in Inertial Odometry , 2018, AAAI.

[19]  Karthik Ramani,et al.  Learning Hand Articulations by Hallucinating Heat Distribution , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[20]  Dongbing Gu,et al.  UnDeepVO: Monocular Visual Odometry Through Unsupervised Deep Learning , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[21]  Shaojie Shen,et al.  VINS-Mono: A Robust and Versatile Monocular Visual-Inertial State Estimator , 2017, IEEE Transactions on Robotics.

[22]  Noah Snavely,et al.  Unsupervised Learning of Depth and Ego-Motion from Video , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Sen Wang,et al.  VINet: Visual-Inertial Odometry as a Sequence-to-Sequence Learning Problem , 2017, AAAI.

[24]  Sen Wang,et al.  DeepVO: Towards end-to-end visual odometry with deep Recurrent Convolutional Neural Networks , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[25]  Guillermo Sapiro,et al.  Not Afraid of the Dark: NIR-VIS Face Recognition via Cross-Spectral Hallucination and Low-Rank Embedding , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Trevor Darrell,et al.  Learning with Side Information through Modality Hallucination , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Christoforos Kanellakis,et al.  Evaluation of visual localization systems in underground mining , 2016, 2016 24th Mediterranean Conference on Control and Automation (MED).

[28]  Paulo Vinicius Koerich Borges,et al.  Practical Infrared Visual Odometry , 2016, IEEE Transactions on Intelligent Transportation Systems.

[29]  Roland Siegwart,et al.  Robust visual inertial odometry using a direct EKF-based approach , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[30]  Nabil Aouf,et al.  Thermal Stereo Odometry for UAVs , 2015, IEEE Sensors Journal.

[31]  Thomas Brox,et al.  FlowNet: Learning Optical Flow with Convolutional Networks , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[32]  J. M. M. Montiel,et al.  ORB-SLAM: A Versatile and Accurate Monocular SLAM System , 2015, IEEE Transactions on Robotics.

[33]  Davide Scaramuzza,et al.  SVO: Fast semi-direct monocular visual odometry , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[34]  Mohammadreza Aghaei,et al.  Light Unmanned Aerial Vehicles (UAVs) for Cooperative Inspection of PV Plants , 2014, IEEE Journal of Photovoltaics.

[35]  Daibing Zhang,et al.  Camera Calibration of Thermal-Infrared Stereo Vision System , 2013, 2013 Fourth International Conference on Intelligent Systems Design and Engineering Applications.

[36]  Wolfram Burgard,et al.  A benchmark for the evaluation of RGB-D SLAM systems , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[37]  J. Underwood,et al.  Towards reliable perception for Unmanned Ground Vehicles in challenging conditions , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[38]  Amir Averbuch,et al.  Scene based non-uniformity correction in thermal images using Kalman filter , 2007, Image Vis. Comput..

[39]  Les E. Atlas,et al.  Recurrent neural networks and robust time series prediction , 1994, IEEE Trans. Neural Networks.

[40]  Angel Domingo Sappa,et al.  A Visible-Thermal Fusion Based Monocular Visual Odometry , 2015, ROBOT.

[41]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[42]  James R. Bergen,et al.  Visual odometry , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[43]  Frederick R. Forst,et al.  On robust estimation of the location parameter , 1980 .