Comparative study on real-time pose estimation of vision-based unmanned underwater vehicles

Background: Navigation and localization are key to the successful execution of autonomous unmanned underwater vehicles (UUVs) in marine environmental monitoring, underwater 3D mapping, and ocean resource surveys. The estimation of the position and the orientation of autonomous UUVs are a long-standing challenging and fundamental problem. As one of the underwater sensors, camera has always been the focus of attention due to its advantages of low cost and rich content information in visibility waters, especially in the fields of visual perception of the underwater environment, target recognition and tracking. At present, the visual real-time pose estimation technology that can be used for UUVs is mainly divided into geometry-based visual positioning algorithms and deep learning-based visual positioning algorithms. Methods: In order to compare the performance of different positioning algorithms and strategies, this paper uses C++ and python, takes the ORB-SLAM3 algorithm and DF-VO algorithm as representatives to conduct a comparative experiment and analysis. Results: The geometry-based algorithm ORB-SLAM3 is less affected by illumination, performs more stably in different underwater environments, and has a shorter calculation time, but its robustness is poor in complex environments. The visual positioning algorithm DF-VO based on deep learning takes longer time to compute, and the positioning accuracy is more easily affected by illumination, especially in dark conditions. However, its robustness is better in unstructured environments such as large-scale image rotation and dynamic object interference. Conclusions: In general, the deep learning-based algorithm is more robust, but multiple deep learning networks make it need more time to compute. The geometry-based method costs less time and is more accurate in low-light and turbid underwater conditions. However, in real underwater situations, these two methods can be connected as binocular vision or methods of multi-sensor combined pose estimation.

[1]  Deren Li,et al.  A Survey on Visual Navigation and Positioning for Autonomous UUVs , 2022, Remote. Sens..

[2]  V. Semkin,et al.  Threats from and Countermeasures for Unmanned Aerial and Underwater Vehicles , 2022, Sensors.

[3]  Meng Li,et al.  REAL-TIME POSITIONING AND TRACKING FOR VISION-BASED UNMANNED UNDERWATER VEHICLES , 2022, The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences.

[4]  Kai Sun,et al.  Review of Underwater Sensing Technologies and Applications , 2021, Sensors.

[5]  Y. Pétillot,et al.  AUV localisation: a review of passive and active techniques , 2021, International Journal of Intelligent Robotics and Applications.

[6]  Steve Macenski,et al.  A Comparison of Modern General-Purpose Visual SLAM Approaches , 2021, 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[7]  Ruizhi Chen,et al.  VNLSTM-PoseNet: A novel deep ConvNet for real-time 6-DOF camera relocalization in urban streets , 2021, Geo spatial Inf. Sci..

[8]  Iestyn Pierce,et al.  Tracking the fine scale movements of fish using autonomous maritime robotics: A systematic state of the art review , 2021 .

[9]  Ian Reid,et al.  DF-VO: What Should Be Learnt for Visual Odometry? , 2021, ArXiv.

[10]  Jun Liu,et al.  Tracking and Localization based on Multi-angle Vision for Underwater Target , 2020, Electronics.

[11]  Sanjoy Das,et al.  Localization schemes for Underwater Acoustic Sensor Networks - A Review , 2020, Comput. Sci. Rev..

[12]  Carlos Campos,et al.  ORB-SLAM3: An Accurate Open-Source Library for Visual, Visual–Inertial, and Multimap SLAM , 2020, IEEE Transactions on Robotics.

[13]  Alice Newton,et al.  Contribution of Remote Sensing Technologies to a Holistic Coastal and Marine Environmental Management Framework: A Review , 2020, Remote. Sens..

[14]  Ming Li,et al.  Accumulative Errors Optimization for Visual Odometry of ORB-SLAM2 Based on RGB-D Cameras , 2019, ISPRS Int. J. Geo Inf..

[15]  Na Li,et al.  Improved SLAM closed-loop detection algorithm based on DBoW2 , 2019, Journal of Physics: Conference Series.

[16]  Daoliang Li,et al.  Survey of underwater robot positioning navigation , 2019, Applied Ocean Research.

[17]  A. Burguera,et al.  A Trajectory-Based Approach to Multi-Session Underwater Visual SLAM Using Global Image Signatures , 2019, Journal of Marine Science and Engineering.

[18]  Chunhua Shen,et al.  Unsupervised Scale-consistent Depth and Ego-motion Learning from Monocular Video , 2019, NeurIPS.

[19]  Dongbing Gu,et al.  Ongoing Evolution of Visual SLAM from Geometry to Deep Learning: Challenges and Opportunities , 2018, Cognitive Computation.

[20]  Jörg Stückler,et al.  Deep Virtual Stereo Odometry: Leveraging Deep Depth Prediction for Monocular Direct Sparse Odometry , 2018, ECCV.

[21]  Gabriel J. Brostow,et al.  Digging Into Self-Supervised Monocular Depth Estimation , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[22]  Michael J. Black,et al.  Competitive Collaboration: Joint Unsupervised Learning of Depth, Camera Motion, Optical Flow and Motion Segmentation , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Ian D. Reid,et al.  Unsupervised Learning of Monocular Depth Estimation and Visual Odometry with Deep Feature Reconstruction , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[24]  Zhichao Yin,et al.  GeoNet: Unsupervised Learning of Dense Depth, Optical Flow and Camera Pose , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[25]  Dongbing Gu,et al.  UnDeepVO: Monocular Visual Odometry Through Unsupervised Deep Learning , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[26]  Noah Snavely,et al.  Unsupervised Learning of Depth and Ego-Motion from Video , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Federico Tombari,et al.  CNN-SLAM: Real-Time Dense Monocular SLAM with Learned Depth Prediction , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Antonio Vasilijević,et al.  Coordinated Navigation of Surface and Underwater Marine Robotic Vehicles for Ocean Sampling and Environmental Monitoring , 2017, IEEE/ASME Transactions on Mechatronics.

[29]  J. M. M. Montiel,et al.  ORB-SLAM: A Versatile and Accurate Monocular SLAM System , 2015, IEEE Transactions on Robotics.

[30]  Francesc Moreno-Noguer,et al.  Exhaustive Linearization for Robust Camera Pose and Focal Length Estimation , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  Sudip Misra,et al.  The effects of variable sound speed on localization in Underwater Sensor Networks , 2011, 2011 Australasian Telecommunication Networks and Applications Conference (ATNAC).

[32]  Gary R. Bradski,et al.  ORB: An efficient alternative to SIFT or SURF , 2011, 2011 International Conference on Computer Vision.

[33]  Winston K. G. Seah,et al.  A Survey of Techniques and Challenges in Underwater Localization , 2011 .

[34]  V. Lepetit,et al.  EPnP: An Accurate O(n) Solution to the PnP Problem , 2009, International Journal of Computer Vision.

[35]  Richard I. Hartley,et al.  Optimised KD-trees for fast image descriptor matching , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[36]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[37]  David Nistér,et al.  Scalable Recognition with a Vocabulary Tree , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[38]  T. Nicosevici,et al.  A review of sensor fusion techniques for underwater vehicle navigation , 2004, Oceans '04 MTS/IEEE Techno-Ocean '04 (IEEE Cat. No.04CH37600).

[39]  Jin Qin,et al.  LARGER RECEPTIVE FIELD BASED RGB VISUAL RELOCALIZATION METHOD USING CONVOLUTIONAL NETWORK , 2021 .

[40]  Sajad Saeedi,et al.  AUV Navigation and Localization: A Review , 2014, IEEE Journal of Oceanic Engineering.