A 6-DOFs event-based camera relocalization system by CNN-LSTM and image denoising

Abstract At present, in the research of simultaneous localization and mapping systems, many traditional relocalization methods have been replaced by camera relocalization techniques based on convolutional neural network (CNN) and long and short-term memory (LSTM). However, in a system using an event dataset to train the neural network, the complex scenes are chaotic, and the noise of the event images is excessive. Both issues make the model unable to return to the six-degrees of freedom (6-DOFs) pose well. This paper proposes a 6-DOFs pose camera relocalization method based on the CNN image denoising model and CNN-LSTM. Firstly, the CNN image denoising model is used to solve the problem of excessive noise points in complex scenes. Then, a network framework combining CNN and LSTM trains the event camera relocalization model to obtain better 6-DOFs pose accuracy. Finally, the study performs experimental simulations by using complex scene datasets without and with denoising images. Experimental results show that the proposed method of camera relocalization has many advantages. It enhances the robustness of the model in the training process, reduces the mutation situation, and the trained model has a smaller error and faster speed when predicting the pose, thus improving the accuracy and real-time of the camera relocalization model.

[1]  Ian D. Reid,et al.  Automatic Relocalization and Loop Closing for Real-Time Monocular SLAM , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Juho Kannala,et al.  Camera Relocalization by Computing Pairwise Relative Poses Using Convolutional Neural Network , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[3]  Roberto Cipolla,et al.  Modelling uncertainty in deep learning for camera relocalization , 2015, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[4]  Young-Tack Park,et al.  Path-based reasoning approach for knowledge graph completion using CNN-BiLSTM with attention mechanism , 2020, Expert Syst. Appl..

[5]  Lei Zhang,et al.  FFDNet: Toward a Fast and Flexible Solution for CNN-Based Image Denoising , 2017, IEEE Transactions on Image Processing.

[6]  Daniel Cremers,et al.  Image-Based Localization Using LSTMs for Structured Feature Correlation , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[7]  Shan Gai,et al.  New image denoising algorithm via improved deep convolutional neural network with perceptive loss , 2019, Expert Syst. Appl..

[8]  Stamatios Lefkimmiatis,et al.  Universal Denoising Networks : A Novel CNN Architecture for Image Denoising , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[9]  Juan D. Tardós,et al.  Fast relocalisation and loop closing in keyframe-based SLAM , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[10]  Dongbing Gu,et al.  Indoor Relocalization in Challenging Environments With Dual-Stream Convolutional Neural Networks , 2018, IEEE Transactions on Automation Science and Engineering.

[11]  Lahcen Koutti,et al.  MultiD-CNN: A multi-dimensional feature learning approach based on deep convolutional networks for gesture recognition in RGB-D image sequences , 2020, Expert Syst. Appl..

[12]  Stefan Leutenegger,et al.  SemanticFusion: Dense 3D semantic mapping with convolutional neural networks , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[13]  Roberto Cipolla,et al.  Geometric Loss Functions for Camera Pose Regression with Deep Learning , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Alessandro Foi,et al.  Nonlocality-Reinforced Convolutional Neural Networks for Image Denoising , 2018, IEEE Signal Processing Letters.

[15]  Enrico Grosso,et al.  2D recurrent neural networks: a high-performance tool for robust visual tracking in dynamic scenes , 2017, Neural Computing and Applications.

[16]  Yong Liu,et al.  Parse geometry from a line: Monocular depth estimation with partial laser observation , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[17]  Roberto Cipolla,et al.  PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[18]  Dorian Gálvez-López,et al.  Bags of Binary Words for Fast Place Recognition in Image Sequences , 2012, IEEE Transactions on Robotics.

[19]  Xiaolin Hu,et al.  Delving deeper into convolutional neural networks for camera relocalization , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[20]  Wangmeng Zuo,et al.  Learning Deep CNN Denoiser Prior for Image Restoration , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Tobi Delbrück,et al.  The event-camera dataset and simulator: Event-based data for pose estimation, visual odometry, and SLAM , 2016, Int. J. Robotics Res..

[22]  Liang Lin,et al.  Multi-level Wavelet-CNN for Image Restoration , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[23]  Lei Zhang,et al.  Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising , 2016, IEEE Transactions on Image Processing.

[24]  Ender M. Eksioglu,et al.  Online dictionary learning algorithm with periodic updates and its application to image denoising , 2014, Expert Syst. Appl..

[25]  Paul Newman,et al.  Appearance-only SLAM at large scale with FAB-MAP 2.0 , 2011, Int. J. Robotics Res..