Rethinking the Fourier-Mellin Transform: Multiple Depths in the Camera's View

Remote sensing and robotics often rely on visual odometry (VO) for localization. Many standard approaches for VO use feature detection. However, these methods will meet challenges if the environments are feature-deprived or highly repetitive. Fourier-Mellin Transform (FMT) is an alternative VO approach that has been shown to show superior performance in these scenarios and is often used in remote sensing. One limitation of FMT is that it requires an environment that is equidistant to the camera, i.e., single-depth. To extend the applications of FMT to multi-depth environments, this paper presents the extended Fourier-Mellin Transform (eFMT), which maintains the advantages of FMT with respect to feature-deprived scenarios. To show the robustness and accuracy of eFMT, we implement an eFMT-based visual odometry framework and test it in toy examples and a large-scale drone dataset. All these experiments are performed on data collected in challenging scenarios, such as, trees, wooden boards and featureless roofs. The results show that eFMT performs better than FMT in the multi-depth settings. Moreover, eFMT also outperforms state-of-the-art VO algorithms, such as ORB-SLAM3, SVO and DSO, in our experiments.

[1]  Cyrill Stachniss,et al.  WeedMap: A large-scale semantic weed mapping framework using aerial multispectral imaging and deep neural network for precision farming , 2018, Remote. Sens..

[2]  Uwe Stilla,et al.  PRECISE DISPARITY ESTIMATION FOR NARROW BASELINE STEREO BASED ON MULTISCALE SUPERPIXELS AND PHASE CORRELATION , 2019 .

[3]  Tim Kazik,et al.  Visual odometry based on the Fourier-Mellin transform for a rover using a monocular ground-facing camera , 2011, 2011 IEEE International Conference on Mechatronics.

[4]  Daniel Cremers,et al.  LSD-SLAM: Large-Scale Direct Monocular SLAM , 2014, ECCV.

[5]  Alessandro Rizzi,et al.  Unsupervised matching of visual landmarks for robotic homing using Fourier-Mellin transform , 2002, Robotics Auton. Syst..

[6]  Qian Huang,et al.  A review of monocular visual odometry , 2019, The Visual Computer.

[7]  Andreas Birk,et al.  Robust estimation of camera-tilt for iFMI based underwater photo-mapping using a calibrated monocular camera , 2013, 2013 IEEE International Conference on Robotics and Automation.

[8]  Michel Defrise,et al.  Symmetric Phase-Only Matched Filtering of Fourier-Mellin Transforms for Image Registration and Recognition , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Andreas Birk,et al.  On the effects of Sampling Resolution in Improved Fourier Mellin based Registration for Underwater Mapping , 2010 .

[10]  Yongjun Zhang,et al.  A novel extended phase correlation algorithm based on Log-Gabor filtering for multimodal remote sensing image registration , 2019, International Journal of Remote Sensing.

[11]  Arturo Gomez Chavez,et al.  Improved Fourier Mellin Invariant for Robust Rotation Estimation with Omni-Cameras , 2019, 2019 IEEE International Conference on Image Processing (ICIP).

[12]  Heinrich H. Bülthoff,et al.  Nonlinear ego-motion estimation from optical flow for online control of a quadrotor UAV , 2015, Int. J. Robotics Res..

[13]  Roland Chapuis,et al.  Radar Scan Matching SLAM Using the Fourier-Mellin Transform , 2009, FSR.

[14]  Titus Cieslewski,et al.  Are We Ready for Autonomous Drone Racing? The UZH-FPV Drone Racing Dataset , 2019, 2019 International Conference on Robotics and Automation (ICRA).

[15]  Luis Payá,et al.  Performance of Global-Appearance Descriptors in Map Building and Localization Using Omnidirectional Vision , 2014, Sensors.

[16]  Qingwen Xu,et al.  Pose Estimation for Omni-directional Cameras using Sinusoid Fitting , 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[17]  Michael Gassner,et al.  SVO: Semidirect Visual Odometry for Monocular and Multicamera Systems , 2017, IEEE Transactions on Robotics.

[18]  Andreas Birk,et al.  Scale-Free Registrations in 3D: 7 Degrees of Freedom with Fourier Mellin SOFT Transforms , 2018, International Journal of Computer Vision.

[19]  Luis Payá,et al.  Using Omnidirectional Vision to Create a Model of the Environment: A Comparative Evaluation of Global-Appearance Descriptors , 2016, J. Sensors.

[20]  Hongxia Wang,et al.  Fourier-Mellin Transform and Fractal Coding for Secure and Robust Fingerprint Image Hashing , 2018, 2018 15th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[21]  Jean-Paul Gauthier,et al.  Harmonic Analysis : Motions and Pattern Analysis on Motion Groups and Their Homogeneous Spaces , 2004 .

[22]  Davide Scaramuzza,et al.  A Tutorial on Quantitative Trajectory Evaluation for Visual(-Inertial) Odometry , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[23]  Luca Lucchese Estimating affine transformations in the frequency domain , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[24]  Roland Siegwart,et al.  The EuRoC micro aerial vehicle datasets , 2016, Int. J. Robotics Res..

[25]  Andreas Birk,et al.  Online generation of an underwater photo map with improved Fourier Mellin based registration , 2009, OCEANS 2009-EUROPE.

[26]  Friedrich Fraundorfer,et al.  Visual Odometry Part I: The First 30 Years and Fundamentals , 2022 .

[27]  Daniel Cremers,et al.  Direct Sparse Odometry , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  Riadh Abdelfattah,et al.  InSAR image co‐registration using the Fourier–Mellin transform , 2005 .

[29]  Roland Göcke,et al.  Optical flow estimation using Fourier Mellin Transform , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[30]  James R. Bergen,et al.  Visual odometry , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[31]  B. N. Chatterji,et al.  An FFT-based technique for translation, rotation, and scale-invariant image registration , 1996, IEEE Trans. Image Process..

[32]  Sertac Karaman,et al.  The Blackbird Dataset: A large-scale dataset for UAV perception in aggressive flight , 2018, ISER.

[33]  Bisheng Yang,et al.  Autonomous Vehicle Localization with Prior Visual Point Cloud Map Constraints in GNSS-Challenged Environments , 2021, Remote. Sens..

[34]  Andreas Birk,et al.  Fast and robust photomapping with an Unmanned Aerial Vehicle (UAV) , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[35]  Davide Scaramuzza,et al.  The Zurich urban micro aerial vehicle dataset , 2017, Int. J. Robotics Res..

[36]  Yong Zhao,et al.  Map2DFusion: Real-time incremental UAV image mosaicing based on monocular SLAM , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[37]  Andreas Birk,et al.  Large-scale mosaicking with spectral registration based simultaneous localization and mapping (iFMI-SLAM) in the Ligurian Sea , 2013, 2013 MTS/IEEE OCEANS - Bergen.

[38]  Yinan Lu,et al.  An Application of Fourier-Mellin Transform in Image Registration , 2005, The Fifth International Conference on Computer and Information Technology (CIT'05).

[39]  Richard Elvira,et al.  ORB-SLAM3: An Accurate Open-Source Library for Visual, Visual–Inertial, and Multimap SLAM , 2021, IEEE Transactions on Robotics.

[40]  Jacek Turski Projective Fourier analysis for patterns , 2000, Pattern Recognit..

[41]  Francisco Argüello,et al.  Fourier–Mellin registration of two hyperspectral images , 2017 .

[42]  Arturo Gomez Chavez,et al.  A Divide and Conquer Method for 3D Registration of Inhomogeneous, Partially Overlapping Scans with Fourier Mellin SOFT (FMS) , 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[43]  Qian Du,et al.  Image Registration With Fourier-Based Image Correlation: A Comprehensive Review of Developments and Applications , 2019, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.