Visual Odometry in Dynamic Environments with Geometric Multi-layer Optimisation

This paper presents a novel approach for optimising visual odometry results in dynamic outdoor environments. Egomotion estimation is still considered one of the more difficult tasks in computer vision because of its sequential computation pipeline: every phase of visual odometry can be a source of noise or errors that propagate to later results. Tracking features in a dynamic environment is also very challenging. Since feature tracking matches features at integer pixel coordinates only, information is lost at the sub-pixel level. In this paper we introduce a weighting scheme that measures the geometric relations between different layers: we divide tracked features into three groups based on geometric constraints, and each group is treated as a “layer”. Each layer has a weight that depends on the distribution of its grouped features in the 2D image and on their actual positions in 3D scene coordinates. This geometric multi-layer approach can effectively remove all dynamic features in the scene and provide more reliable feature tracking results. Moreover, we propose a 3-state Kalman filter optimisation approach. Our method follows the traditional process of visual odometry algorithms by focusing on motion estimation between pairs of consecutive frames. Experiments and evaluations are carried out for trajectory estimation. We use the provided ground truth of the KITTI datasets to analyse mean rotation and translation errors over distance.
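The abstract does not say which three quantities make up the Kalman filter state, so the following is only a minimal sketch of what a 3-state linear Kalman filter could look like when used to smooth per-frame motion estimates. It assumes, purely for illustration, that the state is a 3-component motion vector (for example the frame-to-frame translation), that the transition and measurement models are both the identity, and that noise is independent per component. The class name `DiagonalKalman3`, the helper `smooth`, and both noise variances are hypothetical and not taken from the paper.

```python
from dataclasses import dataclass, field
from typing import List, Sequence


@dataclass
class DiagonalKalman3:
    """Linear Kalman filter over a 3-component state with diagonal covariances.

    Assumptions (not from the paper): identity state transition, identity
    measurement model, and independent noise for each component.
    """
    process_var: float = 1e-3      # hypothetical process noise variance
    measurement_var: float = 1e-1  # hypothetical measurement noise variance
    state: List[float] = field(default_factory=lambda: [0.0, 0.0, 0.0])
    var: List[float] = field(default_factory=lambda: [1.0, 1.0, 1.0])

    def update(self, measurement: Sequence[float]) -> List[float]:
        """Run one predict/correct cycle and return the filtered state."""
        if len(measurement) != 3:
            raise ValueError("expected a 3-component measurement")
        for i, z in enumerate(measurement):
            # Predict: the state is carried over unchanged, uncertainty grows.
            predicted_var = self.var[i] + self.process_var
            # Correct: blend prediction and measurement using the Kalman gain.
            gain = predicted_var / (predicted_var + self.measurement_var)
            self.state[i] += gain * (z - self.state[i])
            self.var[i] = (1.0 - gain) * predicted_var
        return list(self.state)


def smooth(measurements: Sequence[Sequence[float]]) -> List[List[float]]:
    """Filter a sequence of 3-component motion estimates, one per frame pair."""
    kf = DiagonalKalman3()
    return [kf.update(m) for m in measurements]
```

With diagonal covariances the filter decouples into three scalar filters, which keeps the sketch dependency-free. A filter with coupled states or a non-trivial motion model would need full matrix updates instead.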
