Momo: Monocular motion estimation on manifolds

Knowledge about the location of a vehicle is indispensable for autonomous driving. In order to apply global localisation methods, a pose prior must be known which can be obtained from visual odometry. The quality and robustness of that prior determine the success of localisation. Momo is a monocular frame-to-frame motion estimation methodology providing a high quality visual odometry for that purpose. By taking into account the motion model of the vehicle, reliability and accuracy of the pose prior are significantly improved. We show that especially in low-structure environments Momo outperforms the state of the art. Moreover, the method is designed so that multiple cameras with or without overlap can be integrated. The evaluation on the KITTI-dataset and on a proper multi-camera dataset shows that even with only 100–300 feature matches the prior is estimated with high accuracy and in real-time.

[1]  Richard I. Hartley,et al.  In Defense of the Eight-Point Algorithm , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Volker Willert,et al.  Flow-decoupled normalized reprojection error for visual odometry , 2016, 2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC).

[3]  Andreas Geiger,et al.  Vision meets robotics: The KITTI dataset , 2013, Int. J. Robotics Res..

[4]  Julius Ziegler,et al.  StereoScan: Dense 3d reconstruction in real-time , 2011, 2011 IEEE Intelligent Vehicles Symposium (IV).

[5]  Christoph Gustav Keller,et al.  Multi trajectory pose adjustment for life-long mapping , 2015, 2015 IEEE Intelligent Vehicles Symposium (IV).

[6]  Peter J. Rousseeuw,et al.  Robust regression and outlier detection , 1987 .

[7]  Andrew W. Fitzgibbon,et al.  Invariant fitting of two view geometry , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[9]  Wolfram Burgard,et al.  G2o: A general framework for graph optimization , 2011, 2011 IEEE International Conference on Robotics and Automation.

[10]  Marc Pollefeys,et al.  Motion Estimation for Self-Driving Cars with a Generalized Camera , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Alex Zelinsky,et al.  Learning OpenCV---Computer Vision with the OpenCV Library (Bradski, G.R. et al.; 2008)[On the Shelf] , 2009, IEEE Robotics & Automation Magazine.

[12]  F. Dellaert Factor Graphs and GTSAM: A Hands-on Introduction , 2012 .

[13]  David Nistér,et al.  An efficient solution to the five-point relative pose problem , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[15]  Ivan Petrovic,et al.  Stereo odometry based on careful feature selection and tracking , 2015, 2015 European Conference on Mobile Robots (ECMR).

[16]  Davide Scaramuzza,et al.  1-Point-RANSAC Structure from Motion for Vehicle-Mounted Cameras by Exploiting Non-holonomic Constraints , 2011, International Journal of Computer Vision.