Keyframe-based visual–inertial odometry using nonlinear optimization

Combining visual and inertial measurements has become popular in mobile robotics, since the two sensing modalities offer complementary characteristics that make them the ideal choice for accurate visual–inertial odometry or simultaneous localization and mapping (SLAM). While historically the problem has been addressed with filtering, advancements in visual estimation suggest that nonlinear optimization offers superior accuracy while remaining tractable in complexity thanks to the sparsity of the underlying problem. Taking inspiration from these findings, we formulate a rigorously probabilistic cost function that combines reprojection errors of landmarks and inertial terms. The problem is kept tractable, thus ensuring real-time operation, by limiting the optimization to a bounded window of keyframes through marginalization. Keyframes may be spaced in time by arbitrary intervals while still being related by linearized inertial terms. We present evaluation results on complementary datasets recorded with our custom-built stereo visual–inertial hardware, which accurately synchronizes accelerometer and gyroscope measurements with imagery. We compare both a stereo and a monocular version of our algorithm, with and without online extrinsics estimation, against ground truth. Furthermore, we compare the performance to an implementation of a state-of-the-art stochastic cloning sliding-window filter. This competitive reference implementation performs tightly coupled filtering-based visual–inertial odometry. While our approach admittedly demands more computation, we show its superior performance in terms of accuracy.
