Multi modal pose fusion for monocular flight with unmanned aerial vehicles

Pose estimation using a monocular camera is an area of active research in visual inertial unmanned aerial vehicles. Some of the common methods involve fusing pose from two sources using a kalman filter or a lie algebraic representation for solving the global motion constraints. Some systems even couple inertial systems (IMU) with the visual pose estimation from the camera for robust pose estimation. This work extends the idea of a tightly coupled visual inertial system to a photometric error based method of semi dense visual odometry. The estimated pose from the camera is prone to error spikes due to loss in tracking keyframes. With extensive experimentation we prove that our visual inertial system for semi dense visual odometry is better than using visual odometry alone. We also show robustness of our method and compare our performance to the existing state of the art tightly coupled visual inertial systems that exist in an outdoor environment.

[1]  Daniel Cremers,et al.  Semi-dense Visual Odometry for a Monocular Camera , 2013, 2013 IEEE International Conference on Computer Vision.

[2]  Sergey Levine,et al.  End-to-End Training of Deep Visuomotor Policies , 2015, J. Mach. Learn. Res..

[3]  Stergios I. Roumeliotis,et al.  A Multi-State Constraint Kalman Filter for Vision-aided Inertial Navigation , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[4]  Rob Fergus,et al.  Depth Map Prediction from a Single Image using a Multi-Scale Deep Network , 2014, NIPS.

[5]  Anastasios I. Mourikis,et al.  High-precision, consistent EKF-based visual-inertial odometry , 2013, Int. J. Robotics Res..

[6]  Martial Hebert,et al.  Vision and Learning for Deliberative Monocular Cluttered Flight , 2014, FSR.

[7]  David J. Fleet,et al.  Performance of optical flow techniques , 1994, International Journal of Computer Vision.

[8]  Vijay Kumar,et al.  Tightly-coupled monocular visual-inertial fusion for autonomous flight of rotorcraft MAVs , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[9]  J.-Y. Bouguet,et al.  Pyramidal implementation of the lucas kanade feature tracker , 1999 .

[10]  James R. Bergen,et al.  Visual odometry , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[11]  Horst Bischof,et al.  Incremental Surface Extraction from Sparse Structure-from-Motion Point Clouds , 2013, BMVC.

[12]  Gaurav S. Sukhatme,et al.  Combined optic-flow and stereo-based navigation of urban canyons for a UAV , 2005, 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[13]  Vijay Kumar,et al.  Vision-Based State Estimation and Trajectory Control Towards High-Speed Flight with a Quadrotor , 2013, Robotics: Science and Systems.

[14]  Yan Lu,et al.  A Trail-Following Robot Which Uses Appearance and Structural Cues , 2012, FSR.

[15]  Salah Sukkarieh,et al.  Visual-Inertial-Aided Navigation for High-Dynamic Motion in Built Environments Without Initial Conditions , 2012, IEEE Transactions on Robotics.

[16]  Daniel Cremers,et al.  LSD-SLAM: Large-Scale Direct Monocular SLAM , 2014, ECCV.

[17]  Venu Madhav Govindu Lie-algebraic averaging for globally consistent motion estimation , 2004, CVPR 2004.

[18]  Marc Pollefeys,et al.  Semi-direct EKF-based monocular visual-inertial odometry , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[19]  Christian Kerl Odometry from RGB-D Cameras for Autonomous Quadrocopters , 2012 .

[20]  Frank Dellaert,et al.  IMU Preintegration on Manifold for Efficient Visual-Inertial Maximum-a-Posteriori Estimation , 2015, Robotics: Science and Systems.

[21]  Vijay Kumar,et al.  3 D Indoor Exploration with a Computationally Constrained MAV , 2011 .

[22]  Vijay Kumar,et al.  Autonomous multi-floor indoor navigation with a computationally constrained MAV , 2011, 2011 IEEE International Conference on Robotics and Automation.

[23]  Changchang Wu,et al.  Towards Linear-Time Incremental Structure from Motion , 2013, 2013 International Conference on 3D Vision.

[24]  Gamini Dissanayake,et al.  A review of recent developments in Simultaneous Localization and Mapping , 2011, 2011 6th International Conference on Industrial and Information Systems.