Toward featureless visual navigation: Simultaneous localization and planar surface extraction using motion vectors in video streams

Unlike the traditional feature-based methods, we propose using motion vectors (MVs) from video streams as inputs for visual navigation. Although MVs are very noisy and with low spatial resolution, MVs do possess high temporal resolution which means it is possible to merge MVs from different frames to improve signal quality. Homography filtering and MV thresholding are proposed to further improve MV quality so that we can establish plane observations from MVs. We propose an extended Kalman filter (EKF) based approach to simultaneously track robot motion and planes. We formally model error propagation of MVs and derive variance of the merged MVs. We have implemented the proposed method and tested it in physical experiments. Results show that the system is capable of performing robot localization and plane mapping with a relative trajectory error of less than 5.1%.

[1]  Tom Drummond,et al.  Edge landmarks in monocular SLAM , 2009, Image Vis. Comput..

[2]  J. Weickert,et al.  Lucas/Kanade meets Horn/Schunck: combining local and global optic flow methods , 2005 .

[3]  Yan Lu,et al.  Automatic building exterior mapping using multilayer feature graphs , 2013, 2013 IEEE International Conference on Automation Science and Engineering (CASE).

[4]  Il Hong Suh,et al.  SoF-SLAM: Segments-on-Floor-based monocular SLAM , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[5]  Óscar Martínez Mozos,et al.  A comparative evaluation of interest point detectors and local descriptors for visual SLAM , 2010, Machine Vision and Applications.

[6]  Ji Zhang,et al.  Error Aware Monocular Visual Odometry using Vertical Line Pairs for Small Robots in Urban Areas , 2010, AAAI.

[7]  Filippo Vella,et al.  Digital image stabilization by adaptive block motion vectors filtering , 2002, IEEE Trans. Consumer Electron..

[8]  Sebastian Thrun,et al.  Probabilistic robotics , 2002, CACM.

[9]  Soon-Jo Chung,et al.  CurveSLAM: An approach for vision-based navigation without point features , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[10]  Il Hong Suh,et al.  Building a partial 3D line-based map using a monocular SLAM , 2011, 2011 IEEE International Conference on Robotics and Automation.

[11]  Walterio W. Mayol-Cuevas,et al.  Discovering Higher Level Structure in Visual SLAM , 2008, IEEE Transactions on Robotics.

[12]  Simon Lacroix,et al.  Monocular-vision based SLAM using Line Segments , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[13]  Ian D. Reid,et al.  Growing semantically meaningful models for visual SLAM , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[14]  Ian D. Reid,et al.  Real-Time Monocular SLAM with Straight Lines , 2006, BMVC.

[15]  John J. Craig,et al.  Introduction to Robotics Mechanics and Control , 1986 .

[16]  Danica Kragic,et al.  A framework for vision based bearing only 3D SLAM , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[17]  Dezhen Song,et al.  Vision-based motion planning for an autonomous motorcycle on ill-structured roads , 2007, Auton. Robots.

[18]  Dezhen Song,et al.  On the Analysis of the Depth Error on the Road Plane for Monocular Vision-Based Robot Navigation , 2008, WAFR.

[19]  Lorenzo Favalli,et al.  Object tracking for retrieval applications in MPEG-2 , 2000, IEEE Trans. Circuits Syst. Video Technol..

[20]  Tom Drummond,et al.  Scalable Monocular SLAM , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[21]  Javier Civera,et al.  Dense multi-planar scene estimation from a sparse set of images , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[22]  Andrew Zisserman,et al.  Multiple View Geometry in Computer Vision (2nd ed) , 2003 .

[23]  Yan Lu,et al.  A two-view based multilayer feature graph for robot navigation , 2012, 2012 IEEE International Conference on Robotics and Automation.

[24]  Olivier Stasse,et al.  MonoSLAM: Real-Time Single Camera SLAM , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Andrew J. Davison,et al.  Live dense reconstruction with a single moving camera , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[26]  Ji Zhang,et al.  On the error analysis of vertical line pair-based monocular visual odometry in urban area , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[27]  Thomas S. Huang,et al.  Fast camera motion analysis in MPEG domain , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[28]  Yan Lu,et al.  High level landmark-based visual navigation using unsupervised geometric constraints in local bundle adjustment , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).