GMS: Grid-Based Motion Statistics for Fast, Ultra-robust Feature Correspondence

Feature matching aims to generate correspondences across images and is widely used in many computer vision tasks. Although considerable progress has been made on feature descriptors and on fast matching for initial correspondence hypotheses, selecting the good hypotheses from among them remains challenging and is critical to overall performance. More importantly, existing methods often require long computation times, which limits their use in real-time applications. This paper attempts to separate true correspondences from false ones at high speed. We term the proposed method Grid-based Motion Statistics (GMS). It incorporates the smoothness constraint into a statistical framework for separation and uses a grid-based implementation for fast calculation. GMS is robust to various challenging image changes, including changes in viewpoint, scale, and rotation. It is also fast, taking only 1 or 2 ms in a single CPU thread even when 50K correspondences are processed. This has important implications for real-time applications. Moreover, we show that incorporating GMS into the classic feature matching and epipolar geometry estimation pipeline can significantly boost the overall performance. Finally, we integrate GMS into the well-known ORB-SLAM system for monocular initialization, resulting in a significant improvement.
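
The grid-based idea in the abstract can be illustrated with a small sketch. It is not the authors' implementation: it assumes both images are split into a fixed `grid` x `grid` layout, that a match is supported by other matches linking the same pair of cells and their 3x3 neighbours, and that a cell pair is accepted when this support exceeds `alpha * sqrt(n)`, where `n` is the mean number of matches per cell in the neighbourhood. The function names, the default `grid=20` and `alpha=6.0`, and the omission of rotation and scale handling are all assumptions made for illustration.

```python
from collections import Counter
from math import sqrt

# Offsets of the 3x3 neighbourhood around a cell (including the cell itself).
OFFSETS = [(dr, dc) for dr in (-1, 0, 1) for dc in (-1, 0, 1)]


def cell_of(pt, size, grid):
    """Map a pixel coordinate (x, y) to a (row, col) grid cell."""
    width, height = size
    col = min(int(pt[0] * grid / width), grid - 1)
    row = min(int(pt[1] * grid / height), grid - 1)
    return row, col


def grid_motion_filter(pts1, pts2, size1, size2, grid=20, alpha=6.0):
    """Return one keep/reject flag per match pts1[i] <-> pts2[i].

    Hypothetical helper: a simplified, translation-only variant of
    grid-based motion statistics, written for illustration.
    """
    cells1 = [cell_of(p, size1, grid) for p in pts1]
    cells2 = [cell_of(p, size2, grid) for p in pts2]
    pair_counts = Counter(zip(cells1, cells2))  # matches per cell pair
    cell_counts = Counter(cells1)               # matches per image-1 cell

    # For every image-1 cell, find the image-2 cell receiving most matches.
    best = {}
    for (c1, c2), n in pair_counts.items():
        if c1 not in best or n > pair_counts[(c1, best[c1])]:
            best[c1] = c2

    # Score each candidate cell pair by the support of its neighbourhood.
    accepted = {}
    for c1, c2 in best.items():
        support = 0
        total = 0
        for dr, dc in OFFSETS:
            n1 = (c1[0] + dr, c1[1] + dc)
            n2 = (c2[0] + dr, c2[1] + dc)
            support += pair_counts.get((n1, n2), 0)
            total += cell_counts.get(n1, 0)
        # Assumed threshold: alpha * sqrt(mean matches per neighbourhood cell).
        accepted[c1] = support > alpha * sqrt(total / len(OFFSETS))

    return [accepted[c1] and best[c1] == c2 for c1, c2 in zip(cells1, cells2)]
```

Because the work is a few counting passes over the matches plus a constant-size neighbourhood lookup per occupied cell, the cost grows roughly linearly with the number of correspondences, which is consistent with the speed the abstract emphasises. Isolated matches, and matches that disagree with the dominant motion of their cell, receive little support and are rejected.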
