论文信息 - MODS: Fast and robust method for two-view matching

MODS: Fast and robust method for two-view matching

Abstract A novel algorithm for wide-baseline matching called MODS—matching on demand with view synthesis—is presented. The MODS algorithm is experimentally shown to solve a broader range of wide-baseline problems than the state of the art while being nearly as fast as standard matchers on simple problems. The apparent robustness vs. speed trade-off is finessed by the use of progressively more time-consuming feature detectors and by on-demand generation of synthesized images that is performed until a reliable estimate of geometry is obtained. We introduce an improved method for tentative correspondence selection, applicable both with and without view synthesis. A modification of the standard first to second nearest distance rule increases the number of correct matches by 5–20% at no additional computational cost. Performance of the MODS algorithm is evaluated on several standard publicly available datasets, and on a new set of geometrically challenging wide baseline problems that is made public together with the ground truth. Experiments show that the MODS outperforms the state-of-the-art in robustness and speed. Moreover, MODS performs well on other classes of difficult two-view problems like matching of images from different modalities, with wide temporal baseline or with significant lighting changes.

[1] Noah Snavely,et al. Image matching using local symmetry features , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[2] Cordelia Schmid,et al. Scale & Affine Invariant Interest Point Detectors , 2004, International Journal of Computer Vision.

[3] Tom Drummond,et al. Machine Learning for High-Speed Corner Detection , 2006, ECCV.

[4] Rahul Sukthankar,et al. D-Nets: Beyond patch-based image descriptors , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[5] Chia-Ling Tsai,et al. Registration of Challenging Image Pairs: Initialization, Estimation, and Decision , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6] Raj Gupta,et al. Robust order-based methods for feature description , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[7] Andrew Zisserman,et al. Wide baseline stereo matching , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[8] G LoweDavid,et al. Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[9] David G. Lowe,et al. Shape Descriptors for Maximally Stable Extremal Regions , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[10] Karel Lenc,et al. A Few Things One Should Know About Feature Extraction , Description and Matching , 2014 .

[11] Jean-Michel Morel,et al. ASIFT: An Algorithm for Fully Affine Invariant Comparison , 2011, Image Process. Line.

[12] Andrew Zisserman,et al. Multiple View Geometry in Computer Vision (2nd ed) , 2003 .

[13] Vincent Lepetit,et al. Keypoint recognition using randomized trees , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14] Darius Burschka,et al. Adaptive and Generic Corner Detection Based on the Accelerated Segment Test , 2010, ECCV.

[15] Jitendra Malik,et al. Shape matching and object recognition using shape contexts , 2010, 2010 3rd International Conference on Computer Science and Information Technology.

[16] Angel Domingo Sappa,et al. Multispectral Image Feature Points , 2012, Sensors.

[17] Cordelia Schmid,et al. A Comparison of Affine Region Detectors , 2005, International Journal of Computer Vision.

[18] Pierre Vandergheynst,et al. FREAK: Fast Retina Keypoint , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[19] Christopher Hunt,et al. Notes on the OpenSURF Library , 2009 .

[20] Jiri Matas,et al. Two-view matching with view synthesis revisited , 2013, 2013 28th International Conference on Image and Vision Computing New Zealand (IVCNZ 2013).

[21] Bodo Rosenhahn,et al. Increasing the accuracy of feature evaluation benchmarks using differential evolution , 2011, 2011 IEEE Symposium on Differential Evolution (SDE).

[22] Jiri Matas,et al. Efficient Image Detail Mining , 2014, ACCV.

[23] Jiri Matas,et al. Robust wide-baseline stereo from maximally stable extremal regions , 2004, Image Vis. Comput..

[24] Jiri Matas,et al. Fixing the Locally Optimized RANSAC , 2012, BMVC.

[25] Michael Isard,et al. Object retrieval with large vocabularies and fast spatial matching , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[26] Bin Fan,et al. Local Intensity Order Pattern for feature description , 2011, 2011 International Conference on Computer Vision.

[27] Yongtian Wang,et al. A completely affine invariant image-matching method based on perspective projection , 2011, Machine Vision and Applications.

[28] Cordelia Schmid,et al. A performance evaluation of local descriptors , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29] Bill Triggs,et al. Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[30] Serge J. Belongie,et al. Ultra-wide Baseline Aerial Imagery Matching in Urban Environments , 2013, BMVC.

[31] Wen-Hsiang Tsai,et al. Vision-Based Autonomous Vehicle Guidance for Indoor Security Patrolling by a SIFT-Based Vehicle-Localization Technique , 2010, IEEE Transactions on Vehicular Technology.

[32] Charles V. Stewart,et al. Keypoint Descriptors for Matching Across Multiple Image Modalities and Non-linear Intensity Variations , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[33] Wei Li,et al. Fully affine invariant SURF for image matching , 2012, Neurocomputing.

[34] Zhenbing Zhao,et al. Infrared/visible image matching algorithm based on NSCT and DAISY , 2011, 2011 4th International Congress on Image and Signal Processing.

[35] Lionel Moisan,et al. A Probabilistic Criterion to Detect Rigid Point Matches Between Two Images and Estimate the Fundamental Matrix , 2004, International Journal of Computer Vision.

[36] G. G. Stokes. "J." , 1890, The New Yale Book of Quotations.

[37] Matthijs C. Dorst. Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[38] Pietro Perona,et al. Evaluation of Features Detectors and Descriptors Based on 3D Objects , 2005, ICCV.

[39] Jiri Matas,et al. Locally Optimized RANSAC , 2003, DAGM-Symposium.

[40] Jean-Michel Morel,et al. ASIFT: A New Framework for Fully Affine Invariant Image Comparison , 2009, SIAM J. Imaging Sci..

[41] Henrik Aanæs,et al. Interesting Interest Points , 2011, International Journal of Computer Vision.

[42] Ieee Xplore,et al. IEEE Transactions on Pattern Analysis and Machine Intelligence Information for Authors , 2022, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[43] Jiri Matas,et al. Matching with PROSAC - progressive sample consensus , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[44] Jie Tian,et al. A Partial Intensity Invariant Feature Descriptor for Multimodal Retinal Image Registration , 2010, IEEE Transactions on Biomedical Engineering.

[45] Gary R. Bradski,et al. ORB: An efficient alternative to SIFT or SURF , 2011, 2011 International Conference on Computer Vision.

[46] Zhanyi Hu,et al. This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. IEEE TRANSACTION ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 1 Rotationally Invariant Descript , 2011 .

[47] Henrik Aanæs,et al. Finding the Best Feature Detector-Descriptor Combination , 2011, 2011 International Conference on 3D Imaging, Modeling, Processing, Visualization and Transmission.

[48] Pietro Perona,et al. Evaluation of Features Detectors and Descriptors based on 3D Objects , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[49] Andrew Zisserman,et al. Three things everyone should know to improve object retrieval , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[50] Bernhard P. Wrobel,et al. Multiple View Geometry in Computer Vision , 2001 .

[51] Roland Siegwart,et al. BRISK: Binary Robust invariant scalable keypoints , 2011, 2011 International Conference on Computer Vision.

[52] Jiri Matas,et al. Two-view geometry estimation unaffected by a dominant plane , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[53] David G. Lowe,et al. Fast Approximate Nearest Neighbors with Automatic Algorithm Configuration , 2009, VISAPP.