Adaptively feature matching via joint transformational-spatial clustering

The transformational and spatial proximities are important cues for identifying inliers from an appearance based match set because correct matches generally stay close in input images and share similar local transformations. However, most existing approaches only check one type of them or both types consecutively with manually set thresholds, and thus their matching accuracy and flexibility in handling large-scale images are limited. In this paper, we present an efficient clustering based approach to identify match inliers with both proximities simultaneously. It first projects the putative matches into a joint transformational-spatial space, where mismatches tend to scatter all around while correct matches gather together. A mode-seeking process based on joint kernel density estimation is then proposed to obtain significant clusters in the joint space, where each cluster contains matches mapping the same object across images with high accuracy. Moreover, kernel bandwidths for measuring match proximities are adaptively set during density estimation, which enhances its applicability for matching different images. Experiments on three standard datasets show that the proposed approach delivers superior performance on a variety of feature matching tasks, including multi-object matching, duplicate object matching and object retrieval.

[1]  Josef Sivic,et al.  Convolutional Neural Network Architecture for Geometric Matching , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  James Stephen Marron,et al.  Canonical kernels for density estimation , 1988 .

[3]  Philip H. S. Torr,et al.  The Development and Comparison of Robust Methods for Estimating the Fundamental Matrix , 1997, International Journal of Computer Vision.

[4]  Minh N. Do,et al.  Common Visual Pattern Discovery via Nonlinear Mean Shift Clustering , 2015, IEEE Transactions on Image Processing.

[5]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[6]  Vincent Lepetit,et al.  Learning to Find Good Correspondences , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[7]  Yasuyuki Matsushita,et al.  GMS: Grid-Based Motion Statistics for Fast, Ultra-robust Feature Correspondence , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Junjun Jiang,et al.  Robust Feature Matching Using Spatial Clustering With Heavy Outliers , 2020, IEEE Transactions on Image Processing.

[9]  Thomas Brox,et al.  DeMoN: Depth and Motion Network for Learning Monocular Stereo , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Minsu Cho,et al.  Feature correspondence and deformable object matching via agglomerative correspondence clustering , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[11]  Zhiguo Cao,et al.  NM-Net: Mining Reliable Neighbors for Robust Feature Correspondences , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Minsu Cho,et al.  Mode-seeking on graphs via random walks , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Junjun Jiang,et al.  Locality Preserving Matching , 2018, International Journal of Computer Vision.

[14]  Minsu Cho,et al.  Hyper-graph matching via reweighted random walks , 2011, CVPR 2011.

[15]  Cristian Sminchisescu,et al.  Deep Learning of Graph Matching , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[16]  Minsu Cho,et al.  Progressive graph matching: Making a move of graphs via probabilistic voting , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  Kim-Kwang Raymond Choo,et al.  Adaptive Fusion and Category-Level Dictionary Learning Model for Multiview Human Action Recognition , 2019, IEEE Internet of Things Journal.

[18]  Ahmed A. Abd El-Latif,et al.  Linear discriminant multi-set canonical correlations analysis (LDMCCA): an efficient approach for feature fusion of finger biometrics , 2013, Multimedia Tools and Applications.

[19]  Shaohua Wan,et al.  A unified two-parallel-branch deep neural network for joint gland contour and segmentation learning , 2019, Future Gener. Comput. Syst..

[20]  Wael Said,et al.  IoT Security Based on Iris Verification Using Multi-Algorithm Feature Level Fusion Scheme , 2019, 2019 2nd International Conference on Computer Applications & Information Security (ICCAIS).

[21]  Torsten Sattler,et al.  Comparative Evaluation of Hand-Crafted and Learned Local Features , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[22]  Wee Sun Lee,et al.  Deep Graphical Feature Learning for the Feature Matching Problem , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[23]  Kamil Adamczewski,et al.  Discrete Tabu Search for Graph Matching , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[24]  Michael Isard,et al.  Object retrieval with large vocabularies and fast spatial matching , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[25]  Shaohua Wan,et al.  Exploring Deep Learning for View-Based 3D Model Retrieval , 2020, ACM Trans. Multim. Comput. Commun. Appl..

[26]  Bo Li,et al.  Local Feature Descriptor for Image Matching: A Survey , 2019, IEEE Access.

[27]  Kyoung Mu Lee,et al.  Unsupervised detection and segmentation of identical objects , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[28]  Cordelia Schmid,et al.  Scale & Affine Invariant Interest Point Detectors , 2004, International Journal of Computer Vision.

[29]  Kunio Kashino,et al.  Robust Spatial Matching as Ensemble of Weak Geometric Relations , 2015, BMVC.

[30]  Shuicheng Yan,et al.  Fast Detection of Dense Subgraphs with Iterative Shrinking and Expansion , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  Mohammed Atiquzzaman,et al.  Automated Colorization of a Grayscale Image With Seed Points Propagation , 2020, IEEE Transactions on Multimedia.

[32]  Luc Van Gool,et al.  Simultaneous Object Recognition and Segmentation from Single or Multiple Model Views , 2006, International Journal of Computer Vision.

[33]  Vincent Lepetit,et al.  LIFT: Learned Invariant Feature Transform , 2016, ECCV.

[34]  Ahmed A. Abd El-Latif,et al.  A Fully Automatic Player Detection Method Based on One-Class SVM , 2013, IEICE Trans. Inf. Syst..

[35]  Bing-Yu Chen,et al.  Robust Feature Matching with Alternate Hough and Inverted Hough Transforms , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[36]  Cordelia Schmid,et al.  SfM-Net: Learning of Structure and Motion from Video , 2017, ArXiv.

[37]  Minsu Cho,et al.  Co-recognition of Image Pairs by Data-Driven Monte Carlo Image Exploration , 2008, ECCV.

[38]  Ahmed A. Abd El-Latif,et al.  Finger-vein Verification Using Gabor Filter and SIFT Feature Matching , 2012, 2012 Eighth International Conference on Intelligent Information Hiding and Multimedia Signal Processing.

[39]  Andrea Vedaldi,et al.  Vlfeat: an open and portable library of computer vision algorithms , 2010, ACM Multimedia.

[40]  Junjun Jiang,et al.  LMR: Learning a Two-Class Classifier for Mismatch Removal , 2019, IEEE Transactions on Image Processing.

[41]  Ahmed A. Abd El-Latif,et al.  A new image segmentation method via fusing NCut eigenvectors maps , 2012, Digital Image Processing.

[42]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[43]  C. Schmid,et al.  Exploiting descriptor distances for precise image search , 2011 .

[44]  Xin He,et al.  Saliency detection based on integrated features , 2014, Neurocomputing.

[45]  Binbin Chen,et al.  Geometry consistency aware confidence evaluation for feature matching , 2020, Image Vis. Comput..

[46]  Kamil Adamczewski,et al.  Subgraph matching using compactness prior for robust feature correspondence , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).