Matchability Prediction for Full-Search Template Matching Algorithms

While recent approaches have shown that it is possible to do template matching by exhaustively scanning the parameter space, the resulting algorithms are still quite demanding. In this paper we alleviate the computational load of these algorithms by proposing an efficient approach for predicting the match ability of a template, before it is actually performed. This avoids large amounts of unnecessary computations. We learn the match ability of templates by using dense convolutional neural network descriptors that do not require ad-hoc criteria to characterize a template. By using deep learning descriptions of patches we are able to predict match ability over the whole image quite reliably. We will also show how no specific training data is required to solve problems like panorama stitching in which you usually require data from the scene in question. Due to the highly parallelizable nature of this tasks we offer an efficient technique with a negligible computational cost at test time.

[1]  Cordelia Schmid,et al.  A Performance Evaluation of Local Descriptors , 2005, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Bernhard Schölkopf,et al.  New Support Vector Algorithms , 2000, Neural Computation.

[3]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[4]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[5]  Konrad Schindler,et al.  Predicting Matchability , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Federico Tombari,et al.  Fast Full-Search Equivalent Template Matching by Enhanced Bounded Correlation , 2008, IEEE Transactions on Image Processing.

[7]  Tinne Tuytelaars,et al.  Discriminatively Trained Templates for 3D Object Detection: A Real Time Scalable Approach , 2013, 2013 IEEE International Conference on Computer Vision.

[8]  Vincent Lepetit,et al.  Pose Priors for Simultaneously Solving Alignment and Correspondence , 2008, ECCV.

[9]  Yoshua Bengio,et al.  How transferable are features in deep neural networks? , 2014, NIPS.

[10]  Michael Werman,et al.  Accelerating Pattern Matching or How Much Can You Slide? , 2007, ACCV.

[11]  Dumitru Erhan,et al.  Deep Neural Networks for Object Detection , 2013, NIPS.

[12]  Petros Maragos,et al.  Motion displacement estimation using an affine model for image matching , 1991 .

[13]  Vincent Lepetit,et al.  Combining Geometric and Appearance Priors for Robust Homography Estimation , 2010, ECCV.

[14]  Federico Tombari,et al.  Full-Search-Equivalent Pattern Matching with Incremental Dissimilarity Approximations , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Vincent Lepetit,et al.  Linear and Quadratic Subsets for Template-Based Tracking , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Jiri Matas,et al.  Tracking by an Optimal Sequence of Linear Predictors , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Quan Wang,et al.  Real-Time Image Matching Based on Multiple View Kernel Projection , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[18]  Vincent Lepetit,et al.  Multimodal templates for real-time detection of texture-less objects in heavily cluttered scenes , 2011, 2011 International Conference on Computer Vision.

[19]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[20]  Hae Yong Kim,et al.  Grayscale Template-Matching Invariant to Rotation, Scale, Translation, Brightness and Contrast , 2007, PSIVT.

[21]  Cordelia Schmid,et al.  A Comparison of Affine Region Detectors , 2005, International Journal of Computer Vision.

[22]  Shai Avidan,et al.  FasT-Match: Fast Affine Template Matching , 2013, CVPR.

[23]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[24]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[25]  Toshimitsu Kaneko,et al.  Feature selection for reliable tracking using template matching , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[26]  Paolo Valigi,et al.  Learning Contours for Automatic Annotations of Mountains Pictures on a Smartphone , 2014, ICDSC.

[27]  Samuel Audet,et al.  Image-Based Rendering Using Image-Based Priors , 2006 .

[28]  Vincent Lepetit,et al.  Dominant orientation templates for real-time detection of texture-less objects , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[29]  Francesc Moreno-Noguer,et al.  Non-Rigid Graph Registration Using Active Testing Search , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30]  Vittorio Ferrari,et al.  Exploiting spatial overlap to efficiently compute appearance distances between image windows , 2011, NIPS 2011.

[31]  Francesc Moreno-Noguer,et al.  Simultaneous correspondence and non-rigid 3D reconstruction of the coronary tree from single X-ray images , 2011, 2011 International Conference on Computer Vision.

[32]  Nassir Navab,et al.  Rapid selection of reliable templates for visual tracking , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[33]  Davide Scaramuzza,et al.  Appearance-based Active, Monocular, Dense Reconstruction for Micro Aerial Vehicles , 2014, Robotics: Science and Systems.