论文信息 - A Theory of Local Matching: SIFT and Beyond

A Theory of Local Matching: SIFT and Beyond

Why has SIFT been so successful? Why its extension, DSP-SIFT, can further improve SIFT? Is there a theory that can explain both? How can such theory benefit real applications? Can it suggest new algorithms with reduced computational complexity or new descriptors with better accuracy for matching? We construct a general theory of local descriptors for visual matching. Our theory relies on concepts in energy minimization and heat diffusion. We show that SIFT and DSP-SIFT approximate the solution the theory suggests. In particular, DSP-SIFT gives a better approximation to the theoretical solution; justifying why DSP-SIFT outperforms SIFT. Using the developed theory, we derive new descriptors that have fewer parameters and are potentially better in handling affine deformations.

Hossein Mobahi | Stefano Soatto

[1] Hossein Mobahi,et al. A Theoretical Analysis of Optimization by Gaussian Continuation , 2015, AAAI.

[2] Hossein Mobahi,et al. On the Link between Gaussian Homotopy Continuation and Convex Envelopes , 2015, EMMCVPR.

[3] Jonathan Balzer,et al. Multi-view feature engineering and learning , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4] Takeo Kanade,et al. An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[5] Laura Sevilla-Lara,et al. Distribution Fields with Adaptive Kernels for Large Displacement Image Alignment , 2013, BMVC.

[6] Stefano Soatto,et al. Domain-size pooling in local descriptors: DSP-SIFT , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7] Andrea Vedaldi,et al. Vlfeat: an open and portable library of computer vision algorithms , 2010, ACM Multimedia.