Fast Explicit Diffusion for Accelerated Features in Nonlinear Scale Spaces

We propose a novel and fast multiscale feature detection and description approach that exploits the benefits of nonlinear scale spaces. Previous attempts to detect and describe features in nonlinear scale spaces such as KAZE [1] and BFSIFT [6] are highly time consuming due to the computational burden of creating the nonlinear scale space. In this paper we propose to use recent numerical schemes called Fast Explicit Diffusion (FED) [3, 4] embedded in a pyramidal framework to dramatically speed-up feature detection in nonlinear scale spaces. In addition, we introduce a Modified-Local Difference Binary (M-LDB) descriptor that is highly efficient, exploits gradient information from the nonlinear scale space, is scale and rotation invariant and has low storage requirements. Our features are called Accelerated-KAZE (A-KAZE) due to the dramatic speed-up introduced by FED schemes embedded in a pyramidal framework.

[1]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[2]  David J. Kriegman,et al.  Locally Uniform Comparison Image Descriptor , 2012, NIPS.

[3]  Henrik Aanæs,et al.  Interesting Interest Points , 2011, International Journal of Computer Vision.

[4]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[5]  K. S. Pedersen,et al.  A Comparative Study of Interest Point Performance on a Unique Data Set , 2011 .

[6]  Adrien Bartoli,et al.  KAZE Features , 2012, ECCV.

[7]  Cordelia Schmid,et al.  A Comparison of Affine Region Detectors , 2005, International Journal of Computer Vision.

[8]  Thomas S. Huang,et al.  Image processing , 1971 .

[9]  Andrea Vedaldi,et al.  Vlfeat: an open and portable library of computer vision algorithms , 2010, ACM Multimedia.

[10]  Joachim Weickert,et al.  From Box Filtering to Fast Explicit Diffusion , 2010, DAGM-Symposium.

[11]  Vincent Lepetit,et al.  BRIEF: Computing a Local Binary Descriptor Very Fast , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Jitendra Malik,et al.  Scale-Space and Edge Detection Using Anisotropic Diffusion , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Heinz Handels,et al.  Fast Explicit Diffusion for Registration with Direction-Dependent Regularization , 2012, WBIR.

[14]  Max A. Viergever,et al.  Efficient and reliable schemes for nonlinear diffusion filtering , 1998, IEEE Trans. Image Process..

[15]  Tom Drummond,et al.  Machine Learning for High-Speed Corner Detection , 2006, ECCV.

[16]  Hanno Scharr,et al.  A Scheme for Coherence-Enhancing Diffusion Filtering with Optimized Rotation Invariance , 2002, J. Vis. Commun. Image Represent..

[17]  Jan-Michael Frahm,et al.  Comparative Evaluation of Binary Features , 2012, ECCV.

[18]  Rami Ben-Ari,et al.  Variational Depth from Defocus in real-time , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[19]  Roland Siegwart,et al.  BRISK: Binary Robust invariant scalable keypoints , 2011, 2011 International Conference on Computer Vision.

[20]  Jiri Matas,et al.  Tracking by an Optimal Sequence of Linear Predictors , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Jian Sun,et al.  Face Alignment by Explicit Shape Regression , 2012, International Journal of Computer Vision.

[22]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.

[23]  Hongjian You,et al.  BFSIFT: A Novel Method to Find Feature Matches for SAR Image Registration , 2012, IEEE Geoscience and Remote Sensing Letters.

[24]  Gary R. Bradski,et al.  ORB: An efficient alternative to SIFT or SURF , 2011, 2011 International Conference on Computer Vision.

[25]  Xin Yang,et al.  LDB: An ultra-fast feature for scalable Augmented Reality on mobile devices , 2012, 2012 IEEE International Symposium on Mixed and Augmented Reality (ISMAR).

[26]  Joachim Weickert,et al.  Cyclic Schemes for PDE-Based Image Analysis , 2016, International Journal of Computer Vision.