Performance comparison of various feature detector-descriptor combinations for content-based image retrieval with JPEG-encoded query images

We study the impact of JPEG compression on the performance of an image retrieval system for different feature detector-descriptor combinations. The VLBenchmarks retrieval framework is used to compare a total of 60 detector-descriptor combinations for a dataset with JPEG-encoded query images. Our results show that among all tested detectors, the Hessian-Affine detector leads to the most robust performance in the presence of strong JPEG compression. Additionally, we compare the retrieval gains of the different detector-descriptor pairs after processing the JPEG-encoded query images with different deblocking filters. The results illustrate that for the MSER, MFD and WαSH detectors, the retrieval results benefit from two of the deblocking approaches at low bit rate irrespective of what descriptor the detectors are combined with. The same two deblocking filters are found to increase the retrieval performance for the MROGH descriptor when combined with most of the tested detectors.

[1]  Pietro Perona,et al.  Evaluation of Features Detectors and Descriptors Based on 3D Objects , 2005, ICCV.

[2]  Tom Drummond,et al.  Machine Learning for High-Speed Corner Detection , 2006, ECCV.

[3]  Adrien Bartoli,et al.  KAZE Features , 2012, ECCV.

[4]  Cordelia Schmid,et al.  A Comparison of Affine Region Detectors , 2005, International Journal of Computer Vision.

[5]  Jitendra Malik,et al.  Shape matching and object recognition using shape contexts , 2010, 2010 3rd International Conference on Computer Science and Information Technology.

[6]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.

[7]  Hu Chen,et al.  On the design of a novel JPEG quantization table for improved feature detection performance , 2013, 2013 IEEE International Conference on Image Processing.

[8]  Zhanyi Hu,et al.  This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. IEEE TRANSACTION ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 1 Rotationally Invariant Descript , 2011 .

[9]  Henrik Aanæs,et al.  Finding the Best Feature Detector-Descriptor Combination , 2011, 2011 International Conference on 3D Imaging, Modeling, Processing, Visualization and Transmission.

[10]  Yan Ke,et al.  PCA-SIFT: a more distinctive representation for local image descriptors , 2004, CVPR 2004.

[11]  Wen Gao,et al.  Optimizing JPEG quantization table for low bit rate mobile visual search , 2012, 2012 Visual Communications and Image Processing.

[12]  Yannis Avrithis,et al.  The medial feature detector: Stable regions from image boundaries , 2011, 2011 International Conference on Computer Vision.

[13]  Anas Al-Nuaimi,et al.  Mobile Visual Location Recognition , 2013 .

[14]  Cordelia Schmid,et al.  Scale & Affine Invariant Interest Point Detectors , 2004, International Journal of Computer Vision.

[15]  Kurt Konolige,et al.  CenSurE: Center Surround Extremas for Realtime Feature Detection and Matching , 2008, ECCV.

[16]  Krystian Mikolajczyk,et al.  Evaluation of local detectors and descriptors for fast feature matching , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[17]  Aria Nosratinia,et al.  Enhancement of JPEG-Compressed Images by Re-application of JPEG , 2001, J. VLSI Signal Process..

[18]  Yannis Avrithis,et al.  WαSH: Weighted α-Shapes for Local Feature Detection , 2012, ECCV.

[19]  Lucas J. van Vliet,et al.  Blocking artifacts removal by a hybrid filter method , 2005 .

[20]  Jiri Matas,et al.  Robust wide-baseline stereo from maximally stable extremal regions , 2004, Image Vis. Comput..

[21]  Michael Isard,et al.  Object retrieval with large vocabularies and fast spatial matching , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[22]  C. Schmid,et al.  Exploiting descriptor distances for precise image search , 2011 .

[23]  Bin Fan,et al.  Local Intensity Order Pattern for feature description , 2011, 2011 International Conference on Computer Vision.

[24]  Eckehard G. Steinbach,et al.  Preserving SIFT features in JPEG-encoded images , 2011, 2011 18th IEEE International Conference on Image Processing.

[25]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[26]  Karen O. Egiazarian,et al.  Pointwise Shape-Adaptive DCT for High-Quality Denoising and Deblocking of Grayscale and Color Images , 2007, IEEE Transactions on Image Processing.

[27]  Roland Siegwart,et al.  BRISK: Binary Robust invariant scalable keypoints , 2011, 2011 International Conference on Computer Vision.

[28]  Bernd Girod,et al.  Mobile Visual Search , 2011, IEEE Signal Processing Magazine.

[29]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[30]  Cordelia Schmid,et al.  A Performance Evaluation of Local Descriptors , 2005, IEEE Trans. Pattern Anal. Mach. Intell..