Localized global descriptors for image retrieval: An extensive evaluation on adaptations to the SIMPLE model

SIMPLE (Searching Images with MPEG-7 (& MPEG-7-like) Powered Localized dEscriptors) is a model that proposes the reuse of well-established global descriptors by localizing their description mechanism on image patches located by local features' detectors. Having displayed impressive retrieval results on two different databases, in this paper we extend the family by replacing the originally picked global descriptors and by applying VLAD, a BOVW alternative, for the vectorization of the features. We re-evaluate all SIMPLE descriptors (original and proposed here) on five benchmarking databases featuring very diverse scenarios, so as to investigate how SIMPLE works with different image retrieval cases, ranging from near duplicate search to visual object similarity. The experimental results show the robustness of the scheme with SIMPLE descriptors presenting stable and high retrieval performances across all tested collections, outperforming not only the methods they originated from, but also some of the best reported state-of-the-art methods as well.

[1]  Jan Flusser,et al.  Affine moment invariants generated by graph method , 2011, Pattern Recognit..

[2]  Yiannis S. Boutalis,et al.  Co.Vi.Wo.: Color Visual Words Based on Non-Predefined Size Codebooks , 2013, IEEE Transactions on Cybernetics.

[3]  Mathias Lux,et al.  Lire: lucene image retrieval: an extensible java CBIR library , 2008, ACM Multimedia.

[4]  Roland Siegwart,et al.  BRISK: Binary Robust invariant scalable keypoints , 2011, 2011 International Conference on Computer Vision.

[5]  Cordelia Schmid,et al.  On the burstiness of visual elements , 2009, CVPR.

[6]  Koen E. A. van de Sande,et al.  Evaluating Color Descriptors for Object and Scene Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Songhe Feng,et al.  An error-tolerant approximate matching algorithm for labeled combinatorial maps , 2015, Neurocomputing.

[8]  Kristin J. Dana,et al.  Compact representation of bidirectional texture functions , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[9]  Hermann Ney,et al.  Features for image retrieval: an experimental comparison , 2008, Information Retrieval.

[10]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[11]  Savvas A. Chatzichristofis,et al.  Image moment invariants as local features for content based image retrieval using the Bag-of-Visual-Words model , 2015, Pattern Recognit. Lett..

[12]  B. S. Manjunath,et al.  Color and texture descriptors , 2001, IEEE Trans. Circuits Syst. Video Technol..

[13]  Mathias Lux,et al.  Localizing global descriptors for content-based image retrieval , 2015, EURASIP Journal on Advances in Signal Processing.

[14]  Cordelia Schmid,et al.  Improving Bag-of-Features for Large Scale Image Search , 2010, International Journal of Computer Vision.

[15]  Jing Huang,et al.  Image indexing using color correlograms , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[16]  Yiannis S. Boutalis,et al.  Searching images with MPEG-7 (& MPEG-7-like) Powered Localized dEscriptors: The SIMPLE answer to effective Content Based Image Retrieval , 2014, 2014 12th International Workshop on Content-Based Multimedia Indexing (CBMI).

[17]  Gary R. Bradski,et al.  ORB: An efficient alternative to SIFT or SURF , 2011, 2011 International Conference on Computer Vision.

[18]  Dharmendra Saraswat,et al.  Smartphone-based hierarchical crowdsourcing for weed identification , 2015, Comput. Electron. Agric..

[19]  Yiannis S. Boutalis,et al.  Selection of the proper Compact Composite Descriptor for improving content based image retrieval , 2009 .

[20]  Cordelia Schmid,et al.  Aggregating local descriptors into a compact image representation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[21]  Yiannis S. Boutalis,et al.  CEDD: Color and Edge Directivity Descriptor: A Compact Descriptor for Image Indexing and Retrieval , 2008, ICVS.

[22]  Dim P. Papadopoulos,et al.  Automatic summarization and annotation of videos with lack of metadata information , 2013, Expert Syst. Appl..

[23]  Qi Tian,et al.  Coupled Binary Embedding for Large-Scale Image Retrieval , 2014, IEEE Transactions on Image Processing.

[24]  C. Schmid,et al.  On the burstiness of visual elements , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[25]  Sašo Džeroski,et al.  Improved medical image modality classification using a combination of visual and textual features , 2015, Comput. Medical Imaging Graph..

[26]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[27]  David Nistér,et al.  Scalable Recognition with a Vocabulary Tree , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[28]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.

[29]  Matti Pietikäinen,et al.  Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[30]  Yiannis S. Boutalis,et al.  FCTH: Fuzzy Color and Texture Histogram - A Low Level Feature for Accurate Image Retrieval , 2008, 2008 Ninth International Workshop on Image Analysis for Multimedia Interactive Services.

[31]  Gerald Schaefer,et al.  UCID: an uncompressed color image database , 2003, IS&T/SPIE Electronic Imaging.

[32]  Matthijs Douze,et al.  Bag-of-colors for improved image search , 2011, ACM Multimedia.