Convolutional neural networks for relevance feedback in content based image retrieval

Given the great success of Convolutional Neural Network (CNN) for image representation and classification tasks, we argue that Content-Based Image Retrieval (CBIR) systems could also leverage on CNN capabilities, mainly when Relevance Feedback (RF) mechanisms are employed. On the one hand, to improve the performances of CBIRs, that are strictly related to the effectiveness of the descriptors used to represent an image, as they aim at providing the user with images similar to an initial query image. On the other hand, to reduce the semantic gap between the similarity perceived by the user and the similarity computed by the machine, by exploiting an RF mechanism where the user labels the returned images as being relevant or not concerning her interests. Consequently, in this work, we propose a CBIR system based on transfer learning from a CNN trained on a vast image database, thus exploiting the generic image representation that it has already learned. Then, the pre-trained CNN is also fine-tuned exploiting the RF supplied by the user to reduce the semantic gap. In particular, after the user’s feedback, we propose to tune and then re-train the CNN according to the labelled set of relevant and non-relevant images. Then, we suggest different strategies to exploit the updated CNN for returning a novel set of images that are expected to be relevant to the user’s needs. Experimental results on different data sets show the effectiveness of the proposed mechanisms in improving the representation power of the CNN with respect to the user concept of image similarity. Moreover, the pros and cons of the different approaches can be clearly pointed out, thus providing clear guidelines for the implementation in production environments.

[1]  Marina Bosch,et al.  ImageCLEF, Experimental Evaluation in Visual Information Retrieval , 2010 .

[2]  Shuang Liang,et al.  Sketch retrieval and relevance feedback with biased SVM classification , 2008, Pattern Recognit. Lett..

[3]  Zhen Li,et al.  An efficient approach for scene categorization based on discriminative codebook learning in bag-of-words framework , 2013, Image Vis. Comput..

[4]  Anastasios Tefas,et al.  Deep convolutional learning for Content Based Image Retrieval , 2018, Neurocomputing.

[5]  Joani Mitro Content-based image retrieval tutorial , 2016, ArXiv.

[6]  Hélio Pedrini,et al.  Efficient fusion of multidimensional descriptors for image retrieval , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[7]  James Ze Wang,et al.  Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.

[8]  Rehan Ashraf,et al.  MDCBIR-MF: multimedia data for content-based image retrieval by using multiple features , 2018, Multimedia Tools and Applications.

[9]  Anastasios Tefas,et al.  Relevance Feedback in Deep Convolutional Neural Networks for Content Based Image Retrieval , 2016, SETN.

[10]  Alejandro Baldominos Gómez,et al.  Evolutionary convolutional neural networks: An application to handwriting recognition , 2017, Neurocomputing.

[11]  Wei-Chao Lin,et al.  Aggregation of Multiple Pseudo Relevance Feedbacks for Image Search Re-Ranking , 2019, IEEE Access.

[12]  Jie Xu,et al.  Query Aware Determinization of Uncertain Objects , 2015, IEEE Transactions on Knowledge and Data Engineering.

[13]  Andrew Piper,et al.  Visual information retrieval from historical document images , 2019, Journal of Cultural Heritage.

[14]  Smita Selot,et al.  Efficient System for Color Logo Recognition Based on Self-Organizing Map and Relevance Feedback Technique , 2018 .

[15]  Andrew Zisserman,et al.  Efficient Visual Search for Objects in Videos , 2008, Proceedings of the IEEE.

[16]  Trevor Darrell,et al.  DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition , 2013, ICML.

[17]  Murat Erten,et al.  HSV Color Histogram Based Image Retrieval with Background Elimination , 2019, 2019 1st International Informatics and Software Engineering Conference (UBMYK).

[18]  Matti Pietikäinen,et al.  Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  Hugo Jair Escalante,et al.  Late fusion of heterogeneous methods for multimedia image retrieval , 2008, MIR '08.

[20]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[21]  Francesco G. B. De Natale,et al.  Multimodal Retrieval with Diversification and Relevance Feedback for Tourist Attraction Images , 2017, ACM Trans. Multim. Comput. Commun. Appl..

[22]  Shahrel Azmin Suandi,et al.  Finger Vein Recognition Using Local Line Binary Pattern , 2011, Sensors.

[23]  Chun Chen,et al.  EMR: A Scalable Graph-Based Ranking Model for Content-Based Image Retrieval , 2015, IEEE Transactions on Knowledge and Data Engineering.

[24]  Shipeng Li,et al.  A bag-of-objects retrieval model for web image search , 2012, ACM Multimedia.

[25]  Mohammad Ali Bagheri,et al.  Error correcting output codes for multiclass classification: Application to two image vision problems , 2012, The 16th CSI International Symposium on Artificial Intelligence and Signal Processing (AISP 2012).

[26]  Giorgio Giacinto,et al.  Neighborhood-based feature weighting for relevance feedback in content-based retrieval , 2009, 2009 10th Workshop on Image Analysis for Multimedia Interactive Services.

[27]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[28]  Giorgio Giacinto,et al.  Ten Years of Relevance Score for Content Based Image Retrieval , 2018, MLDM.

[29]  Giorgio Giacinto,et al.  A nearest-neighbor approach to relevance feedback in content based image retrieval , 2007, CIVR '07.

[30]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[31]  Anastasios Tefas,et al.  Deep convolutional image retrieval: A general framework , 2018, Signal Process. Image Commun..

[32]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[33]  Pascal Vincent,et al.  Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Xiang Zhang,et al.  OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks , 2013, ICLR.

[35]  Amol D. Rahulkar,et al.  New design of adaptive Gabor wavelet filter bank for medical image retrieval , 2020, IET Image Process..

[36]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[37]  J. J. Rocchio,et al.  Relevance feedback in information retrieval , 1971 .

[38]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[39]  Dimitris N. Metaxas,et al.  Computer-Aided Diagnosis of Mammographic Masses Using Scalable Image Retrieval , 2015, IEEE Transactions on Biomedical Engineering.

[40]  Thomas S. Huang,et al.  Relevance feedback: a power tool for interactive content-based image retrieval , 1998, IEEE Trans. Circuits Syst. Video Technol..

[41]  Chi-Ren Shyu,et al.  Relevance feedback decision trees in content-based image retrieval , 2000, 2000 Proceedings Workshop on Content-based Access of Image and Video Libraries.

[42]  K. Hemachandran,et al.  Content-based image retrieval using the combination of the fast wavelet transformation and the colour histogram , 2012 .

[43]  Mireya Saraí García Vázquez,et al.  Human vision perceptual color based semantic image retrieval with relevance feedback , 2018 .

[44]  Manisha Sharma,et al.  Deep CNN Combined With Relevance Feedback for Trademark Image Retrieval , 2018, J. Intell. Syst..

[45]  Oussama Aiadi,et al.  Image representation using complete multi-texton histogram , 2020, Multimedia Tools and Applications.

[46]  Claudio Gutierrez,et al.  Survey of graph database models , 2008, CSUR.

[47]  Marcello Pelillo,et al.  Content-based image retrieval with relevance feedback using random walks , 2011, Pattern Recognit..

[48]  Xi Zhang,et al.  Feature integration analysis of bag-of-features model for image retrieval , 2013, Neurocomputing.

[49]  Edward A. Fox,et al.  A genetic programming framework for content-based image retrieval , 2009, Pattern Recognit..

[50]  Jurandy Almeida,et al.  A scalable re-ranking method for content-based image retrieval , 2014, Inf. Sci..

[51]  Oge Marques Visual Information Retrieval: The State of the Art , 2016, IT Professional.

[52]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[53]  Wei-Yang Lin,et al.  The effect of low-level image features on pseudo relevance feedback , 2015, Neurocomputing.

[54]  P. Mohan Kumar,et al.  Relevance Feedback Base User Convenient Semantic Query Processing Using Neural Network , 2018 .

[55]  Stefan Carlsson,et al.  CNN Features Off-the-Shelf: An Astounding Baseline for Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.