Extracting information from a query image, for content based image retrieval

This paper describes a method to solve the problem of Object-Centric Content Based Image Retrieval (CBIR), motivated by concepts from theory of cognitive sciences. According to cognitive models, there are two lobes in human brain, one is responsible to solve the problem of object recognition, while the other solves the problem of localization (or detection). It is the exchange of mutual information (back and forth) between these two lobes which enables the human brain to simultaneously detect and recognize the objects in a complex scene. Based on the above cognitive paradigm, we propose a RADAR (Retrieval After Detection and Recognition) framework, to solve the problem of object-centric CBIR. Our proposed RADAR framework helps to simultaneously detect and recognize the object in a given test sample for retrieval task. Object-Centric CBIR retrieves a set of samples from a database of identified categories, similar to the categories in a query image using a matching criteria based on features extracted from the localized region(s). Results are shown using various real-world datasets (including PASCAL VOC) to exhibit the performance of the proposed method.

[1]  ZissermanAndrew,et al.  The Pascal Visual Object Classes Challenge , 2015 .

[2]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[3]  B. Baars,et al.  Cognition, Brain, and Consciousness: Introduction to Cognitive Neuroscience , 2007 .

[4]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[5]  Hui Zhang,et al.  Localized Content-Based Image Retrieval , 2008, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Pietro Perona,et al.  Graph-Based Visual Saliency , 2006, NIPS.

[7]  Jianguo Zhang,et al.  The PASCAL Visual Object Classes Challenge , 2006 .

[8]  Feiping Nie,et al.  Heterogeneous Visual Features Fusion via Sparse Multimodal Machine , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Tieniu Tan,et al.  Boosted local structured HOG-LBP for object localization , 2011, CVPR 2011.

[10]  Rong-Tai Chen,et al.  A smart content-based image retrieval system based on color and texture feature , 2009, Image Vis. Comput..

[11]  Lei Zhang,et al.  Image retrieval based on micro-structure descriptor , 2011, Pattern Recognit..

[12]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Gareth Funka-Lea,et al.  Graph Cuts and Efficient N-D Image Segmentation , 2006, International Journal of Computer Vision.

[14]  Yang Wang,et al.  Kernel Latent SVM for Visual Recognition , 2012, NIPS.

[15]  Jian Sun,et al.  Saliency Optimization from Robust Background Detection , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Andrew Blake,et al.  "GrabCut" , 2004, ACM Trans. Graph..

[17]  Gustavo Deco,et al.  Computational neuroscience of vision , 2002 .

[18]  Sukhendu Das,et al.  SLAR (Simultaneous Localization And Recognition) Framework for Smart CBIR , 2012, PerMIn.

[19]  Jing-Yu Yang,et al.  Content-based image retrieval using color difference histogram , 2013, Pattern Recognit..

[20]  Honggang Zhang,et al.  Re-ranking using compression-based distance measure for Content-based Commercial Product Image Retrieval , 2012, 2012 19th IEEE International Conference on Image Processing.

[21]  A. Treisman,et al.  A feature-integration theory of attention , 1980, Cognitive Psychology.

[22]  Antonio Criminisi,et al.  Object categorization by learned universal visual dictionary , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[23]  Yiquan Wu,et al.  Shape-Based Image Retrieval Using Combining Global and Local Shape Features , 2009, 2009 2nd International Congress on Image and Signal Processing.

[24]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[25]  Matti Pietikäinen,et al.  Face Description with Local Binary Patterns: Application to Face Recognition , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[27]  Urbano Nunes,et al.  Trainable classifier-fusion schemes: An application to pedestrian detection , 2009, 2009 12th International IEEE Conference on Intelligent Transportation Systems.

[28]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[29]  Huchuan Lu,et al.  Saliency Detection via Graph-Based Manifold Ranking , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[30]  Lei Zhang,et al.  Contents lists available at ScienceDirect Pattern Recognition , 2022 .

[31]  Bernhard Schölkopf,et al.  Kernel Principal Component Analysis , 1997, ICANN.