Learning to rank for content-based image retrieval

In Content-based Image Retrieval (CBIR), accurately ranking the returned images is of paramount importance, since users consider mostly the topmost results. The typical ranking strategy used by many CBIR systems is to employ image content descriptors, so that returned images that are most similar to the query image are placed higher in the rank. While this strategy is well accepted and widely used, improved results may be obtained by combining multiple image descriptors. In this paper we explore this idea, and introduce algorithms that learn to combine information coming from different descriptors. The proposed learning to rank algorithms are based on three diverse learning techniques: Support Vector Machines (CBIR-SVM), Genetic Programming (CBIR-GP), and Association Rules (CBIR-AR). Eighteen image content descriptors(color, texture, and shape information) are used as input and provided as training to the learning algorithms. We performed a systematic evaluation involving two complex and heterogeneous image databases (Corel e Caltech) and two evaluation measures (Precision and MAP). The empirical results show that all learning algorithms provide significant gains when compared to the typical ranking strategy in which descriptors are used in isolation. We concluded that, in general, CBIR-AR and CBIR-GP outperforms CBIR-SVM. A fine-grained analysis revealed the lack of correlation between the results provided by CBIR-AR and the results provided by the other two algorithms, which indicates the opportunity of an advantageous hybrid approach.

[1]  Jamshid Shanbehzadeh,et al.  Image retrieval based on shape similarity by edge orientation autocorrelogram , 2003, Pattern Recognit..

[2]  Pável Calado,et al.  A combined component approach for finding collection-adapted ranking functions based on genetic programming , 2007, SIGIR.

[3]  Tie-Yan Liu,et al.  Adapting ranking SVM to document retrieval , 2006, SIGIR.

[4]  B. S. Manjunath,et al.  Color and texture descriptors , 2001, IEEE Trans. Circuits Syst. Video Technol..

[5]  Michael J. Swain,et al.  Color indexing , 1991, International Journal of Computer Vision.

[6]  Mohammed J. Zaki,et al.  Lazy Associative Classification , 2006, Sixth International Conference on Data Mining (ICDM'06).

[7]  Nenghai Yu,et al.  Multiple-instance ranking: Learning to rank images for image retrieval , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[9]  Filip Radlinski,et al.  A support vector method for optimizing average precision , 2007, SIGIR.

[10]  Bo Tao,et al.  Texture Recognition and Image Retrieval Using Gradient Indexing , 2000, J. Vis. Commun. Image Represent..

[11]  Wagner Meira,et al.  Learning to rank at query-time using association rules , 2008, SIGIR '08.

[12]  Alistair Moffat,et al.  Exploring the similarity space , 1998, SIGF.

[13]  Ricardo da Silva Torres,et al.  Content-Based Image Retrieval: Theory and Applications , 2006, RITA.

[14]  Thorsten Joachims,et al.  Training linear SVMs in linear time , 2006, KDD '06.

[15]  Jing Huang,et al.  Image indexing using color correlograms , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[16]  Adam Williams,et al.  Content-based image retrieval using joint correlograms , 2007, Multimedia Tools and Applications.

[17]  Carla E. Brodley,et al.  Interactive Content-based Image Retrieval Using Relevance Feedback , 2002 .

[18]  Rafael C. González,et al.  Local Determination of a Moving Contrast Edge , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Matthieu Cord,et al.  Active Learning Methods for Interactive Image Retrieval , 2008, IEEE Transactions on Image Processing.

[20]  Chin-Chen Chang,et al.  Color image retrieval technique based on color features and image bitmap , 2007, Inf. Process. Manag..

[21]  Hong Zhao,et al.  Automatic feature weight assignment based on genetic algorithm for image retrieval , 2003, IEEE International Conference on Robotics, Intelligent Systems and Signal Processing, 2003. Proceedings. 2003.

[22]  Pietro Perona,et al.  Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[23]  Edward A. Fox,et al.  A genetic programming framework for content-based image retrieval , 2009, Pattern Recognit..

[24]  Neucimar J. Leite,et al.  Wavelet-based fingerprint image retrieval , 2009 .

[25]  Stephen J. McKenna,et al.  Learning Query-Dependent Distance Metrics for Interactive Image Retrieval , 2009, ICVS.

[26]  Fatos T. Yarman-Vural,et al.  SASI: a generic texture descriptor for image retrieval , 2003, Pattern Recognit..

[27]  Usama M. Fayyad,et al.  Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning , 1993, IJCAI.

[28]  James Ze Wang,et al.  Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.

[29]  Weiguo Fan,et al.  Image Retrieval with Relevance Feedback based on Genetic Programming , 2008, SBBD.

[30]  Thore Graepel,et al.  Large Margin Rank Boundaries for Ordinal Regression , 2000 .

[31]  Bo Zhang,et al.  Support vector machine learning for image retrieval , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[32]  Ramin Zabih,et al.  Comparing images using color coherence vectors , 1997, MULTIMEDIA '96.

[33]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[34]  Tao Qin,et al.  LETOR: Benchmark Dataset for Research on Learning to Rank for Information Retrieval , 2007 .

[35]  Hyoung-Joo Kim,et al.  A fast content-based indexing and retrieval technique by the shape information in large image database , 2001, J. Syst. Softw..

[36]  Thorsten Joachims,et al.  Optimizing search engines using clickthrough data , 2002, KDD.

[37]  Jitendra Malik,et al.  Image Retrieval and Classification Using Local Distance Functions , 2006, NIPS.

[38]  Markus A. Stricker,et al.  Similarity of color images , 1995, Electronic Imaging.

[39]  Quan Liu,et al.  An Orientation Independent Texture Descriptor for Image Retrieval , 2007, 2007 International Conference on Communications, Circuits and Systems.

[40]  Mario A. Nascimento,et al.  A compact and efficient image retrieval approach based on border/interior pixel classification , 2002, CIKM '02.

[41]  Qi Tian,et al.  Incorporate support vector machines to content-based image retrieval with relevance feedback , 2000, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101).

[42]  Matti Pietikäinen,et al.  Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[43]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[44]  Stephan Volmer,et al.  Color co-occurrence descriptors for querying-by-example , 1998, Proceedings 1998 MultiMedia Modeling. MMM'98 (Cat. No.98EX200).

[45]  Peter J. Bickel,et al.  The Earth Mover's distance is the Mallows distance: some insights from statistics , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[46]  Michael Unser,et al.  Sum and Difference Histograms for Texture Classification , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[47]  Neucimar J. Leite,et al.  Wavelet-based Feature Extraction for Fingerprint Image Retrieval , 2007 .

[48]  Weiguo Fan,et al.  Genetic Programming-Based Discovery of Ranking Functions for Effective Web Search , 2005, J. Manag. Inf. Syst..