Efficient benchmarking of content-based image retrieval via resampling

Content-based image retrieval (CBIR) is an expanding field, and new approaches to more effective retrieval are proposed frequently, yet relatively little attention has been paid to how the effectiveness of CBIR methods is evaluated. Most reported evaluations use standard IR evaluation methodologies with little consideration of their statistical significance or their appropriateness for CBIR, which makes it difficult to assess the precise impact of individual methods. In this paper, we present a new approach to evaluating CBIR systems that is both efficient and statistically sound. The approach is based on stratified sampling and provides a significant improvement over existing evaluation approaches. Comprehensive experiments using it to evaluate a range of CBIR methods show that it reduces both the estimation error and the size of the test data set required to achieve a specified level of estimation error.
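To make the idea of stratified sampling for evaluation concrete, the following is a minimal sketch of the textbook stratified estimator of a mean effectiveness score, not the procedure used in the paper. It assumes the query set has already been partitioned into strata; the names `stratified_estimate`, `strata`, `sample_sizes`, and `score` are hypothetical and chosen only for illustration. Only the sampled queries are scored, which is where the potential saving in test-set size would come from.

```python
import random
import statistics


def stratified_estimate(strata, sample_sizes, score, seed=0):
    """Estimate the mean per-query score and its standard error.

    strata:       dict mapping stratum name -> list of query ids (assumed partition)
    sample_sizes: dict mapping stratum name -> number of queries to draw
    score:        callable returning an effectiveness score for one query id
    """
    rng = random.Random(seed)
    total = sum(len(queries) for queries in strata.values())
    mean = 0.0
    variance = 0.0
    for name, queries in strata.items():
        n_h = sample_sizes[name]          # sample size in this stratum
        big_n_h = len(queries)            # stratum size
        weight = big_n_h / total          # stratum weight W_h = N_h / N
        # Simple random sample without replacement within the stratum.
        sampled_scores = [score(q) for q in rng.sample(queries, n_h)]
        mean += weight * statistics.fmean(sampled_scores)
        if n_h > 1:
            s2 = statistics.variance(sampled_scores)
            # Finite-population correction (1 - n_h / N_h) shrinks the
            # variance as the sample approaches the whole stratum.
            variance += weight ** 2 * (1 - n_h / big_n_h) * s2 / n_h
    return mean, variance ** 0.5
```

If every query in every stratum is sampled, the estimate equals the plain mean over all queries and the reported standard error is zero, which is a convenient sanity check on any implementation of this kind.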
