Qualitative evaluation of automatic assignment of keywords to images

In image retrieval, most systems lack user-centred evaluation since they are assessed by some chosen ground truth dataset. The results reported through precision and recall assessed against the ground truth are thought of as being an acceptable surrogate for the judgment of real users. Much current research focuses on automatically assigning keywords to images for enhancing retrieval effectiveness. However, evaluation methods are usually based on system-level assessment, e.g. classification accuracy based on some chosen ground truth dataset. In this paper, we present a qualitative evaluation methodology for automatic image indexing systems. The automatic indexing task is formulated as one of image annotation, or automatic metadata generation for images. The evaluation is composed of two individual methods. First, the automatic indexing annotation results are assessed by human subjects. Second, the subjects are asked to annotate some chosen images as the test set whose annotations are used as ground truth. Then, the system is tested by the test set whose annotation results are judged against the ground truth. Only one of these methods is reported for most systems on which user-centred evaluation are conducted. We believe that both methods need to be considered for full evaluation. We also provide an example evaluation of our system based on this methodology. According to this study, our proposed evaluation methodology is able to provide deeper understanding of the system's performance.

[1]  John Tait,et al.  Automatic Metadata Annotation of Images via a Two-Level Learning Framework , 2004 .

[2]  Sudeep Sarkar,et al.  Comparison of edge detectors: a methodology and initial study , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[3]  Mohan S. Kankanhalli,et al.  Content-Based Image Retrieval Using a Composite Color-Shape Approach , 1998, Inf. Process. Manag..

[4]  Stan Z. Li,et al.  A Performance Evaluation Protocol for Content-Based Image Retrieval Algorithms/Systems , 2001 .

[5]  Howard Greisdorf Information Seeking Behaviour in Image Retrieval: VISOR I Final Report , 2002, J. Documentation.

[6]  Donna Harman,et al.  Information Processing and Management , 2022 .

[7]  Sudeep Sarkar,et al.  Comparison of Edge Detectors: A Methodology and Initial Study , 1998, Comput. Vis. Image Underst..

[8]  Andrew Turpin,et al.  Challenging conventional assumptions of automated information retrieval with real users: Boolean searching and batch retrieval evaluations , 2001, Inf. Process. Manag..

[9]  Jian-Kang Wu,et al.  Fuzzy Content-based Retrieval in Image Databases , 1998, Inf. Process. Manag..

[10]  F. Moore Cognitive development and the acquisition of language , 1973 .

[11]  Tefko Saracevic,et al.  Evaluation of evaluation in information retrieval , 1995, SIGIR '95.

[12]  E. Rosch ON THE INTERNAL STRUCTURE OF PERCEPTUAL AND SEMANTIC CATEGORIES1 , 1973 .

[13]  Raimondo Schettini,et al.  A relevance feedback mechanism for content-based image retrieval , 1999, Inf. Process. Manag..

[14]  R. Pagano Understanding Statistics in the Behavioral Sciences , 1981 .

[15]  Ian H. Jermyn,et al.  Psychovisual evaluation of image segmentation algorithms , 2002 .

[16]  Marius Tico,et al.  A Test Collection for the Evaluation of Content-Based Image Retrieval Algorithms—A User and Task-Based Approach , 2001, Information Retrieval.

[17]  John P. Eakins,et al.  Towards intelligent image retrieval , 2002, Pattern Recognit..

[18]  Sung-Hyon Myaeng,et al.  Image organization and retrieval with automatically constructed feature vectors , 1996, SIGIR '96.

[19]  J. Chamorro-Mart Modelling subjectivity in visual perception of orientation for image retrieval , 2003 .

[20]  David A. Forsyth,et al.  Matching Words and Pictures , 2003, J. Mach. Learn. Res..

[21]  Raya Fidel,et al.  The effect of query type on subject searching behavior of image databases (poster session): an exploratory study , 2000, SIGIR '00.

[22]  Kerry Rodden,et al.  Does organisation by similarity assist image browsing? , 2001, CHI.

[23]  Amanda Spink,et al.  A user-centered approach to evaluating human interaction with Web search engines: an exploratory study , 2002, Inf. Process. Manag..

[24]  Thierry Pun,et al.  Assessing agreement between human and machine clusterings of image databases , 1998, Pattern Recognit..

[25]  Anil K. Jain,et al.  Image classification for content-based indexing , 2001, IEEE Trans. Image Process..

[26]  Joemon M. Jose,et al.  Spatial querying for image retrieval: a user-oriented evaluation , 1998, SIGIR '98.

[27]  Kobus Barnard,et al.  Method for comparing content based image retrieval methods , 2003, IS&T/SPIE Electronic Imaging.

[28]  Linda Schamber Relevance and Information Behavior. , 1994 .

[29]  John Tait,et al.  Search strategies in content-based image retrieval , 2003, SIGIR.

[30]  Corinne Jörgensen,et al.  Attributes of Images in Describing Tasks , 1998, Inf. Process. Manag..

[31]  Sethuraman Panchanathan,et al.  A Method for Evaluating the Performance of Content-Based Image Retrieval Systems Based on Subjectively Determined Similarity between Images , 2002, CIVR.

[32]  Edie M. Rasmussen,et al.  Users' relevance criteria in image retrieval in American history , 2002, Inf. Process. Manag..

[33]  Chris Buckley,et al.  OHSUMED: an interactive retrieval evaluation and new large test collection for research , 1994, SIGIR '94.

[34]  Amanda Spink,et al.  Image searching on the Excite Web search engine , 2001, Inf. Process. Manag..

[35]  James Ze Wang,et al.  Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[36]  John Tait,et al.  Evaluating a content based image retrieval system , 2001, SIGIR '01.

[37]  Rosalind W. Picard,et al.  Texture orientation for sorting photos "at a glance" , 1994, Proceedings of 12th International Conference on Pattern Recognition.

[38]  Masafumi Hagiwara,et al.  An image retrieval system by impression words and specific object names - IRIS , 2002, Neurocomputing.

[39]  Jitendra Malik,et al.  A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[40]  W. Bruce Croft,et al.  Corpus-based stemming using cooccurrence of word variants , 1998, TOIS.

[41]  Peter G. B. Enser,et al.  Analysis of user need in image archives , 1997, J. Inf. Sci..

[42]  John Tait,et al.  Image classification using hybrid neural networks , 2003, SIGIR.

[43]  Henning Müller,et al.  Learning Feature Weights from User Behavior in Content-Based Image Retrieval , 2000, MDM/KDD.

[44]  Mark D. Dunlop Reflections on Mira: Interactive evaluation in information retrieval , 2000, J. Am. Soc. Inf. Sci..

[45]  James Ze Wang,et al.  Evaluation strategies for automatic linguistic indexing of pictures , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[46]  Daniel Sánchez,et al.  Modelling subjectivity in visual perception of orientation for image retrieval , 2003, Inf. Process. Manag..

[47]  Eero Sormunen,et al.  End-User Searching Challenges Indexing Practices in the Digital Newspaper Photo Archive , 2004, Information Retrieval.

[48]  Tom Minka,et al.  Interactive learning with a "society of models" , 1997, Pattern Recognit..

[49]  Nicholas J. Belkin,et al.  Iterative exploration, design and evaluation of support for query reformulation in interactive information retrieval , 2001, Inf. Process. Manag..

[50]  Shih-Fu Chang,et al.  A conceptual framework and empirical research for classifying visual descriptors , 2001 .

[51]  Ingemar J. Cox,et al.  The Bayesian image retrieval system, PicHunter: theory, implementation, and psychophysical experiments , 2000, IEEE Trans. Image Process..

[52]  Jae Won Lee,et al.  Content-based image classification using a neural network , 2004, Pattern Recognit. Lett..

[53]  Hsin-Liang Chen,et al.  An analysis of image retrieval tasks in the field of art history , 2001, Inf. Process. Manag..

[54]  David Hawking,et al.  Overview of the TREC 2003 Web Track , 2003, TREC.

[55]  Hong Xie,et al.  Planned and Situated Aspects in Interactive IR: Patterns of User Interactive Intentions and Information Seeking Strategies , 1997 .

[56]  Vijay V. Raghavan,et al.  Modeling and retrieving images by content , 1997, Inf. Process. Manag..

[57]  Raya Fidel,et al.  The Effect of Query Type on Subject Searching Behavior of Image Databases: an Exploratory Study , 2000, SIGIR 2000.

[58]  Raya Fidel,et al.  The image retrieval task: implications for the design and evaluation of image databases , 1997, New Rev. Hypermedia Multim..

[59]  Rachel Applegate,et al.  Models of User Satisfaction: Understanding False Positives , 1993 .

[60]  Thierry Pun,et al.  The Truth about Corel - Evaluation in Image Retrieval , 2002, CIVR.