Tag-based social image retrieval: An empirical evaluation

Tags associated with social images are a valuable information source for improving image search and retrieval. Although various heuristics are valuable for boosting tag-based image search, there is no general framework for studying the impact of these heuristics. Specifically, the task of ranking images that match a given tag query, based on their associated tags and in descending order of relevance, has not been well studied. In this article, we take a first step by proposing a generic, flexible, and extensible framework for this task and use it for a systematic and comprehensive empirical evaluation of various image-ranking methods. To this end, we identify five orthogonal dimensions for quantifying the matching score between a tagged image and a tag query: (i) tag relatedness, which measures how effectively a tag describes the tagged image; (ii) tag discrimination, which quantifies how well a tag discriminates with respect to the entire tagged image collection; (iii) tag length normalization, analogous to document length normalization in web search; (iv) the tag-query matching model, which computes the matching score between an image tag and a query tag; and (v) the query model, for tag query rewriting. For each dimension, we identify a few implementations and evaluate their impact on the NUS-WIDE dataset, the largest human-annotated dataset, consisting of more than 269K tagged images from Flickr. We evaluate 81 single-tag queries and 443 multi-tag queries over 288 search methods and systematically compare their performance using standard metrics, including Precision at top-K, Mean Average Precision (MAP), Recall, and Normalized Discounted Cumulative Gain (NDCG). (This work was done during Ge Bai's internship at NTU.) © 2011 Wiley Periodicals, Inc.
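
The evaluation metrics named in the abstract can be illustrated with a minimal Python sketch. It uses the textbook definitions of Precision at top-K, Recall, Average Precision (averaged over queries to give MAP), and NDCG. The abstract does not state which variants the authors used (for example, the NDCG gain and discount functions), so the function names, the log2 discount, and the toy relevance labels below are assumptions made for illustration only.

```python
import math


def precision_at_k(relevance, k):
    """Fraction of the top-k ranked results that are relevant (label > 0)."""
    if k <= 0:
        raise ValueError("k must be positive")
    return sum(1 for r in relevance[:k] if r > 0) / k


def recall_at_k(relevance, k, total_relevant):
    """Fraction of all relevant items that appear in the top-k results."""
    if total_relevant <= 0:
        return 0.0
    return sum(1 for r in relevance[:k] if r > 0) / total_relevant


def average_precision(relevance, total_relevant=None):
    """Mean of the precision values at each rank where a relevant item occurs.

    If total_relevant is omitted, the number of relevant items found in the
    ranked list is used as the denominator (an assumption of this sketch).
    """
    hits = 0
    precision_sum = 0.0
    for rank, r in enumerate(relevance, start=1):
        if r > 0:
            hits += 1
            precision_sum += hits / rank
    denom = hits if total_relevant is None else total_relevant
    return precision_sum / denom if denom else 0.0


def mean_average_precision(runs):
    """Average of average_precision over a list of per-query rankings."""
    return sum(average_precision(r) for r in runs) / len(runs) if runs else 0.0


def dcg_at_k(gains, k):
    """Discounted cumulative gain with a log2(rank + 1) discount."""
    return sum(g / math.log2(i + 1) for i, g in enumerate(gains[:k], start=1))


def ndcg_at_k(gains, k):
    """DCG normalized by the DCG of the ideal (descending-gain) ordering."""
    ideal = dcg_at_k(sorted(gains, reverse=True), k)
    return dcg_at_k(gains, k) / ideal if ideal else 0.0


# Hypothetical ranked result list for one tag query: 1 = relevant, 0 = not.
ranked = [1, 0, 1, 1, 0]
print(precision_at_k(ranked, 3))
print(recall_at_k(ranked, 3, total_relevant=3))
print(average_precision(ranked))
print(ndcg_at_k(ranked, 5))
```

Each function takes the relevance labels of one ranked result list, so comparing many search methods, as the study does, amounts to computing these values per query and per method and then averaging over queries.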
