论文信息 - Tag-based social image retrieval: An empirical evaluation

Tag-based social image retrieval: An empirical evaluation

Tags associated with social images are valuable information source for superior image search and retrieval experiences. Although various heuristics are valuable to boost tag-based search for images, there is a lack of general framework to study the impact of these heuristics. Specifically, the task of ranking images matching a given tag query based on their associated tags in descending order of relevance has not been well studied. In this article, we take the first step to propose a generic, flexible, and extensible framework for this task and exploit it for a systematic and comprehensive empirical evaluation of various methods for ranking images. To this end, we identified five orthogonal dimensions to quantify the matching score between a tagged image and a tag query. These five dimensions are: (i) tag relatedness to measure the degree of effectiveness of a tag describing the tagged image; (ii) tag discrimination to quantify the degree of discrimination of a tag with respect to the entire tagged image collection; (iii) tag length normalization analogous to document length normalization in web search; (iv) tag-query matching model for the matching score computation between an image tag and a query tag; and (v) query model for tag query rewriting. For each dimension, we identify a few implementations and evaluate their impact on NUS-WIDE dataset, the largest human-annotated dataset consisting of more than 269K tagged images from Flickr . We evaluated 81 single-tag queries and 443 multi-tag queries over 288 search methods and systematically compare their performances using standard metrics including Precision at top-K, Mean Average Precision ( MAP ), Recall, and Normalized Discounted Cumulative Gain (NDCG). (This work was done during Ge Bai's intership at NTU.) © 2011 Wiley Periodicals, Inc.

[1] Qi Tian,et al. What are the high-level concepts with small semantic gaps? , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[2] Shuicheng Yan,et al. Image tag refinement towards low-rank, content-tag prior and error sparsity , 2010, ACM Multimedia.

[3] W. Bruce Croft,et al. Query expansion using local and global document analysis , 1996, SIGIR '96.

[4] Marcel Worring,et al. Learning tag relevance by neighbor voting for social image retrieval , 2008, MIR '08.

[5] Jon M. Kleinberg,et al. Mapping the world's photos , 2009, WWW '09.

[6] Roelof van Zwol,et al. TAGEXPLORER: FACETED BROWSING OF FLICKR PHOTOS , 2010 .

[7] Roelof van Zwol,et al. Classifying tags using open content resources , 2009, WSDM '09.

[8] Dong Liu,et al. Image retagging , 2010, ACM Multimedia.

[9] Shuicheng Yan,et al. Inferring semantic concepts from community-contributed images and noisy tags , 2009, ACM Multimedia.

[10] Sourav S. Bhowmick,et al. Quantifying tag representativeness of visual content of social images , 2010, ACM Multimedia.

[11] James Ze Wang,et al. Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.

[12] Besiki Stvilia,et al. Member activities and quality of tags in a collection of historical photographs in Flickr , 2010, J. Assoc. Inf. Sci. Technol..

[13] Wolfgang Nejdl,et al. Automatically Identifying Tag Types , 2009, ADMA.

[14] Bernardo A. Huberman,et al. Usage patterns of collaborative tagging systems , 2006, J. Inf. Sci..

[15] Gerard Salton,et al. Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[16] Alton Yeow-Kuan Chua,et al. Fight or unite: Investigating game genres for image tagging , 2011, J. Assoc. Inf. Sci. Technol..

[17] Fabrizio Sebastiani,et al. Machine learning in automated text categorization , 2001, CSUR.

[18] Nenghai Yu,et al. Flickr distance , 2008, ACM Multimedia.

[19] Gerhard Weikum,et al. Gathering and ranking photos of named entities with high precision, high recall, and diversity , 2010, WSDM '10.

[20] Schubert Foo,et al. Upper tag ontology for integrating social tagging data , 2010, J. Assoc. Inf. Sci. Technol..

[21] Roelof van Zwol,et al. Flickr tag recommendation based on collective knowledge , 2008, WWW.

[22] Sourav S. Bhowmick,et al. Social image tag recommendation by concept matching , 2011, ACM Multimedia.

[23] Marcel Worring,et al. Learning Social Tag Relevance by Neighbor Voting , 2009, IEEE Transactions on Multimedia.

[24] Alistair Moffat,et al. Exploring the similarity space , 1998, SIGF.

[25] Gustavo Carneiro,et al. Supervised Learning of Semantic Classes for Image Annotation and Retrieval , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26] Abebe Rorissa,et al. A comparative study of Flickr tags and index terms in a general image collection , 2010, J. Assoc. Inf. Sci. Technol..

[27] Mor Naaman,et al. Why we tag: motivations for annotation in mobile and online media , 2007, CHI.

[28] Kilian Q. Weinberger,et al. Resolving tag ambiguity , 2008, ACM Multimedia.

[29] Pavel Serdyukov,et al. Placing flickr photos on a map , 2009, SIGIR.

[30] Mor Naaman,et al. Generating diverse and representative image search results for landmarks , 2008, WWW.

[31] Hila Becker,et al. Learning similarity metrics for event identification in social media , 2010, WSDM '10.

[32] Tat-Seng Chua,et al. NUS-WIDE: a real-world web image database from National University of Singapore , 2009, CIVR '09.

[33] R. Manmatha,et al. Automatic image annotation and retrieval using cross-media relevance models , 2003, SIGIR.

[34] M E J Newman,et al. Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[35] Christopher D. Manning,et al. Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[36] Ximena Olivares,et al. Visual diversification of image search results , 2009, WWW '09.

[37] Vladimir Pavlovic,et al. A New Baseline for Image Annotation , 2008, ECCV.

[38] Sourav S. Bhowmick,et al. Image tag clarity: in search of visual-representative tags for social images , 2009, WSM@MM.

[39] Wolfgang Nejdl,et al. Can all tags be used for search? , 2008, CIKM '08.

[40] Ramesh C. Jain,et al. Content without context is meaningless , 2010, ACM Multimedia.

[41] R. Manmatha,et al. Multiple Bernoulli relevance models for image and video annotation , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[42] Alla Zollers,et al. Emerging Motivations for Tagging: Expression, Performance, and Activism , 2007 .

[43] Nenghai Yu,et al. Learning to tag , 2009, WWW '09.

[44] Bettina Berendt,et al. Tags are not metadata, but "just more content" - to some people , 2007, ICWSM.

[45] Mor Naaman,et al. HT06, tagging paper, taxonomy, Flickr, academic article, to read , 2006, HYPERTEXT '06.

[46] Paul M. B. Vitányi,et al. The Google Similarity Distance , 2004, IEEE Transactions on Knowledge and Data Engineering.

[47] William W. Cohen. Fast Effective Rule Induction , 1995, ICML.

[48] Marcel Worring,et al. Unsupervised multi-feature tag relevance learning for social image retrieval , 2010, CIVR '10.

[49] Ling Chen,et al. Event detection from flickr data through wavelet-based spatial analysis , 2009, CIKM.

[50] Cordelia Schmid,et al. TagProp: Discriminative metric learning in nearest neighbor models for image auto-annotation , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[51] Dong Liu,et al. Tag ranking , 2009, WWW '09.

[52] Keiji Yanai,et al. Evaluation strategies for image understanding and retrieval , 2005, MIR '05.