Image tag re-ranking by coupled probability transition

The large amount of user-tagged images on social networks is helpful to facilitate image management and image search. However, many tags are weakly relevant or irrelevant to the visual content, resulting in unsatisfactory performance in tag related applications. In this paper, we propose a coupled probability transition algorithm to estimate the text-visual group relevance from the observed data and then leverage it to predict tag relevance for a new query image. The visual group for a given tag is a cluster of images that are visually similar and share the same tag. The tag-visual group relevance is uncovered by exploiting the mutual reinforcement in visual space and semantic space alternatively. Experiments on NUS-WIDE dataset show the validity and superiority of the proposed approach over existing methods.

[1]  Yihong Gong,et al.  Locality-constrained Linear Coding for image classification , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[2]  Dong Liu,et al.  Tag ranking , 2009, WWW '09.

[3]  Changhu Wang,et al.  Image annotation refinement using random walk with restarts , 2006, MM '06.

[4]  Tat-Seng Chua,et al.  NUS-WIDE: a real-world web image database from National University of Singapore , 2009, CIVR '09.

[5]  Wei-Ying Ma,et al.  Multi-model similarity propagation and its application for web image retrieval , 2004, MULTIMEDIA '04.

[6]  Qi Tian,et al.  Semi-automatic Flickr Group Suggestion , 2011, MMM.

[7]  Qi Tian,et al.  Exploring tag relevance for image tag re-ranking , 2012, SIGIR '12.

[8]  Qi Tian,et al.  Query expansion enhancement by fast binary matching , 2012, ACM Multimedia.

[9]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.

[10]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .

[11]  Qi Tian,et al.  Refining image retrieval using one-class classification , 2009, 2009 IEEE International Conference on Multimedia and Expo.

[12]  Qi Tian,et al.  Spatial coding for large scale partial-duplicate web image search , 2010, ACM Multimedia.

[13]  Jaana Kekäläinen,et al.  Cumulated gain-based evaluation of IR techniques , 2002, TOIS.

[14]  Gang Hua,et al.  Descriptive visual words and visual phrases for image applications , 2009, ACM Multimedia.

[15]  Changhu Wang,et al.  Content-Based Image Annotation Refinement , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.