The tags on social media websites such as Flickr are frequently imprecise and incomplete, thus there is still a gap between these tags and the actual content of the images. This paper proposes a social image ``retagging'' scheme that aims at assigning images with better content descriptors. The refining process is formulated as an optimization framework based on the consistency between ``visual similarity'' and ``semantic similarity'' in social images. An effective iterative bound optimization algorithm is applied to learn the optimal tag assignment. In addition, as many tags are intrinsically not closely-related to the visual content of the images, we employ a knowledge-based method to differentiate visual content related from unrelated tags and then constrain the tagging vocabulary of our automatic algorithm within the content related tags. Experimental results on a Flickr image collection demonstrate the effectiveness of this approach.
[1]
Dong Liu,et al.
Tag ranking
,
2009,
WWW '09.
[2]
Christiane Fellbaum,et al.
Book Reviews: WordNet: An Electronic Lexical Database
,
1999,
CL.
[3]
Dekang Lin,et al.
Using Syntactic Dependency as Local Context to Resolve Word Sense Ambiguity
,
1997,
ACL.
[4]
Dong Liu,et al.
Tag quality improvement for social images
,
2009,
2009 IEEE International Conference on Multimedia and Expo.
[5]
Yi Liu,et al.
Semi-supervised Multi-label Learning by Constrained Non-negative Matrix Factorization
,
2006,
AAAI.
[6]
Changhu Wang,et al.
Content-Based Image Annotation Refinement
,
2007,
2007 IEEE Conference on Computer Vision and Pattern Recognition.