Toward an Estimation of User Tagging Credibility for Social Image Retrieval

Existing image retrieval systems exploit textual or/and visual information to return results. The retrieval process is mostly focused on data themselves and disregards the data sources. In Web 2.0 platforms, the quality of annotations provided by different users can vary strongly. To account for this variability, we complement existing methods by introducing user tagging credibility in the retrieval process. Tagging credibility is automatically estimated by leveraging a large set of visual concept classifiers learned with Overfeat, a convolutional neural network (CNN) based feature. A good image retrieval system should return results that are both relevant and diversified and tackle both challenges. We diversify results by using a k-Means algorithm with user based cluster ranking. We increase relevance by favoring images uploaded from users with good credibility estimates. Evaluation is performed on DIV400, a publicly available social image retrieval dataset. Experiments show that our method provides interesting performances compared to existing approaches.

[1]  Barbara Poblete,et al.  Information credibility on twitter , 2011, WWW.

[2]  Xuelong Li,et al.  Visual-Textual Joint Relevance Learning for Tag-Based Social Image Search , 2013, IEEE Transactions on Image Processing.

[3]  Ximena Olivares,et al.  Visual diversification of image search results , 2009, WWW '09.

[4]  Jia Deng,et al.  A large-scale hierarchical image database , 2009, CVPR 2009.

[5]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[6]  I. Vlahavas,et al.  Finding Diverse Images at MediaEval 2014 , 2014 .

[7]  Henning Müller,et al.  Div400: a social image retrieval result diversification dataset , 2014, MMSys '14.

[8]  Chih-Jen Lin,et al.  Trust Region Newton Method for Logistic Regression , 2008, J. Mach. Learn. Res..

[9]  Chih-Jen Lin,et al.  Trust region Newton methods for large-scale logistic regression , 2007, ICML '07.

[10]  Neha Jain,et al.  Experiments in Diversifying Flickr Result Sets , 2013, MediaEval.

[11]  Fei-Fei Li,et al.  ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Yiannis Kompatsiaris,et al.  USEMP: Finding Diverse Images at MediaEval 2015 , 2015, MediaEval.

[13]  Mor Naaman,et al.  Generating diverse and representative image search results for landmarks , 2008, WWW.

[14]  Panagiotis G. Ipeirotis,et al.  Quality management on Amazon Mechanical Turk , 2010, HCOMP '10.

[15]  Xiang Zhang,et al.  OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks , 2013, ICLR.

[16]  Katsumi Tanaka,et al.  ImageAlert: credibility analysis of text-image pairs on the web , 2011, SAC '11.

[17]  Marcel Worring,et al.  Learning tag relevance by neighbor voting for social image retrieval , 2008, MIR '08.

[18]  Yiannis Kompatsiaris,et al.  SocialSensor: Finding Diverse Images at MediaEval 2014 , 2014, MediaEval.

[19]  Karl Aberer,et al.  Web Credibility: Features Exploration and Credibility Prediction , 2013, ECIR.

[20]  Ling Xu,et al.  Credibility-Oriented Ranking of Multimedia News Based on a Material-Opinion Model , 2011, WAIM.