Optimizing social image search with multiple criteria: Relevance, diversity, and typicality

The explosive growth and wide-spread accessibility of community-contributed multimedia contents on the Internet have led to a surging research activity in social image search. However, the existing tag-based search methods frequently return irrelevant or redundant results. To quickly target user's intention in the result returned by an ambiguous query, we first put forward that the top-ranked search results should meet some criteria, i.e., relevance, typicality and diversity. With the three criteria, a novel ranking scheme for social image search is proposed which incorporates both semantic similarity and visual similarity. The ranking list with relevance, typicality and diversity is returned by optimizing a measure named Average Diverse Precision. The typicality score of samples is estimated via the probability density in the space of visual features. The diversity among the top-ranked list is achieved by fusing both semantic and visual similarities of images. A comprehensive approach for calculating visual similarity is considered by fusing the similarity values according to different features. To further benefit ranking performance, a data-driven method is implemented to refine the tags of social image. Comprehensive experiments demonstrate the effectiveness of the approach proposed in this paper.

[1]  Shanshan Li,et al.  Which Tags Are Related to Visual Content? , 2010, MMM.

[2]  Nenghai Yu,et al.  Learning to tag , 2009, WWW '09.

[3]  Jade Goldstein-Stewart,et al.  The use of MMR, diversity-based reranking for reordering documents and producing summaries , 1998, SIGIR '98.

[4]  Xian-Sheng Hua,et al.  Active Reranking for Web Image Search , 2010, IEEE Transactions on Image Processing.

[5]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.

[6]  Meng Wang,et al.  Beyond Distance Measurement: Constructing Neighborhood Similarity for Video Annotation , 2009, IEEE Transactions on Multimedia.

[7]  Dong Liu,et al.  Tag ranking , 2009, WWW '09.

[8]  Shumeet Baluja,et al.  Pagerank for product image search , 2008, WWW.

[9]  Roelof van Zwol,et al.  Flickr tag recommendation based on collective knowledge , 2008, WWW.

[10]  Richang Hong,et al.  A Probability Model for Image Annotation , 2007, 2007 IEEE International Conference on Multimedia and Expo.

[11]  Shih-Fu Chang,et al.  Video search reranking through random walk over document-level context graph , 2007, ACM Multimedia.

[12]  E. Parzen On Estimation of a Probability Density Function and Mode , 1962 .

[13]  Tao Mei,et al.  CrowdReranking: exploring multiple search engines for visual search reranking , 2009, SIGIR.

[14]  Xiaoou Tang,et al.  Real time google and live image search re-ranking , 2008, ACM Multimedia.

[15]  Bin Wang,et al.  Large-Scale Duplicate Detection for Web Image Search , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[16]  Meng Wang,et al.  Typicality-Based Visual Search Reranking , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[17]  Meng Wang,et al.  Accessible image search for colorblindness , 2010, TIST.

[18]  Xian-Sheng Hua,et al.  A joint appearance-spatial distance for kernel-based image categorization , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Eleanor Rosch,et al.  Principles of Categorization , 1978 .

[20]  Xian-Sheng Hua,et al.  Towards a Relevant and Diverse Search of Social Images , 2010, IEEE Transactions on Multimedia.

[21]  Ximena Olivares,et al.  Visual diversification of image search results , 2009, WWW '09.

[22]  Marcel Worring,et al.  Learning tag relevance by neighbor voting for social image retrieval , 2008, MIR '08.

[23]  Thomas S. Huang,et al.  Relevance feedback: a power tool for interactive content-based image retrieval , 1998, IEEE Trans. Circuits Syst. Video Technol..

[24]  Meng Wang,et al.  Visual tag dictionary: interpreting tags with visual words , 2009, WSMC '09.