Integrating visual saliency and consistency for re-ranking image search results

The paper investigates two mechanisms, visual consistency and visual saliency, in web image search: (1) In most current web image search engines, such as Google Image Search and Yahoo Image Search, the images that closely related to the search query are typically visually similar. These visually consistent images which occur most frequently in the first few web pages will be given higher ranks. (2) From visual aspect, it is obvious that salient images would be easier to catch users' eyes and more likely to be clicked than the cluttered ones in low-level vision. In addition, we also observe the fact that the visually salient images in the front pages are often relevant to the user's query. The principal novelty of this paper is in combining visual saliency and consistency to re-rank the results from search engines to make the re-ranked images more satisfying in both vision and content. The experimental results on a real world web image dataset demonstrate that our approach can effectively improve the performance of image retrieval.

[1]  Zheru Chi,et al.  Attention-driven image interpretation with application to image retrieval , 2006, Pattern Recognit..

[2]  Pietro Perona,et al.  Learning object categories from Google's image search , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[3]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[4]  John R. Smith,et al.  Normalized classifier fusion for semantic visual concept detection , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[5]  Pietro Perona,et al.  A Visual Category Filter for Google Images , 2004, ECCV.

[6]  Matti Pietikäinen,et al.  Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  P. Burt Fast filter transform for image processing , 1981 .

[8]  David A. Forsyth,et al.  Animals on the Web , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[9]  Markus A. Stricker,et al.  Similarity of color images , 1995, Electronic Imaging.

[10]  P. J. Burt,et al.  Fast Filter Transforms for Image Processing , 1981 .

[11]  Boris Babenko,et al.  ImprovingWeb-based Image Search via Content Based Clustering , 2006, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06).

[12]  Xiaokang Yang,et al.  Re-ranking image search results by multiscale visual saliency model , 2010, 2010 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB).

[13]  Fred Stentiford,et al.  An Attention-Based Approach to Content-Based Image Retrieval , 2004 .

[14]  Shih-Fu Chang,et al.  Reranking Methods for Visual Search , 2007, IEEE MultiMedia.

[15]  Ximena Olivares,et al.  Visual diversification of image search results , 2009, WWW '09.

[16]  James M. Rehg,et al.  Where am I: Place instance and category recognition using spatial PACT , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  A. Treisman,et al.  A feature-integration theory of attention , 1980, Cognitive Psychology.

[18]  Anil K. Jain,et al.  Image retrieval using color and shape , 1996, Pattern Recognit..

[19]  S Ullman,et al.  Shifts in selective visual attention: towards the underlying neural circuitry. , 1985, Human neurobiology.

[20]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[21]  Li Wang,et al.  Query aware visual similarity propagation for image search reranking , 2009, MM '09.

[22]  Shih-Fu Chang,et al.  Video search reranking via information bottleneck principle , 2006, MM '06.

[23]  Pietro Perona,et al.  Is bottom-up attention useful for object recognition? , 2004, CVPR 2004.

[24]  David J. Kriegman,et al.  Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection , 1996, ECCV.

[25]  Robert B. Fisher,et al.  Object-based visual attention for computer vision , 2003, Artif. Intell..