Smart Thumbnail: Automatic Image Cropping by Mining Canonical Query Objects

In this paper, we present a query-dependent thumbnailing approach for web image search. Motivated by the fact that uniform down-sampling cannot emphasize query objects while saliency-based methods may present incorrect foreground objects, we propose to employ common object discovery (COD) algorithms to mine the underlying canonical query objects from the result image collection and adopt the detected object regions of interest (ROIs) as a guide for image cropping. To make the employed COD approach more adaptive to our scenario, we enhance it by introducing text-based search rankings. We then decide for each image whether it should be cropped and determine the final cropping boundary by expanding the detected bounding box, so that the produced thumbnails are of proper appearances. The experimental results demonstrate that our method can outperform down-sampling and saliency-based methods on both object localization accuracy and general thumbnail quality.

[1]  Lihi Zelnik-Manor,et al.  Context-Aware Saliency Detection , 2012, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Benjamin B. Bederson,et al.  Automatic thumbnail cropping and its effectiveness , 2003, UIST '03.

[3]  Jian Sun,et al.  Salient object detection by composition , 2011, 2011 International Conference on Computer Vision.

[4]  Zhuowen Tu,et al.  Unsupervised object class discovery via saliency-guided multiple class learning , 2012, CVPR.

[5]  S. Süsstrunk,et al.  Frequency-tuned salient region detection , 2009, CVPR 2009.

[6]  Thomas Deselaers,et al.  Localizing Objects While Learning Their Appearance , 2010, ECCV.

[7]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.

[8]  Yan Liu,et al.  Image retargeting based on global energy optimization , 2009, 2009 IEEE International Conference on Multimedia and Expo.

[9]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[10]  Takeo Kanade,et al.  Distributed cosegmentation via submodular optimization on anisotropic diffusion , 2011, 2011 International Conference on Computer Vision.

[11]  Andrew Zisserman,et al.  Image Classification using Random Forests and Ferns , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[12]  Matthieu Guillaumin,et al.  Segmentation Propagation in ImageNet , 2012, ECCV.

[13]  Thomas Deselaers,et al.  What is an object? , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[14]  Antonio Torralba,et al.  Unsupervised Detection of Regions of Interest Using Iterative Link Analysis , 2009, NIPS.

[15]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[16]  Yan Liu,et al.  Image retargeting using multi-map constrained region warping , 2009, ACM Multimedia.

[17]  Jian Sun,et al.  Geodesic Saliency Using Background Priors , 2012, ECCV.

[18]  Michael Gleicher,et al.  Automatic image retargeting with fisheye-view warping , 2005, UIST.

[19]  K. N. Balasubramanya Murthy,et al.  A smart automatic thumbnail cropping based on attention driven regions of interest extraction , 2009, ICIS.

[20]  Ariel Shamir,et al.  Seam Carving for Content-Aware Image Resizing , 2007, ACM Trans. Graph..

[21]  Jean Ponce,et al.  Discriminative clustering for image co-segmentation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[22]  Thomas Deselaers,et al.  ClassCut for Unsupervised Class Segmentation , 2010, ECCV.

[23]  Yan Liu,et al.  Rapid image retargeting based on curve-edge grid representation , 2010, 2010 IEEE International Conference on Image Processing.

[24]  Frédéric Jurie,et al.  Improving web image search results using query-relative classifiers , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.