Salient region detection and its application to video retargeting

Despite extensive study of visual saliency, e.g., generating a saliency map from an image, little work has addressed how to crop salient regions from saliency maps. We present a new approach that detects the salient regions with maximum saliency density in videos. A branch-and-bound search algorithm is developed to find the globally optimal solution efficiently. The proposed detection approach automatically adapts to the shapes and motions of salient objects, regardless of background clutter. Moreover, by introducing an intermediate cropping window, video retargeting, as an application of salient region detection, achieves optimized saliency coverage. Extensive experimental results validate the advantages of the proposed method.
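As a rough illustration of what searching a saliency map for a maximum-density region can look like, the Python sketch below scores every axis-aligned window on a single frame and keeps the best one. It is not the method of the paper. The density objective (saliency sum divided by window area plus a constant `c`), the function names, and the exhaustive search that stands in for branch-and-bound are all assumptions made for illustration. The constant `c` is a hypothetical regularizer that keeps the search from collapsing onto a single bright pixel.

```python
def integral_image(sal):
    """Summed-area table with a zero border: ii[y][x] = sum of sal[0:y][0:x]."""
    h, w = len(sal), len(sal[0])
    ii = [[0.0] * (w + 1) for _ in range(h + 1)]
    for y in range(h):
        row_sum = 0.0
        for x in range(w):
            row_sum += sal[y][x]
            ii[y + 1][x + 1] = ii[y][x + 1] + row_sum
    return ii


def window_sum(ii, top, left, bottom, right):
    """Saliency summed over rows top..bottom-1 and columns left..right-1."""
    return ii[bottom][right] - ii[top][right] - ii[bottom][left] + ii[top][left]


def densest_window(sal, c=1.0):
    """Exhaustive search for the window maximizing sum / (area + c).

    Assumed objective and brute-force search, for illustration only;
    a branch-and-bound search would prune most of these candidates.
    """
    h, w = len(sal), len(sal[0])
    ii = integral_image(sal)
    best_score, best_box = float("-inf"), None
    for top in range(h):
        for bottom in range(top + 1, h + 1):
            for left in range(w):
                for right in range(left + 1, w + 1):
                    area = (bottom - top) * (right - left)
                    score = window_sum(ii, top, left, bottom, right) / (area + c)
                    if score > best_score:
                        best_score, best_box = score, (top, left, bottom, right)
    return best_box, best_score


# Toy saliency map: a 2x3 block of salient pixels on an empty background.
toy = [[0.0] * 6 for _ in range(5)]
for y in (1, 2):
    for x in (2, 3, 4):
        toy[y][x] = 1.0

box, score = densest_window(toy)
print(box)  # (1, 2, 3, 5) as (top, left, bottom, right), end-exclusive
```

The summed-area table makes each window evaluation constant time, so the cost is dominated by the number of candidate windows, which grows with the square of both image dimensions. That growth is the reason an efficient global search matters in practice.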
