Video Adaptation for Small Display Based on Content Recomposition

The browsing of quality videos on small hand-held devices is a common scenario in pervasive media environments. In this paper, we propose a novel framework for video adaptation based on content recomposition. Our objective is to provide effective small size videos which emphasize the important aspects of a scene while faithfully retaining the background context. That is achieved by explicitly separating the manipulation of different video objects. A generic video attention model is developed to extract user-interest objects, in which a high-level combination strategy is proposed for fusing the adopted three types of visual attention features: intensity, color, and motion. Based on the knowledge of media aesthetics, a set of aesthetic criteria is presented. Accordingly, these objects are well reintegrated with the direct-resized background to optimally match the specific screen sizes. Experimental results demonstrate the efficiency and effectiveness of our approach

[1]  Patrick Pérez,et al.  Region filling and object removal by exemplar-based image inpainting , 2004, IEEE Transactions on Image Processing.

[2]  S. Engel,et al.  Colour tuning in human visual cortex measured with functional magnetic resonance imaging , 1997, Nature.

[3]  Javier Díaz,et al.  FPGA-based real-time optical-flow system , 2006, IEEE Transactions on Circuits and Systems for Video Technology.

[4]  Yao Wang,et al.  Video Processing and Communications , 2001 .

[5]  John R. Smith,et al.  Adapting Multimedia Internet Content for Universal Access , 1999, IEEE Trans. Multim..

[6]  Touradj Ebrahimi,et al.  Semantic video analysis for adaptive content delivery and automatic description , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[7]  Xing Xie,et al.  A visual attention model for adapting images on small displays , 2003, Multimedia Systems.

[8]  Laurent Itti,et al.  A Goal Oriented Attention Guidance Model , 2002, Biologically Motivated Computer Vision.

[9]  Brendan J. Frey,et al.  Video Epitomes , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[10]  Jeho Nam,et al.  Visual content adaptation according to user perception characteristics , 2005, IEEE Transactions on Multimedia.

[11]  Ramesh R. Sarukkai,et al.  Video search: opportunities & challenges , 2005, MIR '05.

[12]  Keansub Lee,et al.  Perception-based image transcoding for universal multimedia access , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[13]  Eric R. Ziegel,et al.  Probability and Statistics for Engineering and the Sciences , 2004, Technometrics.

[14]  Ian Burnett,et al.  Universal multimedia experiences for tomorrow , 2003 .

[15]  B. J Hne,et al.  Spatio - temporal Image Processing: Theory and Scientific Applications , 1991 .

[16]  Ja-Ling Wu,et al.  Robust Algorithm for Exemplar-based Image Inpainting , 2005 .

[17]  Sigeru Omatu,et al.  Regular moments for symmetric images , 1998 .

[18]  Michael Gleicher,et al.  Automatic image retargeting with fisheye-view warping , 2005, UIST.

[19]  David Bordwell,et al.  Film Art: An Introduction , 1979 .

[20]  Ming-Ting Sun,et al.  Digital Video Transcoding , 2005, Proceedings of the IEEE.

[21]  Bernd Jähne,et al.  Spatio-Temporal Image Processing , 1993, Lecture Notes in Computer Science.

[22]  Yeong-Ho Ha,et al.  Spatial color descriptor for image retrieval and video segmentation , 2003, IEEE Trans. Multim..

[23]  Xian-Sheng Hua,et al.  An Attention-Based Decision Fusion Scheme for Multimedia Information Retrieval , 2004, PCM.

[24]  Shipeng Li,et al.  Interactive tracker - a semi-automatic video object tracking and segmentation system , 2001, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001..

[25]  Miska M. Hannuksela,et al.  Isolated regions in video coding , 2004, IEEE Transactions on Multimedia.

[26]  Christof Koch,et al.  Comparison of feature combination strategies for saliency-based visual attention systems , 1999, Electronic Imaging.

[27]  Fernando Pereira Universal Multimedia Experience for tomorrow , 2003 .

[28]  Sing-Tze Bow,et al.  Pattern recognition and image preprocessing , 1992 .

[29]  Rik Van de Walle,et al.  MPEG-21: goals and achievements , 2001 .

[30]  Ming-Ting Sun,et al.  Dynamic region of interest transcoding for multipoint video conferencing , 2003, IEEE Trans. Circuits Syst. Video Technol..

[31]  Jun Xin,et al.  Video Adaptation : Concepts , Technologies , and Open Issues , .

[32]  M. Angela Sasse,et al.  Can small be beautiful?: assessing image resolution requirements for mobile TV , 2005, MULTIMEDIA '05.

[33]  Ramesh Raskar,et al.  Automatic image retargeting , 2005, MUM '05.

[34]  Thomas H. Cormen,et al.  Introduction to algorithms [2nd ed.] , 2001 .

[35]  Xing Xie,et al.  Automatic browsing of large pictures on mobile devices , 2003, MULTIMEDIA '03.

[36]  Guillermo Sapiro,et al.  Video inpainting of occluding and occluded objects , 2005, IEEE International Conference on Image Processing 2005.

[37]  Wen-Huang Cheng,et al.  A user-attention based focus detection framework and its applications , 2003, Fourth International Conference on Information, Communications and Signal Processing, 2003 and the Fourth Pacific Rim Conference on Multimedia. Proceedings of the 2003 Joint.

[38]  Chong-Wah Ngo,et al.  Motion analysis and segmentation through spatio-temporal slices processing , 2003, IEEE Trans. Image Process..

[39]  Wen-Huang Cheng,et al.  A Visual Attention Based Region-of-Interest Determination Framework for Video Sequences , 2005, IEICE Trans. Inf. Syst..

[40]  Jun-Cheng Chen,et al.  A real-time semi-automatic video segmentation system based on mathematical morphology , 2005, Visual Communications and Image Processing.

[41]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[42]  H. Zettl Sight, Sound, Motion: Applied Media Aesthetics , 1973 .

[43]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[44]  Wen-Huang Cheng,et al.  A practical foveation-based rate-shaping mechanism for MPEG videos , 2005, IEEE Trans. Circuits Syst. Video Technol..

[45]  Svetha Venkatesh,et al.  Computational Media Aesthetics: Finding Meaning Beautiful , 2001, IEEE Multim..

[46]  Claudio M. Privitera,et al.  Algorithms for Defining Visual Regions-of-Interest: Comparison with Eye Fixations , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[47]  Andrew Perkis,et al.  MPEG-21: The 21st century multimedia framework , 2003, IEEE Signal Process. Mag..

[48]  Yu Sun,et al.  Video transcoding: an overview of various techniques and research issues , 2005, IEEE Transactions on Multimedia.

[49]  David S. Taubman,et al.  Realizing Low-Cost High-Throughput General-Purpose Block Encoder for JPEG2000 , 2006, IEEE Transactions on Circuits and Systems for Video Technology.