Paper information - Video Adaptation for Small Display Based on Content Recomposition

Browsing quality videos on small hand-held devices is a common scenario in pervasive media environments. In this paper, we propose a novel framework for video adaptation based on content recomposition. Our objective is to provide effective small-size videos that emphasize the important aspects of a scene while faithfully retaining the background context. This is achieved by explicitly separating the manipulation of different video objects. A generic video attention model is developed to extract user-interest objects, in which a high-level combination strategy is proposed for fusing three types of visual attention features: intensity, color, and motion. Based on knowledge of media aesthetics, a set of aesthetic criteria is presented. Guided by these criteria, the extracted objects are reintegrated with the directly resized background to optimally match specific screen sizes. Experimental results demonstrate the efficiency and effectiveness of our approach.
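The abstract does not spell out the combination strategy, so the following is only a minimal sketch of what fusing the three feature maps might look like. It assumes per-pixel intensity, color, and motion maps are already available as 2-D lists, and it swaps in a plain min-max normalization followed by a weighted average. The names `normalize` and `fuse`, the weights, and the toy values are all hypothetical and are not taken from the paper.

```python
from typing import List, Sequence

Map = List[List[float]]


def normalize(m: Map) -> Map:
    """Min-max scale a 2-D map to [0, 1]; a constant map becomes all zeros."""
    flat = [v for row in m for v in row]
    lo, hi = min(flat), max(flat)
    if hi == lo:
        return [[0.0 for _ in row] for row in m]
    return [[(v - lo) / (hi - lo) for v in row] for row in m]


def fuse(maps: Sequence[Map], weights: Sequence[float]) -> Map:
    """Weighted average of normalized feature maps.

    This is an assumed, illustrative fusion rule, not the paper's strategy.
    All maps must share the same dimensions.
    """
    total = sum(weights)
    normed = [normalize(m) for m in maps]
    rows, cols = len(normed[0]), len(normed[0][0])
    return [
        [
            sum(w * n[r][c] for w, n in zip(weights, normed)) / total
            for c in range(cols)
        ]
        for r in range(rows)
    ]


# Toy 2x2 feature maps (made-up values for illustration only).
intensity = [[0.0, 2.0], [4.0, 8.0]]
color = [[1.0, 1.0], [1.0, 3.0]]
motion = [[0.0, 0.0], [0.0, 5.0]]

saliency = fuse([intensity, color, motion], weights=[1.0, 1.0, 1.0])
```

In this toy input the bottom-right pixel is the maximum of all three maps, so it receives the highest fused score. A region of high fused saliency would then be a candidate user-interest object to extract before the background is resized.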
