Integrating semantic analysis and scalable video coding for efficient content-based adaptation

Scalable video coding has become a key technology to deploy systems where the adaptation of content to diverse constrained usage environments (such as PDAs, mobile phones and networks) is carried out in a simple and efficient way. Content-based adaptation and summarization are fields that aim for providing improved adaptation to the user, trying to optimize the semantic coverage in the adapted/summarized version. This paper proposes the integration of content analysis with scalable video adaptation paradigm. They must be fitted in such a way that the efficiency of scalable adaptation is not damaged. An integrated framework is proposed for semantic video adaptation, as well as an adaptive skimming scheme that can use the results of semantic analysis. They are described using the MPEG-21 DIA tools to provide the adaptation in a standard framework. Particularly, the case of activity analysis is described to illustrate the integration of semantic analysis in the framework, and its use for online content summarization and adaptation. Overall efficiency is achieved by means of computing activity using compressed domain analysis with several metrics evaluated as measures of activity.

[1]  Edward J. Delp,et al.  A fast algorithm for video parsing using MPEG compressed sequences , 1995, Proceedings., International Conference on Image Processing.

[2]  Fernando Pereira,et al.  Automatic video summarization based on MPEG-7 descriptions , 2004, Signal Process. Image Commun..

[3]  H. Schwarz,et al.  Overview of the Scalable H.264/MPEG4-AVC Extension , 2006, 2006 International Conference on Image Processing.

[4]  J. Pearl,et al.  Comparison of the cosine and Fourier transforms of Markov-1 signals , 1976 .

[5]  Liang-Tien Chia,et al.  MPEG-21 digital item adaptation by applying perceived motion energy to H.264 video , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[6]  Jens-Rainer Ohm,et al.  Advances in Scalable Video Coding , 2005, Proceedings of the IEEE.

[7]  Ajay Divakaran,et al.  Constant pace skimming and temporal sub-sampling of video using motion activity , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[8]  Ajay Divakaran,et al.  MPEG-7 visual motion descriptors , 2001, IEEE Trans. Circuits Syst. Video Technol..

[9]  Boon-Lock Yeo,et al.  Video visualization for compact presentation and fast browsing of pictorial content , 1997, IEEE Trans. Circuits Syst. Video Technol..

[10]  Takeo Kanade,et al.  Video skimming and characterization through the combination of image and language understanding , 1998, Proceedings 1998 IEEE International Workshop on Content-Based Access of Image and Video Database.

[11]  Wolfgang Effelsberg,et al.  Abstracting Digital Movies Automatically , 1996, J. Vis. Commun. Image Represent..

[12]  Avideh Zakhor,et al.  Applications of Video-Content Analysis and Retrieval , 2002, IEEE Multim..

[13]  Anthony Vetro,et al.  MPEG-21 digital item adaptation: enabling universal multimedia access , 2004, IEEE MultiMedia.

[14]  Joern Ostermann,et al.  Special issue on analysis and understanding for video adaptation , 2005 .

[15]  Lie Lu,et al.  A generic framework of user attention model and its application in video summarization , 2005, IEEE Trans. Multim..

[16]  R. Venkatesh Babu,et al.  Video object segmentation: a compressed domain approach , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[17]  Anthony G. Constantinides,et al.  Variable size block matching motion compensation with applications to video coding , 1990 .

[18]  Marta Mrak,et al.  A SCALABLE CODING FRAMEWORK FOR EFFICIENT VIDEO ADAPTATION , 2005 .

[19]  Jesús Bescós,et al.  Extraction of Motion Activity from Scalable-Coded Video Sequences , 2006, SAMT.

[20]  Mihaela van der Schaar,et al.  Interframe wavelet coding - motion picture representation for universal scalability , 2004, Signal Process. Image Commun..

[21]  Kiyoharu Aizawa,et al.  Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30 - December 3, 2004, Proceedings, Part I , 2005, Pacific Rim Conference on Multimedia.

[22]  Yong Wang,et al.  Optimal adaptation decision-taking for terminal and network quality-of-service , 2005, IEEE Transactions on Multimedia.

[23]  Sanjeev R. Kulkarni,et al.  Rapid estimation of camera motion from compressed video with application to video annotation , 2000, IEEE Trans. Circuits Syst. Video Technol..

[24]  Jun Xin,et al.  Video Adaptation : Concepts , Technologies , and Open Issues , .

[25]  V. Ghini,et al.  A video frame dropping mechanism based on audio perception , 2004, IEEE Global Telecommunications Conference Workshops, 2004. GlobeCom Workshops 2004..

[26]  Shih-Fu Chang,et al.  Survey of compressed-domain features used in audio-visual indexing and analysis , 2003, J. Vis. Commun. Image Represent..

[27]  Hojung Cha,et al.  Dynamic Frame Dropping for Bandwidth Control in MPEG Streaming System , 2004, Multimedia Tools and Applications.

[28]  Chong-Wah Ngo,et al.  Automatic video summarization by graph modeling , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[29]  Jesús Bescós,et al.  Real-time shot change detection over online MPEG-2 video , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[30]  Nikola Sprljan,et al.  Bitstream syntax description based adaptation of scalable video , 2005 .

[31]  Sang Uk Lee,et al.  Efficient video indexing scheme for content-based retrieval , 1999, IEEE Trans. Circuits Syst. Video Technol..

[32]  Ahmed K. Elmagarmid,et al.  InsightVideo: toward hierarchical video content organization for efficient browsing, summarization and retrieval , 2005, IEEE Transactions on Multimedia.

[33]  Xiaodong Gu,et al.  Perceptual Video Streaming by Adaptive Spatial-temporal Scalability , 2004, PCM.

[34]  Jens-Rainer Ohm,et al.  Three-dimensional subband coding with motion compensation , 1994, IEEE Trans. Image Process..

[35]  Joshua R. Smith,et al.  Metadata-driven multimedia access , 2003, IEEE Signal Process. Mag..

[36]  Aggelos K. Katsaggelos,et al.  Rate-distortion optimal video summary generation , 2005, IEEE Transactions on Image Processing.

[37]  Yu Sun,et al.  Video transcoding: an overview of various techniques and research issues , 2005, IEEE Transactions on Multimedia.

[38]  John W. Woods,et al.  Embedded video coding using invertible motion compensated 3-D subband/wavelet filter bank , 2001, Signal Process. Image Commun..