Annotation based personalized adaptation and presentation of videos for mobile applications

Personalized multimedia content which suits user preferences and the usage environment, and as a result improves the user experience, gains more importance. In this paper, we describe an architecture for personalized video adaptation and presentation for mobile applications which is guided by automatically generated annotations. By including this annotation information, more intelligent adaptation techniques can be realized which primarily reduce the quality of unimportant regions in case a bit rate reduction is necessary. Furthermore, a presentation layer is added to enable advanced multimedia viewers to adequately present the interesting parts of a video in case the user wants to zoom in. This architecture is the result of collaborative research done in the EU FP6 IST INTERMEDIA project.

[1]  Rik Van de Walle,et al.  The MPEG-21 Book , 2006 .

[2]  Rik Van de Walle,et al.  Enabling universal media experiences through semantic adaptation in the creative drama productionworkflow , 2009, 2009 10th Workshop on Image Analysis for Multimedia Interactive Services.

[3]  David D. Cox,et al.  A High-Throughput Screening Approach to Discovering Good Forms of Biologically Inspired Visual Representation , 2009, PLoS Comput. Biol..

[4]  P. Pérez,et al.  Tracking multiple objects with particle filtering , 2002 .

[5]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[6]  Cedric Nishan Canagarajah,et al.  An efficient complexity-scalable video transcoder with mode refinement , 2007, Signal Process. Image Commun..

[7]  Wu-chi Feng,et al.  Supporting region-of-interest cropping through constrained compression , 2008, ACM Multimedia.

[8]  Antonio Torralba,et al.  Sharing Visual Features for Multiclass and Multiview Object Detection , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Peter Hosten,et al.  Enhanced background subtraction using global motion compensation and mosaicing , 2008, 2008 15th IEEE International Conference on Image Processing.

[10]  M. Angela Sasse,et al.  The big picture on small screens delivering acceptable video quality in mobile TV , 2009, TOMCCAP.

[11]  Anthony Vetro,et al.  MPEG-4 rate control for multiple video objects , 1999, IEEE Trans. Circuits Syst. Video Technol..

[12]  Bernd Girod,et al.  Optimal slice size for streaming regions of high resolution video with virtual pan/tilt/zoom functionality , 2007, 2007 15th European Signal Processing Conference.

[13]  John S. Boreczky,et al.  Comparison of video shot boundary detection techniques , 1996, Electronic Imaging.

[14]  Rik Van de Walle,et al.  Mixed architectures for H.264/AVC digital video transrating , 2009, Multimedia Tools and Applications.

[15]  Touradj Ebrahimi,et al.  Semantic video analysis for adaptive content delivery and automatic description , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[16]  Wei Tsang Ooi,et al.  Supporting zoomable video streams with dynamic region-of-interest cropping , 2010, MMSys '10.

[17]  Heiko Schwarz,et al.  Overview of the Scalable Video Coding Extension of the H.264/AVC Standard , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[18]  Paul Over,et al.  Video shot boundary detection: Seven years of TRECVid activity , 2010, Comput. Vis. Image Underst..

[19]  Rita Cucchiara,et al.  Semantic transcoding for live video server , 2002, MULTIMEDIA '02.

[20]  Shih-Fu Chang,et al.  Video Adaptation: Concepts, Technologies, and Open Issues , 2005, Proceedings of the IEEE.

[21]  Paul Over,et al.  Evaluation campaigns and TRECVid , 2006, MIR '06.

[22]  Alberto Del Bimbo,et al.  An Integrated Framework for Semantic Annotation and Adaptation , 2005, Multimedia Tools and Applications.

[23]  Anthony Vetro,et al.  Surveillance System with Object-Aware Video Transcoder , 2005, 2005 IEEE 7th Workshop on Multimedia Signal Processing.

[24]  Rik Van de Walle,et al.  System architecture for semantic annotation and adaptation in content sharing environments , 2008, The Visual Computer.

[25]  Peter Lambert,et al.  Requantization transcoding for H.264/AVC video coding , 2010, Signal Process. Image Commun..

[26]  Rik Van de Walle,et al.  A context-aware architecture for QoS and transcoding management of multimedia streams in smart homes , 2008, 2008 IEEE International Conference on Emerging Technologies and Factory Automation.

[27]  Fernando Pereira,et al.  Using MPEG standards for multimedia customization , 2004, Signal Process. Image Commun..

[28]  Bernd Girod,et al.  Network-Aware H . 264 / AVC Region-of-Interest Coding for a Multi-Camera Wireless Surveillance Network ⋆ , 2006 .

[29]  Nicu Sebe,et al.  Special section from the ACM multimedia conference 2007 , 2008, TOMCCAP.

[30]  Cyril Concolato,et al.  GPAC: open source multimedia framework , 2007, ACM Multimedia.

[31]  Fernando Pereira,et al.  MPEG-4 video subjective test procedures and results , 1997, IEEE Trans. Circuits Syst. Video Technol..

[32]  Gary J. Sullivan,et al.  Rate-constrained coder control and comparison of video coding standards , 2003, IEEE Trans. Circuits Syst. Video Technol..

[33]  Chen-Hsiu Huang Video Transcoding Architectures and Techniques : An Overview , 2003 .

[34]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[35]  B. S. Manjunath,et al.  Introduction to MPEG-7: Multimedia Content Description Interface , 2002 .

[36]  Rik Van de Walle,et al.  The MPEG-21 Book: Burnett/The MPEG-21 Book , 2006 .

[37]  Michael Unger,et al.  Segment Based Diffusion - A Post-Processing Step (Not Only) for Background Subtraction , 2008, 2008 Ninth International Workshop on Image Analysis for Multimedia Interactive Services.