ROI adaptive scalable video coding for limited bandwidth wireless networks

Much of the attention in the field of video adaptation has been directed to the Scalable Video Coding (SVC), which is the extension of the H.264/AVC standard, since the bit-stream scalability for video is a desirable feature for many multimedia applications. The need for the scalability mainly arises from the need for spatial formats, bit rates or power. To fulfill these requirements, it would be beneficial to simultaneously transmit or store video in variety of spatial/temporal resolutions and qualities, leading to the video bit-stream scalability. Regions-of-interest (ROI) coding is a desirable feature in future applications of Scalable Video Coding. For those SVC applications, users at the decoder side usually wish to receive a high-quality decoded video stream, containing the desired ROI, which should be adaptively selected from the pre-encoded scalable bit-stream. In this work, we present a novel ROI adaptive scalable video coding scheme, enabling to adaptively set desirable ROI location, size, resolution and bit-rate, according to the limited network bandwidth and predefined settings. This, in turn, will enable providing an effective rate control for multiple ROIs, thereby enabling adaptively selecting the required ROI from multiple ROIs in the scalable bit-stream, and adaptively changing ROI spatial resolution, ROI visual quality or amount of bits allocated for each ROI, according to the network bandwidth and user's settings (i.e., user's display resolution, etc.).

[1]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[2]  Wesley De Neve,et al.  A real-time content adaptation framework for exploiting ROI scalability in H.264/AVC , 2006 .

[3]  Dorin Comaniciu,et al.  Real-time tracking of non-rigid objects using mean shift , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[4]  Munchurl Kim,et al.  Moving object tracking in H.264/AVC bitstream , 2007 .

[5]  Heiko Schwarz,et al.  Overview of the Scalable Video Coding Extension of the H.264/AVC Standard , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[6]  Xiuzi Ye,et al.  Error resilience schemes of H.264/AVC for 3G conversational video services , 2005, The Fifth International Conference on Computer and Information Technology (CIT'05).

[7]  Wen Gao,et al.  Robust moving object segmentation on H.264/AVC compressed video using the block-based MRF model , 2005, Real Time Imaging.

[8]  Ruimin Hu,et al.  Adaptive FMO selection strategy for error resilient H.264 coding , 2008, 2008 International Conference on Audio, Language and Image Processing.

[9]  Henri Nicolas,et al.  Compressed domain indexing of scalable H.264/SVC streams , 2009, Signal Process. Image Commun..

[10]  Ming-Chieh Chi,et al.  ROI video coding based on H.263+ with robust skin-color detection technique , 2003, IEEE Trans. Consumer Electron..

[11]  T. Ogunfunmi,et al.  On the performance of a 3D flexible macroblock ordering for H.264/AVC , 2006, 2006 Digest of Technical Papers International Conference on Consumer Electronics.

[12]  Yiqiang Zhan,et al.  Rapid Object Tracking on Compressed Video , 2001, IEEE Pacific Rim Conference on Multimedia.

[13]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[14]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[15]  Warnakulasuriya Anil Chandana Fernando,et al.  Unequal Error Protection Technique for ROI Based H.264 Video Coding , 2006, 2006 Canadian Conference on Electrical and Computer Engineering.

[16]  Huifang Sun,et al.  Compressed Domain Video Object Segmentation , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[17]  Charles D. Creusere,et al.  Tracking of extended size targets in H.264 compressed video using the probabilistic data association filter , 2004, 2004 12th European Signal Processing Conference.