Merge Frame Design for Video Stream Switching Using Piecewise Constant Functions

The ability to efficiently switch from one pre-encoded video stream to another (e.g., for bitrate adaptation or view switching) is important for many interactive streaming applications. Recently, stream-switching mechanisms based on distributed source coding (DSC) have been proposed. In order to reduce the overall transmission rate, these approaches provide a merge mechanism, where information is sent to the decoder, such that the exact same frame can be reconstructed given that any one of a known set of side information (SI) frames is available at the decoder (e.g., each SI frame may correspond to a different stream from which we are switching). However, the use of bit-plane coding and channel coding in many DSC approaches leads to complex coding and decoding. In this paper, we propose an alternative approach for merging multiple SI frames, using a piecewise constant (PWC) function as the merge operator. In our approach, for each block to be reconstructed, a series of parameters of these PWC merge functions are transmitted in order to guarantee identical reconstruction given the known SI blocks. We consider two different scenarios. In the first case, a target frame is first given, and then merge parameters are chosen, so that this frame can be reconstructed exactly at the decoder. In contrast, in the second scenario, the reconstructed frame and the merge parameters are jointly optimized to meet a rate-distortion criteria. Experiments show that for both scenarios, our proposed merge techniques can outperform both a recent approach based on DSC and the SP-frame approach in H.264, in terms of compression efficiency and decoder complexity.

[1]  Pascal Frossard,et al.  Near-optimal content replication for interactive multiview video streaming , 2012, 2012 19th International Packet Video Workshop (PV).

[2]  Marta Karczewicz,et al.  The SP- and SI-frames design for H.264/AVC , 2003, IEEE Trans. Circuits Syst. Video Technol..

[3]  Antonio Ortega,et al.  On media data structures for interactive streaming in immersive applications , 2010, Visual Communications and Image Processing.

[4]  Antonio Ortega,et al.  Optimized frame structure using distributed source coding for interactive multiview video streaming , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[5]  Bernd Girod,et al.  ClassX: an open source interactive lecture StreamingSystem , 2011, ACM Multimedia.

[6]  Oscar C. Au,et al.  Rate-distortion optimized merge frame using piecewise constant functions , 2013, 2013 IEEE International Conference on Image Processing.

[7]  Bernd Girod,et al.  Rate-Distortion Optimized Interactive Light Field Streaming , 2007, IEEE Transactions on Multimedia.

[8]  Antonio Ortega,et al.  Rate-distortion based reconstruction optimization in distributed source coding for interactive multiview video streaming , 2010, 2010 IEEE International Conference on Image Processing.

[9]  Wen Gao,et al.  Wyner–Ziv Switching Scheme for Multiple Bit-Rate Video Streaming , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[10]  Jian Zhou,et al.  MPEG video streaming with VCR functionality , 2000, Proceedings International Symposium on Multimedia Software Engineering.

[11]  Kannan Ramchandran,et al.  Distributed source coding using syndromes (DISCUS): design and construction , 2003, IEEE Trans. Inf. Theory.

[12]  Christophe De Vleeschouwer,et al.  A Flexible Video Transmission System Based on JPEG 2000 Conditional Replenishment with Multiple References , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[13]  Eckehard G. Steinbach,et al.  RDTC Optimized Compression of Image-Based Scene Representations (Part II): Practical Coding , 2008, IEEE Transactions on Image Processing.

[14]  Antonio Ortega,et al.  Coding structure optimization for interactive multiview streaming in virtual world observation , 2008, 2008 IEEE 10th Workshop on Multimedia Signal Processing.

[15]  Wen Gao,et al.  Drift-free switching of compressed video bitstreams at predictive frames , 2006, IEEE Trans. Circuits Syst. Video Technol..

[16]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[17]  Bernd Girod,et al.  Wyner-Ziv coding of light fields for random access , 2004, IEEE 6th Workshop on Multimedia Signal Processing, 2004..

[18]  Antonio Ortega,et al.  Overlapped Tiling for Fast Random Oblique Plane Access of 3D Object Datasets , 2009, 2009 Data Compression Conference.

[19]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[20]  Jiang Li,et al.  A real-time interactive multi-view video system , 2005, MULTIMEDIA '05.

[21]  Gary J. Sullivan,et al.  Efficient scalar quantization of exponential and Laplacian random variables , 1996, IEEE Trans. Inf. Theory.

[22]  Antonio Ortega,et al.  Video compression with flexible playback order based on distributed source coding , 2006, Electronic Imaging.

[23]  Chia-Wen Lin,et al.  A New Seamless Bitstream Switching Scheme for H.264 Video Adaptation with Enhanced Coding Performance , 2006, 2006 International Conference on Image Processing.

[24]  Ngai-Man Cheung,et al.  Generation of redundant frame structure for interactive multiview streaming , 2009, 2009 17th International Packet Video Workshop.

[25]  Bernd Girod,et al.  An interactive region-of-interest video streaming system for online lecture viewing , 2010, 2010 18th International Packet Video Workshop.

[26]  Wei Cai,et al.  Optimal frame structure design using landmarks for interactive light field streaming , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[27]  Antonio Ortega,et al.  Compression algorithms for flexible video decoding , 2008, Electronic Imaging.

[28]  Antonio Ortega,et al.  Distributed source coding techniques for interactive multiview video streaming , 2009, 2009 Picture Coding Symposium.

[29]  Antonio Ortega,et al.  Interactive Streaming of Stored Multiview Video Using Redundant Frame Structures , 2011, IEEE Transactions on Image Processing.

[30]  Antonio Ortega,et al.  Bandwidth-Efficient Interactive Multiview Live Video Streaming using Redundant Frame Structures , 2009 .

[31]  A T Naman,et al.  JPEG2000-Based Scalable Interactive Video (JSIV) , 2011, IEEE Transactions on Image Processing.

[32]  Gene Cheung,et al.  Designing coding structures with merge frames for interactive multiview video streaming , 2016, 2016 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).

[33]  Raj Jain,et al.  Analysis of the Increase and Decrease Algorithms for Congestion Avoidance in Computer Networks , 1989, Comput. Networks.

[34]  Yui-Lam Chan,et al.  Efficient reverse-play algorithms for MPEG video with VCR support , 2006, IEEE Trans. Circuits Syst. Video Technol..

[35]  Susie J. Wee,et al.  Compressed-domain reverse play of MPEG video streams , 1999, Other Conferences.

[36]  David S. Taubman,et al.  Rate-distortion optimized interactive browsing of JPEG2000 images , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[37]  Eckehard G. Steinbach,et al.  RDTC Optimized Compression of Image-Based Scene Representations (Part I): Modeling and Theoretical Analysis , 2008, IEEE Transactions on Image Processing.

[38]  Bernd Girod,et al.  Spatial-Random-Access-Enabled Video Coding for Interactive Virtual Pan/Tilt/Zoom Functionality , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[39]  Bernd Girod,et al.  Mobile interactive region-of-interest video streaming with crowd-driven prefetching , 2011, IMMPD '11.

[40]  Gene Cheung,et al.  Coding for Interactive Navigation in High-Dimensional Media Data , 2013, Emerging Technologies for 3D Video.