A Framework for the Analysis and Optimization of Encoding Latency for Multiview Video

We present a novel framework for the analysis and optimization of encoding latency for multiview video. First, we characterize the elements that have an influence in the encoding latency performance: 1) the multiview prediction structure and 2) the hardware encoder model. Then, we provide algorithms to find the encoding latency of any arbitrary multiview prediction structure. The proposed framework relies on the directed acyclic graph encoder latency (DAGEL) model, which provides an abstraction of the processing capacity of the encoder by considering an unbounded number of processors. Using graph theoretic algorithms, the DAGEL model allows us to compute the encoding latency of a given prediction structure, and determine the contribution of the prediction dependencies to it. As an example of DAGEL application, we propose an algorithm to reduce the encoding latency of a given multiview prediction structure up to a target value. In our approach, a minimum number of frame dependencies are pruned, until the latency target value is achieved, thus minimizing the degradation of the rate-distortion performance due to the removal of the prediction dependencies. Finally, we analyze the latency performance of the DAGEL derived prediction structures in multiview encoders with limited processing capacity.

[1]  Itu-T and Iso Iec Jtc Advanced video coding for generic audiovisual services , 2010 .

[2]  Fernando Jaureguizar,et al.  Comparative latency analysis for arbitrary multiview video prediction structures , 2009, Electronic Imaging.

[3]  Aljoscha Smolic,et al.  Efficient Prediction Structures for Multiview Video Coding , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[4]  Wei Feng,et al.  A Framework for Heuristic Scheduling for Parallel Processing on Multicore Architecture: A Case Study With Multiview Video Coding , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[5]  Richard Szeliski,et al.  The geometry-image representation tradeoff for rendering , 2000, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101).

[6]  Thomas Wiegand,et al.  3D Video and Free Viewpoint Video - Technologies, Applications and MPEG Standards , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[7]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[8]  Fernando Jaureguizar,et al.  A graph-based approach for latency modeling and optimization in multiview video encoding , 2011, 2011 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON).

[9]  M. N. Shanmukha Swamy,et al.  Graphs: Theory and Algorithms , 1992 .

[10]  Fernando Jaureguizar,et al.  Latency Analysis for a Multi-Processor Multiview Video Encoder Implementation , 2009 .

[11]  Lifeng Sun,et al.  Spatial and Temporal Data Parallelization of Multi-view Video Encoding Algorithm , 2007, 2007 IEEE 9th Workshop on Multimedia Signal Processing.

[12]  Toshiaki Fujii,et al.  Free-Viewpoint TV , 2011, IEEE Signal Processing Magazine.

[13]  Gary J. Sullivan,et al.  Overview of the Stereo and Multiview Video Coding Extensions of the H.264/MPEG-4 AVC Standard , 2011, Proceedings of the IEEE.

[14]  Y.-K. Kwok,et al.  Static scheduling algorithms for allocating directed task graphs to multiprocessors , 1999, CSUR.

[15]  M. N. S. Swamy,et al.  Graphs: Theory and Algorithms: Thulasiraman/Graphs , 1992 .

[16]  I. Feldmann,et al.  Immersive Multi-User 3 D Video Communication , 2009 .

[17]  Gunnar Karlsson,et al.  Asynchronous transfer of video , 1996, IEEE Commun. Mag..

[18]  Aljoscha Smolic,et al.  Multi-View Video Plus Depth Representation and Coding , 2007, 2007 IEEE International Conference on Image Processing.

[19]  Mei Yu,et al.  Parallel Process of Hyper-Space-Based Multiview Video Compression , 2006, 2006 International Conference on Image Processing.