On the accuracy and complexity of rate-distortion models for fine-grained scalable video sequences

Rate-distortion (R-D) models are functions that describe the relationship between the bitrate and expected level of distortion in the reconstructed video stream. R-D models enable optimization of the received video quality in different network conditions. Several R-D models have been proposed for the increasingly popular fine-grained scalable video sequences. However, the models' relative performance has not been thoroughly analyzed. Moreover, the time complexity of each model is not known, nor is the range of bitrates in which the model produces valid results. This lack of quantitative performance analysis makes it difficult to select the model that best suits a target streaming system. In this article, we classify, analyze, and rigorously evaluate all R-D models proposed for FGS coders in the literature. We classify R-D models into three categories: analytic, empirical, and semi-analytic. We describe the characteristics of each category. We analyze the R-D models by following their mathematical derivations, scrutinizing the assumptions made, and explaining when the assumptions fail and why. In addition, we implement all R-D models, a total of eight, and evaluate them using a diverse set of video sequences. In our evaluation, we consider various source characteristics, diverse channel conditions, different encoding/decoding parameters, different frame types, and several performance metrics including accuracy, range of applicability, and time complexity of each model. We also present clear systematic ways (pseudo codes) for constructing various R-D models from a given video sequence. Based on our experimental results, we present a justified list of recommendations on selecting the best R-D models for video-on-demand, video conferencing, real-time, and peer-to-peer streaming systems.

[1]  Wen Gao,et al.  Statistical model, analysis and approximation of rate-distortion function in MPEG-4 FGS videos , 2005, Visual Communications and Image Processing.

[2]  Feng Wu,et al.  A framework for efficient progressive fine granularity scalable video coding , 2001, IEEE Trans. Circuits Syst. Video Technol..

[3]  Hsueh-Ming Hang,et al.  Source model for transform video coder and its application. I. Fundamental theory , 1997, IEEE Trans. Circuits Syst. Video Technol..

[4]  Sanjit K. Mitra,et al.  A unified rate-distortion analysis framework for transform coding , 2001, IEEE Trans. Circuits Syst. Video Technol..

[5]  William K. Pratt Image Processing Research , 1974 .

[6]  Alberto Leon-Garcia,et al.  Estimation of shape parameter for generalized Gaussian distributions in subband decompositions of video , 1995, IEEE Trans. Circuits Syst. Video Technol..

[7]  Min Dai,et al.  Rate-distortion analysis and traffic modeling of scalable video coders , 2004 .

[8]  F. Muller Distribution shape of two-dimensional DCT coefficients of natural images , 1993 .

[9]  G. Sullivan,et al.  Rate-distortion optimization for video compression , 1998, IEEE Signal Process. Mag..

[10]  Dmitri Loguinov,et al.  Analysis of rate-distortion functions and congestion control in scalable internet video streaming , 2003, NOSSDAV '03.

[11]  Shuichi Matsumoto,et al.  A Bit-Plane Coding Scheme of MPEG-4 FGS with High Efficiency Based on the Distribution of Significant Coefficients , 2002, IEEE Pacific Rim Conference on Multimedia.

[12]  Thomas R. Fischer,et al.  Comparison of generalized Gaussian and Laplacian modeling in DCT image coding , 1995, IEEE Signal Processing Letters.

[13]  Wei Ding,et al.  Rate control of MPEG video coding and recording by rate-quantization modeling , 1996, IEEE Trans. Circuits Syst. Video Technol..

[14]  J. Giedt,et al.  Rensselaer Polytechnic Institute , 1960, Nature.

[15]  HsuCheng-Hsin,et al.  On the accuracy and complexity of rate-distortion models for fine-grained scalable video sequences , 2008 .

[16]  Sanjit K. Mitra,et al.  A linear source model and a unified rate control algorithm for DCT video coding , 2002, IEEE Trans. Circuits Syst. Video Technol..

[17]  Antonio Ortega,et al.  Bit-rate control using piecewise approximated rate-distortion characteristics , 1998, IEEE Trans. Circuits Syst. Video Technol..

[18]  Hayder Radha,et al.  Rate-Distortion Analysis and Quality Control in Scalable Internet Streaming , 2006, IEEE Transactions on Multimedia.

[19]  Newton Lee,et al.  ACM Transactions on Multimedia Computing, Communications and Applications (ACM TOMCCAP) , 2007, CIE.

[20]  Yao Wang,et al.  Video Processing and Communications , 2001 .

[21]  Yun Q. Shi,et al.  Constant quality constrained rate allocation for FGS-coded video , 2003, IEEE Trans. Circuits Syst. Video Technol..

[22]  Iso/iec 14496-2 Information Technology — Coding of Audio-visual Objects — Part 2: Visual , .

[23]  Tihao Chiang,et al.  A new rate control scheme using quadratic rate distortion model , 1997, IEEE Trans. Circuits Syst. Video Technol..

[24]  Gary J. Sullivan,et al.  Rate-distortion optimization for video compression , 1998, IEEE Signal Process. Mag..

[25]  Jun Sun,et al.  Statistical model, analysis and approximation of rate-distortion function in MPEG-4 FGS videos , 2006, IEEE Trans. Circuits Syst. Video Technol..

[26]  Hayder Radha,et al.  Rate-distortion modeling of scalable video coders , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[27]  Angel R. Martinez,et al.  Computational Statistics Handbook with MATLAB , 2001 .

[28]  M. Varanasi,et al.  Parametric generalized Gaussian density estimation , 1989 .

[29]  Mihaela van der Schaar,et al.  The MPEG-4 fine-grained scalable video coding method for multimedia streaming over IP , 2001, IEEE Trans. Multim..

[30]  Chien-Ming Wu,et al.  High-performance low-complexity bit-plane coding scheme for MPEG-4 FGS , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[31]  Tihao Chiang,et al.  A new rate control scheme using quadratic rate distortion model , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.

[32]  Hayder Radha,et al.  Statistical analysis and distortion modeling of MPEG-4 FGS , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[33]  Stéphane Mallat,et al.  Analysis of low bit rate image transform coding , 1998, IEEE Trans. Signal Process..

[34]  Weiping Li,et al.  Overview of fine granularity scalability in MPEG-4 video standard , 2001, IEEE Trans. Circuits Syst. Video Technol..

[35]  Sanjit K. Mitra,et al.  Low-delay rate control for DCT video coding via ?-domain source modeling , 2001, IEEE Trans. Circuits Syst. Video Technol..

[36]  Donald A. Adjeroh,et al.  Scene-adaptive transform domain video partitioning , 2004, IEEE Transactions on Multimedia.

[37]  Te-Won Lee,et al.  Modeling Nonlinear Dependencies in Natural Images using Mixture of Laplacian Distribution , 2004, NIPS.