论文信息 - Adaptive Downsampling Video Coding With Spatially Scalable Rate-Distortion Modeling

Adaptive Downsampling Video Coding With Spatially Scalable Rate-Distortion Modeling

Downsampling video coding, whereby downsampled frames are encoded, provides improved perceptual quality in rate-constrained situations. This method shows considerable advantages over other approaches, particularly in wide-spreading high-definition video formats. This paper provides a comprehensive analysis of downsampling video coding. The study proposes a spatially scalable rate-distortion (RD) model, comprising quantization-distortion and quantization-rate models, and develops an optimal encoding frame size determination framework. The proposed method achieves a gain up to 2.3 dB peak signal-to-noise ratio (PSNR) at 1 Mb/s when compared with conventional full frame size coding. The RD performance is close to the optimal scenario, in which the ideal frame size is obtained by heuristically performing downsampling coding in various allowable sizes.

[1] Weisi Lin,et al. Adaptive downsampling/upsampling for better video compression at low bit rate , 2008, 2008 IEEE International Symposium on Circuits and Systems.

[2] Jungyoup Yang,et al. Macroblock-level adaptive dynamic resolution conversion technique , 2006, SPIE Optics East.

[3] Oscar C. Au,et al. A Novel Analytic Quantization-Distortion Model for Hybrid Video Coding , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[4] Wen-Hsiao Peng,et al. Analytical mode-dependent rate and distortion models for H.264/SVC coarse grain scalability , 2012, 2012 IEEE International Symposium on Circuits and Systems.

[5] C. G. Broyden. The Convergence of a Class of Double-rank Minimization Algorithms 2. The New Algorithm , 1970 .

[6] Itu-T and Iso Iec Jtc. Advanced video coding for generic audiovisual services , 2010 .

[7] André Kaup,et al. Laplace Distribution Based Lagrangian Rate Distortion Optimization for Hybrid Video Coding , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[8] Hyuk-Jae Lee,et al. Bitrate control using a heuristic spatial resolution adjustment for a real-time H.264/AVC encoder , 2012, EURASIP J. Adv. Signal Process..

[9] Yong Man Ro,et al. Joint control for hybrid transcoding using multidimensional rate distortion modeling , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[10] Yong Man Ro,et al. Distortion Measures in MPEG-Compressed Domain for Multidimensional Transcoding , 2005, 2005 IEEE 7th Workshop on Multimedia Signal Processing.

[11] Anthony Vetro,et al. Rate-distortion models for video transcoding , 2003, IS&T/SPIE Electronic Imaging.

[12] Hyun Wook Park,et al. L/M-fold image resizing in block-DCT domain using symmetric convolution , 2003, IEEE Trans. Image Process..

[13] Michael Elad,et al. Improved high-definition video by encoding at an intermediate resolution , 2004, IS&T/SPIE Electronic Imaging.

[14] Deepak S. Turaga,et al. No reference PSNR estimation for compressed pictures , 2002, Proceedings. International Conference on Image Processing.

[15] Yücel Altunbasak,et al. Frame bit allocation for the H.264/AVC video coder via Cauchy-density-based rate and distortion models , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[16] Heonshik Shin,et al. Design of a mobile video streaming system using adaptive spatial resolution control , 2009, IEEE Transactions on Consumer Electronics.

[17] Gary J. Sullivan,et al. Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[18] Thomas Wedi,et al. Motion- and aliasing-compensated prediction for hybrid video coding , 2003, IEEE Trans. Circuits Syst. Video Technol..

[19] Ci Wang,et al. Down-Sampling Based Video Coding Using Super-Resolution Technique , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[20] Michael Elad,et al. Down-Scaling for Better Transform Compression , 2001, Scale-Space.

[21] Jae S. Lim,et al. Optimal multidimensional bit-rate control for video communication , 2002, IEEE Trans. Image Process..

[22] Rajeev Kumar,et al. An Efficient Motion Vector Composition Scheme for Arbitrary Frame Down-Sampling Video Transcoder , 2006, IEEE Transactions on Circuits and Systems for Video Technology.

[23] Pao-Chi Chang,et al. Quality Driven Frame Rate Optimization for Rate Constrained Video Encoding , 2012, IEEE Trans. Broadcast..

[24] Aggelos K. Katsaggelos,et al. Region-based super-resolution for compression , 2007, Multidimens. Syst. Signal Process..

[25] Ming-Ting Sun,et al. Modeling DCT coefficients for fast video encoding , 1999, IEEE Trans. Circuits Syst. Video Technol..

[26] Lap-Pui Chau,et al. The realization of arbitrary downsizing video transcoding , 2006, IEEE Transactions on Circuits and Systems for Video Technology.