A computation-distortion optimized framework for efficient DCT-based video coding

The rapidly expanding field of multimedia communications has fueled significant research and development in real-time video encoding. Dedicated hardware solutions have matured, and several manufacturers are developing cost-efficient hardware encoders. Software solutions based on a general-purpose processor or a programmable digital signal processor (DSP) nevertheless have significant merits. To enable such solutions, we have developed a flexible video encoding framework that yields very good computation-performance tradeoffs. The framework consists of a set of optimized core components: motion estimation (ME), the discrete cosine transform (DCT), quantization, and mode selection. Each component can be configured to achieve a desired computation-performance tradeoff, and the components can be assembled into encoders of varying computational complexity. Computation control is implemented within the framework so that the resulting algorithms adapt to the available computational resources. The framework was applied to MPEG-2 and H.263 encoding on Intel's Pentium/MMX desktop processor, where excellent speed-performance tradeoffs were obtained.
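The idea of a component that can be configured for a desired computation-performance tradeoff can be illustrated with a minimal sketch. The code below is not the framework's algorithm: it is a plain block-matching motion search using the sum of absolute differences (SAD), with a hypothetical `max_checks` parameter that caps how many candidate vectors are evaluated. The function names, the candidate ordering, and the synthetic test frames are all assumptions made for illustration.

```python
import random


def make_shifted_pair(size=32, shift_y=2, shift_x=-1, seed=0):
    """Build a random reference frame and a cyclically shifted current frame.

    Purely synthetic test data (an assumption for this sketch).
    """
    rng = random.Random(seed)
    ref = [[rng.randrange(256) for _ in range(size)] for _ in range(size)]
    cur = [[ref[(y - shift_y) % size][(x - shift_x) % size]
            for x in range(size)] for y in range(size)]
    return cur, ref


def sad(cur, ref, top, left, dy, dx, block):
    """Sum of absolute differences between a current block and a displaced reference block."""
    return sum(abs(cur[top + r][left + c] - ref[top + dy + r][left + dx + c])
               for r in range(block) for c in range(block))


def budgeted_motion_search(cur, ref, top, left, block=8, search_range=4,
                           max_checks=None):
    """Return (dy, dx, best_sad, checks_used) for one block.

    Candidates are visited in order of increasing |dy| + |dx|, so a small
    max_checks (hypothetical computation budget) keeps only vectors near zero.
    """
    if max_checks is not None and max_checks < 1:
        raise ValueError("max_checks must be at least 1")
    height, width = len(ref), len(ref[0])
    candidates = [(dy, dx)
                  for dy in range(-search_range, search_range + 1)
                  for dx in range(-search_range, search_range + 1)]
    candidates.sort(key=lambda v: (abs(v[0]) + abs(v[1]), v))

    best = None
    checks = 0
    for dy, dx in candidates:
        if max_checks is not None and checks >= max_checks:
            break  # computation budget exhausted
        y, x = top + dy, left + dx
        if y < 0 or x < 0 or y + block > height or x + block > width:
            continue  # displaced block falls outside the reference frame
        cost = sad(cur, ref, top, left, dy, dx, block)
        checks += 1
        if best is None or cost < best[2]:
            best = (dy, dx, cost)
    return best[0], best[1], best[2], checks
```

Calling `budgeted_motion_search(cur, ref, 12, 12)` on frames from `make_shifted_pair()` searches every candidate in the window, while passing `max_checks=5` stops after the five vectors nearest zero. The smaller budget costs less computation but can miss the true displacement, which is the kind of tradeoff the abstract describes at the component level.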
