Energy-Efficient Acceleration of MPEG-4 Compression Tools

We propose novel hardware accelerator architectures for the most computationally demanding algorithms of the MPEG-4 video compression standard: motion estimation, binary motion estimation (for shape coding), and the forward/inverse discrete cosine transforms (including shape-adaptive modes). These accelerators have been designed using general low-energy design philosophies at the algorithmic and architectural abstraction levels. The themes of these philosophies are avoiding waste and trading area and performance for power and energy gains. Each core has been synthesised targeting TSMC 0.09 μm TCBN90LP technology, and the experimental results presented in this paper show that the proposed cores improve upon the prior art.
