High throughput and low memory access sub-pixel interpolation architecture for H.264/AVC HDTV decoder

In this paper, we propose a parallel, pipelined architecture for the sub-pixel interpolation filter in an H.264/AVC-compliant HDTV decoder. To use bus bandwidth efficiently, we introduce three memory-access optimization strategies that avoid redundant data transfers and improve data-bus utilization. To improve processing throughput, we use a parallel, multi-stage pipeline that performs data transmission and interpolation filtering concurrently. Moreover, to balance the tradeoff between the memory-access scheme and the processing granularity of sub-pixel interpolation, we devise a dedicated buffer organization that converts tree-structured, variable-block-size reads into fixed-size, sequential processing. Compared with traditional designs, our scheme reduces memory data transfer by 60%. Clocked at 66 MHz, our design supports a processing throughput of 1280 × 720 @ 30 Hz. The proposed design is suitable for low-cost, real-time applications and can easily be integrated into system-on-chip designs.
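As an illustration of the workload the abstract describes, the following minimal sketch shows the luma sub-pixel arithmetic defined by the H.264/AVC standard (a 6-tap filter with taps 1, -5, 20, 20, -5, 1 for half-sample positions, and rounded averaging for quarter-sample positions), together with the per-macroblock cycle budget implied by the quoted 66 MHz clock and 1280 × 720 @ 30 Hz throughput. It is not the proposed hardware architecture; the function names are hypothetical and chosen only for this example.

```python
def clip8(value):
    """Clamp an intermediate result to the 8-bit sample range."""
    return max(0, min(255, value))


def half_pel(row, i):
    """Half-sample value between row[i] and row[i + 1].

    Uses the H.264/AVC 6-tap luma filter (1, -5, 20, 20, -5, 1),
    so row must provide samples row[i - 2] .. row[i + 3].
    """
    e, f, g, h, i_, j = row[i - 2:i + 4]
    return clip8((e - 5 * f + 20 * g + 20 * h - 5 * i_ + j + 16) >> 5)


def quarter_pel(a, b):
    """Quarter-sample value: rounded average of two neighbouring samples."""
    return (a + b + 1) >> 1


def cycles_per_macroblock(clock_hz, width, height, fps):
    """Average clock cycles available per 16x16 macroblock.

    Assumes width and height are multiples of 16, as for 1280 x 720.
    """
    macroblocks_per_frame = (width // 16) * (height // 16)
    return clock_hz / (macroblocks_per_frame * fps)


if __name__ == "__main__":
    edge = [0, 0, 0, 255, 255, 255]
    print(half_pel(edge, 2))  # half-sample across a step edge
    print(cycles_per_macroblock(66_000_000, 1280, 720, 30))
```

Each half-sample needs six neighbouring integer samples, so a small block still requires a reference window that extends several samples beyond its borders. This is one reason redundant memory transfers arise for small partitions, and at 1280 × 720 @ 30 Hz a 66 MHz clock leaves roughly 611 cycles per macroblock on average for both fetching and filtering.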
