Cost Effective VLSI Architectures for Full-Search Block-Matching Motion Estimation Algorithm

In this paper, we present efficient VLSI architectures for full-search block-matching motion estimation (BMME) algorithm. Given a search range, we partition it into sub-search arrays called tiles. By fully exploiting data dependency within a tile, efficient VLSI architectures can be obtained. Using the proposed VLSI architectures, all the block-matchings in a tile can be processed in parallel. All the tiles within a search range can be processed serially or concurrently depending on various requirements. With the consideration of processing speed, hardware cost, and I/O bandwidth, the optimal tile size for a specific video application is analyzed. By partitioning a search range into tiles with appropriate size, flexible VLSI designs with different throughput can be obtained. In this way, cost effective VLSI designs for a wide range of video applications, from H.261 to HDTV, can be achieved.

[1]  Chaitali Chakrabarti,et al.  Architectures for hierarchical and other block matching algorithms , 1995, IEEE Trans. Circuits Syst. Video Technol..

[2]  Liang-Gee Chen,et al.  Parallel architectures for 3-step hierarchical search block-matching algorithm , 1994, IEEE Trans. Circuits Syst. Video Technol..

[3]  Chaur-Heh Hsieh,et al.  VLSI architecture for block-matching motion estimation algorithm , 1992, IEEE Trans. Circuits Syst. Video Technol..

[4]  Michael Stegherr,et al.  Parameterizable VLSI architectures for the full-search block-matching algorithm , 1989 .

[5]  Moon Key Lee,et al.  Flexible VLSI architecture of full search motion estimation for video applications , 1994 .

[6]  Pieter Fockens,et al.  The digital spectrum-compatible HDTV system , 1992, Signal Process. Image Commun..

[7]  Monica S. Lam,et al.  A Loop Transformation Theory and an Algorithm to Maximize Parallelism , 1991, IEEE Trans. Parallel Distributed Syst..

[8]  Yu Hen Hu,et al.  A novel modular systolic array architecture for full-search block matching motion estimation , 1995, IEEE Trans. Circuits Syst. Video Technol..

[9]  P. R. Burfield,et al.  A VLSI implementation study of a 10 Mbit/s video decoder , 1993, Signal Process. Image Commun..

[10]  D. Mlynek,et al.  A multiprocessors architecture for a HDTV motion estimation system , 1992 .

[11]  Chun-Hsien Chou,et al.  A VLSI architecture for real-time and flexible image template matching , 1989 .

[12]  Liang-Gee Chen,et al.  An efficient and simple VLSI tree architecture for motion estimation algorithms , 1993, IEEE Trans. Signal Process..

[13]  Chein-Wei Jen,et al.  Scalable array architecture design for full search block matching , 1995, IEEE Trans. Circuits Syst. Video Technol..

[14]  Ming Lei Liou,et al.  Design Trade-Offs for Real-Time Block Matching Motion Estimation Algorithms , 1995, ACCV.

[15]  Rainer Laur,et al.  VLSI architecture for HDTV motion estimation based on block-matching algorithm , 1994, Proceedings of 7th International Conference on VLSI Design.

[16]  Ming-Ting Sun,et al.  A family of vlsi designs for the motion compensation block-matching algorithm , 1989 .

[17]  Sung Bum Pan,et al.  VLSI architectures for block matching algorithms using systolic arrays , 1996, IEEE Trans. Circuits Syst. Video Technol..

[18]  Vipin Chaudhary,et al.  Minimum dependence distance tiling of nested loops with non-uniform dependences , 1994, Proceedings of 1994 6th IEEE Symposium on Parallel and Distributed Processing.

[19]  Peter A. Ruetz,et al.  A high-performance full-motion video compression chip set , 1992, IEEE Trans. Circuits Syst. Video Technol..

[20]  Peter Pirsch,et al.  Array architectures for block matching algorithms , 1989 .

[21]  Masahiko Yoshimoto,et al.  A half-pel precision motion estimation processor for NTSC-resolution video , 1993, Proceedings of IEEE Custom Integrated Circuits Conference - CICC '93.