Mesh of Linear Arrays for Template Matching

This paper presents the architecture and implementation of template matching on a 3-D piecewise regular processor space that forms a two-dimensional array of linear systolic arrays. Template matching can be considered as a 2-D convolution of an image of size N × N with a kernel of size r × r. Conventional high-speed implementations use 2-D systolic arrays of size O(r²), which compute in O(N²) time. The drawback of this solution is that the size of the processor array depends on the size of the convolution kernel, so more processors cannot be allocated to meet real-time requirements. With the approach used in this paper, the size of the processor array may be extended up to O(sr²), 1 ≤ s ≤ N, thereby completing the calculations in O(N²/s) time. In the case s = r, the result is an r × r mesh of 1-D systolic arrays of size O(r). The piecewise regularity of the 3-D processor array also allows easy physical realization.
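
To make the operation counts above concrete, the following is a minimal sequential sketch in Python, not the paper's systolic design. It assumes the unflipped-kernel (correlation) form of the 2-D convolution and computes only the positions where the kernel fits entirely inside the image. The function names, and the interleaved row split used to illustrate the factor s, are hypothetical and chosen only for illustration; the paper's actual space-time mapping is not reproduced here.

```python
def template_match(image, kernel):
    """Sum-of-products match of an N x N image with an r x r kernel (valid positions only)."""
    n, r = len(image), len(kernel)
    m = n - r + 1  # number of valid kernel positions per dimension
    out = [[0] * m for _ in range(m)]
    for i in range(m):
        for j in range(m):
            # r*r multiply-accumulates per output point; an O(r^2) array
            # would perform these in parallel, leaving O(N^2) time steps.
            out[i][j] = sum(image[i + u][j + v] * kernel[u][v]
                            for u in range(r) for v in range(r))
    return out


def template_match_striped(image, kernel, s):
    """Same result, with output rows split into s interleaved groups.

    Assumption for illustration: each group could be served by its own
    r x r block of cells, so s groups use O(s*r^2) cells and each group
    handles roughly 1/s of the output rows.
    """
    n, r = len(image), len(kernel)
    m = n - r + 1
    out = [[0] * m for _ in range(m)]
    for group in range(s):
        for i in range(group, m, s):  # rows group, group + s, group + 2s, ...
            for j in range(m):
                out[i][j] = sum(image[i + u][j + v] * kernel[u][v]
                                for u in range(r) for v in range(r))
    return out


if __name__ == "__main__":
    img = [[1, 2, 3, 4],
           [5, 6, 7, 8],
           [9, 10, 11, 12],
           [13, 14, 15, 16]]
    ker = [[1, 0],
           [0, 1]]
    print(template_match(img, ker))
    print(template_match_striped(img, ker, 2))
```

Both functions return the same match surface; the second only regroups the independent output rows, which is the property that lets additional processors be allocated without changing the kernel size.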
