A Class of an Almost-Optimal Size-Independent Parallel Prefix Circuits

Prefix computation is one of the fundamental problems that can be used in many applications such as fast adders. Most proposed parallel prefix circuits assume that the circuit is of the same width as the input size. In this paper, we present a class of parallel prefix circuits that perform well when the input size, n, is more than the width of the circuit, m. That is, the proposed circuit is an almost optimal in speed when n >; m. Specifically, we derive a lower bound for the depth of the circuit and prove that the circuit requires one time step more than the optimal number of time steps needed to generate its first output. We also show that the size of the circuit is optimal within one. The input is divided into subsets each of width m-1 and presented to the circuit in subsequent time steps. The circuit is compared to other circuits to show its outperforming speed. The circuit is faster than any other circuit of the same width and fan-out.

[1]  D. S. SzyId,et al.  Parallel Computation: Models And Methods , 1998, IEEE Concurrency.

[2]  Chung-Kuan Cheng,et al.  Constructing Zero-deficiency Parallel Prefix Circuits of Minimum Depth , 2005 .

[3]  Torben Hagerup The Parallel Complexity of Integer Prefix Summation , 1995, Inf. Process. Lett..

[4]  Yen-Chun Lin,et al.  Z4: A New Depth-Size Optimal Parallel Prefix Circuit With Small Depth , 2003, Neural Parallel Sci. Comput..

[5]  Carl V. Page,et al.  Parallel tree contraction and prefix computations on a large family of interconnection topologies , 1995, Acta Informatica.

[6]  Yen-Chun Lin,et al.  Faster optimal parallel prefix circuits: New algorithmic construction , 2005, J. Parallel Distributed Comput..

[7]  Richard Cole,et al.  Faster Optimal Parallel Prefix Sums and List Ranking , 2011, Inf. Comput..

[8]  Giorgos Dimitrakopoulos,et al.  High-speed parallel-prefix VLSI Ling adders , 2005, IEEE Transactions on Computers.

[9]  Yen-Chun Lin,et al.  A new approach to constructing optimal parallel prefix circuits with small depth , 2004, J. Parallel Distributed Comput..

[10]  Yen-Chun Lin Optimal Parallel Prefix Circuits with Fan-Ot 2 and Corresponding Parallel Algorithms , 1999, Neural Parallel Sci. Comput..

[11]  Ronald L. Graham,et al.  On the construction of zero-deficiency parallel prefix circuits with minimum depth , 2006, TODE.

[12]  F. Leighton,et al.  Introduction to Parallel Algorithms and Architectures: Arrays, Trees, Hypercubes , 1991 .

[13]  Amitava Datta Multiple Addition and Prefix Sum on a Linear Array with a Reconfigurable Pipelined Bus System , 2004, The Journal of Supercomputing.

[14]  Mary Sheeran,et al.  A new approach to the design of optimal parallel prefix circuits , 2006 .

[15]  John B. Shoven,et al.  I , Edinburgh Medical and Surgical Journal.

[16]  Li-Ling Hung,et al.  Fast problem-size-independent parallel prefix circuits , 2009, J. Parallel Distributed Comput..