Generating fast logic circuits for m-select n-port Round Robin Arbitration

This paper generalizes the problem of Round Robin Arbitration (RRA) from 1-select to m-select (mRRA) and offers new circuit architectures for it. RRAs are found in networking equipment and computer systems with high throughput buses. We first propose fast/novel circuits for the fundamental problem of finding the first m 1's in an n-bit vector (from the left or right), i.e., generalized m-select Priority Encoder (mPE). The obvious solution to mPE is cascading m regular (1-select) PEs. Our solutions, however, are based on parallel prefix networks, where the nodes are replaced by “saturated adder”s. We use mPE as a building block to construct an mRRA, which has single cycle latency and can arbitrate up to m requests per clock cycle. We took two arbiters from the literature, TC-PPA (1-select) and 3DP2S (2-select), and generalized them into mRRAs, which we call mTC-PPA and 3DPmS-RRA. We wrote fully parameterized HDL code generators. Logic synthesis results show that mTC-PPA and 3DPmS-RRA are up to 100% faster than the cascade solution and have up to 65% smaller Area-Timing Products (ATP). Comparing mTC-PPA and 3DPmS-RRA, 3DPmS-RRA circuits are slightly faster than mTC-PPA on the average. In terms of ATP, mTC-PPA is superior by far and can be as small as 30% of 3DPmS-RRA.

[1]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[2]  H. T. Kung,et al.  A Regular Layout for Parallel Adders , 1982, IEEE Transactions on Computers.

[3]  David Harris,et al.  A taxonomy of parallel prefix networks , 2003, The Thrity-Seventh Asilomar Conference on Signals, Systems & Computers, 2003.

[4]  Nick McKeown,et al.  Designing and implementing a fast crossbar scheduler , 1999, IEEE Micro.

[5]  Jer-Min Jou,et al.  An Optimal Round-Robin Arbiter Design for NoC , 2010, J. Inf. Sci. Eng..

[6]  Harold S. Stone,et al.  A Parallel Algorithm for the Efficient Solution of a General Class of Recurrence Equations , 1973, IEEE Transactions on Computers.

[7]  Mei Yang,et al.  Algorithm-Hardware Codesign of Fast Parallel Round-Robin Arbiters , 2007, IEEE Transactions on Parallel and Distributed Systems.

[8]  W. Marsden I and J , 2012 .

[9]  Tack-Don Han,et al.  Fast area-efficient VLSI adders , 1987, 1987 IEEE 8th Symposium on Computer Arithmetic (ARITH).

[10]  Suhwan Kim,et al.  Fast three-dimensional programmable two-selector , 2004 .

[11]  H. Fatih Ugurdag,et al.  Fast parallel prefix logic circuits for n2n round-robin arbitration , 2012, Microelectron. J..

[12]  Giorgos Dimitrakopoulos,et al.  Fast arbiters for on-chip network switches , 2008, 2008 IEEE International Conference on Computer Design.

[13]  C. Emanuel Savin,et al.  Binary tree search architecture for efficient implementation of round robin arbiters , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[14]  Nick McKeown,et al.  The Tiny Tera: A Packet Switch Core , 1998, IEEE Micro.

[15]  Onur Baskirt,et al.  Fast two-pick n2n round-robin arbiter circuit , 2012 .