On the Redundant Representation of Partial Remainders in Radix-4 SRT Dividers

Most digit-recurrence algorithms for division, such as the Sweeney–Robertson–Tocher (SRT) algorithm, have been used in order to take advantage of the redundant representations of the partial remainder. This way, full carry propagate additions are avoided, obtaining significant latency improvements. Furthermore, the delay corresponding to one division iteration is independent of the size of the operands. The most frequent redundant form for the partial remainders is the carry-save (CS) representation, which uses 2 bits of representation (carry and sum bits) for each bit of the partial remainder. This paper proposes radix-4 SRT dividers which use (3, 2) redundancy (3 bits of representation for 2 bits of the partial remainder) and (5, 4) redundancy (5 bits of representation for 4 bits of the partial remainder). The goal of using these representations is represented by a decreased cost due to the reduced number of sequential elements required to store the partial remainder. The proposed dividers use 2-bit carry propagate adders and 4-bit carry propagate adders to compute the new partial remainder. Thus, the full carry propagate addition is avoided, while the latency of one division iteration is independent of the operands’ size. The synthesis result for Xilinx Virtex-5 FPGA devices show that similar working frequencies are obtained for divider using the proposed redundant representation with respect to the conventional carry-save, while requiring up to 12% for (3, 2) representation and 18% for (5, 4) representation less sequential elements.

[1]  Michael J. Flynn,et al.  Design Issues in Division and Other Floating-Point Operations , 1997, IEEE Trans. Computers.

[2]  Mark Horowitz,et al.  SRT division architectures and implementations , 1997, Proceedings 13th IEEE Sympsoium on Computer Arithmetic.

[3]  Oana Boncalo,et al.  SRT radix-2 dividers with (5,4) redundant representation of partial remainder , 2013, 2013 NORCHIP.

[4]  Earl E. Swartzlander,et al.  Design of Radix-4 SRT Dividers in 65 Nanometer CMOS Technology , 2006, IEEE 17th International Conference on Application-specific Systems, Architectures and Processors (ASAP'06).

[5]  Alberto Nannarelli Radix-16 Combined Division and Square Root Unit , 2011, 2011 IEEE 20th Symposium on Computer Arithmetic.

[6]  Tomás Lang,et al.  On-the-fly rounding for division and square root , 1989, Proceedings of 9th Symposium on Computer Arithmetic.

[7]  Braden Phillips,et al.  A Fast Radix-4 Floating-Point Divider with Quotient Digit Selection by Comparison Multiples , 2007, Comput. J..

[8]  Tomás Lang,et al.  Fast radix-4 retimed division with selection by comparisons , 2002, Proceedings IEEE International Conference on Application- Specific Systems, Architectures, and Processors.

[9]  Neil Burgess,et al.  Design of the ARM VFP11 Divide and Square Root Synthesisable Macrocell , 2007, 18th IEEE Symposium on Computer Arithmetic (ARITH '07).

[10]  Peter Kornerup Digit selection for SRT division and square root , 2005, IEEE Transactions on Computers.

[11]  Tomás Lang,et al.  On-the-Fly Conversion of Redundant into Conventional Representations , 1987, IEEE Transactions on Computers.

[12]  David M. Russinoff Computation and Formal Verification of SRT Quotient and Square Root Digit Selection Tables , 2013, IEEE Transactions on Computers.

[13]  Hassen Salhi,et al.  Higher Radix and Redundancy Factor for Floating Point SRT Division , 2008, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.