An area-efficient 32-bit floating point multiplier using hybrid GPPs addition

In this paper, we proposed a new design of hybrid adder for area-efficient 32-bit floating point multiplier. By combining conventional ripple carry adder (RCA) and Wallace tree adder for adding Generated Partial Products (GPPs), the speed can be improved. Toom-3 multiplication method applied on 24×24 mantissa multiplier with a reduced complexity of (n1.465). Pre-determined Partial Products Generation (3PG) methods reduce the height of the GPPs to (N/3)/4 for N=24-bit unsigned operands. This is a contrast to Modified Booth Encoding (MBE) GPPs reduction height of N/2. This reduction can use to save area. The design is synthesized on TSMC 0.13 μm CMOS with 62% less area when compared to MBE based FP multiplier.

[1]  Michael J. Schulte,et al.  Dual-mode floating-point multiplier architectures with parallel operations , 2006, J. Syst. Archit..

[2]  Gang Zhou,et al.  Complexity Analysis and Efficient Implementations of Bit Parallel Finite Field Multipliers Based on Karatsuba-Ofman Algorithm on FPGAs , 2010, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[3]  Vitit Kantabutra A Recursive Carry-Lookahead/Carry-Select Hybrid Adder , 1993, IEEE Trans. Computers.

[4]  Hossam A. H. Fahmy,et al.  Area Efficient and Fast Combined Binary/Decimal Floating Point Fused Multiply Add Unit , 2017, IEEE Transactions on Computers.

[5]  Michael J. Schulte,et al.  Decimal Floating-Point Multiplication Via Carry-Save Addition , 2007, 18th IEEE Symposium on Computer Arithmetic (ARITH '07).

[6]  Youngmin Kim,et al.  Energy-efficient hybrid adder design by using inexact lower bits adder , 2016, 2016 IEEE Asia Pacific Conference on Circuits and Systems (APCCAS).

[7]  Mark Vesterbacka,et al.  A high-speed low-latency digit-serial hybrid adder , 2004, 2004 IEEE International Symposium on Circuits and Systems (IEEE Cat. No.04CH37512).

[8]  Shyh-Jye Jou,et al.  Low-error reduced-width Booth multipliers for DSP applications , 2003 .

[9]  Sri Parameswaran,et al.  Configurable Multimode Embedded Floating-Point Units for FPGAs , 2011, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[10]  Shiann-Rong Kuang,et al.  Energy-Efficient Multiple-Precision Floating-Point Multiplier for Embedded Applications , 2013, J. Signal Process. Syst..

[11]  Michael J. Schulte,et al.  Multiplier architectures for media processing , 2003, The Thrity-Seventh Asilomar Conference on Signals, Systems & Computers, 2003.

[12]  Earl E. Swartzlander,et al.  A comparison of Dadda and Wallace multiplier delays , 2003, SPIE Optics + Photonics.

[13]  Yu-Jen Huang,et al.  A design methodology for hybrid carry-lookahead/carry-select adders with reconfigurability , 2005, 2005 IEEE International Symposium on Circuits and Systems.

[14]  Gaofeng Wang,et al.  Performance effects of pipeline architecture on an FPGA-based binary32 floating point multiplier , 2013, Microprocess. Microsystems.

[15]  J.E. Stine,et al.  A hybrid Ling carry-select adder , 2004, Conference Record of the Thirty-Eighth Asilomar Conference on Signals, Systems and Computers, 2004..

[16]  Michael J. Schulte,et al.  Memory latency consideration for load sharing on heterogeneous network of workstations , 2006 .

[17]  Earl E. Swartzlander,et al.  A Reduced Complexity Wallace Multiplier Reduction , 2010, IEEE Transactions on Computers.

[18]  Chiou-Yng Lee,et al.  Subquadratic space complexity digit-serial multiplier over binary extension fields using Toom-Cook algorithm , 2014, 2014 International Symposium on Integrated Circuits (ISIC).

[19]  Sivanantham Sathasivam,et al.  An Efficient Single Precision Floating Point Multiplier Architecture based on Classical Recoding Algorithm , 2016 .

[20]  Shiann-Rong Kuang,et al.  Variable-Latency Floating-Point Multipliers for Low-Power Applications , 2010, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.