论文信息 - An area-efficient 32-bit floating point multiplier using hybrid GPPs addition

An area-efficient 32-bit floating point multiplier using hybrid GPPs addition

In this paper, we proposed a new design of hybrid adder for area-efficient 32-bit floating point multiplier. By combining conventional ripple carry adder (RCA) and Wallace tree adder for adding Generated Partial Products (GPPs), the speed can be improved. Toom-3 multiplication method applied on 24×24 mantissa multiplier with a reduced complexity of (n1.465). Pre-determined Partial Products Generation (3PG) methods reduce the height of the GPPs to (N/3)/4 for N=24-bit unsigned operands. This is a contrast to Modified Booth Encoding (MBE) GPPs reduction height of N/2. This reduction can use to save area. The design is synthesized on TSMC 0.13 μm CMOS with 62% less area when compared to MBE based FP multiplier.

S. Sivanantham | J. Jean Jenifer Nesam

[1] Michael J. Schulte,et al. Dual-mode floating-point multiplier architectures with parallel operations , 2006, J. Syst. Archit..

[2] Gang Zhou,et al. Complexity Analysis and Efficient Implementations of Bit Parallel Finite Field Multipliers Based on Karatsuba-Ofman Algorithm on FPGAs , 2010, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[3] Vitit Kantabutra. A Recursive Carry-Lookahead/Carry-Select Hybrid Adder , 1993, IEEE Trans. Computers.

[4] Hossam A. H. Fahmy,et al. Area Efficient and Fast Combined Binary/Decimal Floating Point Fused Multiply Add Unit , 2017, IEEE Transactions on Computers.

[5] Michael J. Schulte,et al. Decimal Floating-Point Multiplication Via Carry-Save Addition , 2007, 18th IEEE Symposium on Computer Arithmetic (ARITH '07).

[6] Youngmin Kim,et al. Energy-efficient hybrid adder design by using inexact lower bits adder , 2016, 2016 IEEE Asia Pacific Conference on Circuits and Systems (APCCAS).

[7] Mark Vesterbacka,et al. A high-speed low-latency digit-serial hybrid adder , 2004, 2004 IEEE International Symposium on Circuits and Systems (IEEE Cat. No.04CH37512).

[8] Shyh-Jye Jou,et al. Low-error reduced-width Booth multipliers for DSP applications , 2003 .

[9] Sri Parameswaran,et al. Configurable Multimode Embedded Floating-Point Units for FPGAs , 2011, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[10] Shiann-Rong Kuang,et al. Energy-Efficient Multiple-Precision Floating-Point Multiplier for Embedded Applications , 2013, J. Signal Process. Syst..

[11] Michael J. Schulte,et al. Multiplier architectures for media processing , 2003, The Thrity-Seventh Asilomar Conference on Signals, Systems & Computers, 2003.

[12] Earl E. Swartzlander,et al. A comparison of Dadda and Wallace multiplier delays , 2003, SPIE Optics + Photonics.

[13] Yu-Jen Huang,et al. A design methodology for hybrid carry-lookahead/carry-select adders with reconfigurability , 2005, 2005 IEEE International Symposium on Circuits and Systems.

[14] Gaofeng Wang,et al. Performance effects of pipeline architecture on an FPGA-based binary32 floating point multiplier , 2013, Microprocess. Microsystems.

[15] J.E. Stine,et al. A hybrid Ling carry-select adder , 2004, Conference Record of the Thirty-Eighth Asilomar Conference on Signals, Systems and Computers, 2004..

[16] Michael J. Schulte,et al. Memory latency consideration for load sharing on heterogeneous network of workstations , 2006 .

[17] Earl E. Swartzlander,et al. A Reduced Complexity Wallace Multiplier Reduction , 2010, IEEE Transactions on Computers.

[18] Chiou-Yng Lee,et al. Subquadratic space complexity digit-serial multiplier over binary extension fields using Toom-Cook algorithm , 2014, 2014 International Symposium on Integrated Circuits (ISIC).

[19] Sivanantham Sathasivam,et al. An Efficient Single Precision Floating Point Multiplier Architecture based on Classical Recoding Algorithm , 2016 .

[20] Shiann-Rong Kuang,et al. Variable-Latency Floating-Point Multipliers for Low-Power Applications , 2010, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.