Floating-point arithmetic in embedded and reconfigurable computing systems

Modern embedded and reconfigurable systems need to support a wide range of applications, many of which may significantly benefit from hardware support for floating-point arithmetic. Some of these applications include 3D graphics, multiple-input multiple-output (MIMO) wireless communication algorithms, orthogonal frequency division multiplexing (OFDM) based systems, and digital filters. Many of these applications have real-time constraints that cannot tolerate the high latency of software emulated floating-point arithmetic. Moreover, software emulation can lead to higher energy consumption that may be unsuitable for applications in powerconstrained environments. This paper examines applications that can potentially benefit from hardware support for floating-point arithmetic and discusses some approaches taken for floating-point arithmetic in embedded and reconfigurable systems. Precision and range analysis is performed on emerging applications in the MIMO wireless communications domain to investigate the potential for low power floating-point units that utilize reduced precision and exponent range.

[1]  Chen-Yi Lee,et al.  A 2.4-Gsample/s DVFS FFT Processor for MIMO OFDM Communication Systems , 2008, IEEE Journal of Solid-State Circuits.

[2]  Rob A. Rutenbar,et al.  Lightweight Floating-Point Arithmetic: Case Study of Inverse Discrete Cosine Transform , 2002, EURASIP J. Adv. Signal Process..

[3]  Emanuele Viterbo,et al.  Signal Space Diversity: A Power- and Bandwidth-Efficient Diversity Technique for the Rayleigh Fading Channel , 1998, IEEE Trans. Inf. Theory.

[4]  H. Sakai,et al.  A block floating-point treatment to the LMS algorithm: efficient realization and a roundoff error analysis , 2005, IEEE Transactions on Signal Processing.

[5]  Gerhard Fettweis,et al.  A Hierarchical Block-Floating-Point Arithmetic , 2000, J. VLSI Signal Process..

[6]  P. Balsara,et al.  A Fixed-Point Implementation for QR Decomposition , 2006, 2006 IEEE Dallas/CAS Workshop on Design, Applications, Integration and Software.

[7]  Karl S. Hemmert,et al.  An analysis of the double-precision floating-point FFT on FPGAs , 2005, 13th Annual IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM'05).

[8]  Gerald Matz,et al.  Low-Complexity and Full-Diversity MIMO Detection Based on Condition Number Thresholding , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[9]  Karl S. Hemmert,et al.  A comparison of floating point and logarithmic number systems for FPGAs , 2005, 13th Annual IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM'05).

[10]  Antonio M. Vidal,et al.  Parallelization of Sphere-Decoding Methods , 2008, VECPAR.

[11]  M. Rupp On the influence of uncertainties in MIMO decoding algorithms , 2002, Conference Record of the Thirty-Sixth Asilomar Conference on Signals, Systems and Computers, 2002..

[12]  Florent de Dinechin,et al.  When FPGAs are better at floating-point than microprocessors , 2008, FPGA '08.

[13]  Karl S. Hemmert,et al.  Architectural Modifications to Enhance the Floating-Point Performance of FPGAs , 2008, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[14]  Zlatko Drmac,et al.  New Fast and Accurate Jacobi SVD Algorithm. I , 2007, SIAM J. Matrix Anal. Appl..

[15]  James Demmel,et al.  IEEE Standard for Floating-Point Arithmetic , 2008 .

[16]  Vincent Lefèvre,et al.  MPFR: A multiple-precision binary floating-point library with correct rounding , 2007, TOMS.

[17]  An-Yeu Wu,et al.  VLSI Design of a Variable-Length FFT/IFFT Processor for OFDM-Based Communication Systems , 2003, EURASIP J. Adv. Signal Process..

[18]  Wayne Luk,et al.  Unifying bit-width optimisation for fixed-point and floating-point designs , 2004, 12th Annual IEEE Symposium on Field-Programmable Custom Computing Machines.

[19]  Georgios B. Giannakis,et al.  Space-time coding for broadband wireless communications , 2003, Wirel. Commun. Mob. Comput..

[20]  Keith D. Underwood,et al.  FPGAs vs. CPUs: trends in peak floating-point performance , 2004, FPGA '04.

[21]  Florent de Dinechin,et al.  An FPGA-specific approach to floating-point accumulation and sum-of-products , 2008, 2008 International Conference on Field-Programmable Technology.

[22]  A. Oppenheim,et al.  Realization of digital filters using block-floating-point arithmetic , 1970 .

[23]  J. Nash Compact Numerical Methods for Computers , 2018 .

[24]  Babak Hassibi,et al.  On the sphere-decoding algorithm I. Expected complexity , 2005, IEEE Transactions on Signal Processing.

[25]  Arogyaswami Paulraj,et al.  Introduction to Space-Time Codes , .

[26]  Christopher C. Paige,et al.  Loss and Recapture of Orthogonality in the Modified Gram-Schmidt Algorithm , 1992, SIAM J. Matrix Anal. Appl..

[27]  William J. Dally Micro-optimization of floating-point operations , 1989, ASPLOS III.

[28]  Jaakko Astola,et al.  Roundoff errors in block-floating-point systems , 1996, IEEE Trans. Signal Process..