论文信息 - Software floating-point computation on parallel mahcines

Software floating-point computation on parallel mahcines

This thesis examines the ability to optimize the performance of software floating-point (FP) operations on parallel architectures. In particular, instruction level parallelism (ILP) of FP operations is explored, optimization techniques are proposed, and efficient algorithms are developed. In our method, FP operations such as FP add, are decomposed into a set of primitive integer and logic operations, such as integer adds and shifts, and the primitive operations are then scheduled on a parallel architecture. The algorithms for fast division and square root computation also enable the hardware FP unit to be clocked at a faster rate. The design and analysis of such a system is detailed and is tested on Raw, a software-exposed parallel architecture. Results show that division and square root implementations achieve reasonable performance compared to a hardware FP unit. Thesis Supervisor: Anant Agarwal Title: Professor

Michael Ruogu Zhang

[1] M. F.,et al. Bibliography , 1985, Experimental Gerontology.

[2] Michael J. Flynn,et al. Introduction to Arithmetic for Digital Systems Designers , 1995 .

[3] Michael J. Flynn,et al. USING A FLOATING-POINT MULTIPLIER'S INTERNALS FOR HIGH-RADIX DIVISION AND SQUARE ROOT , 1993 .

[4] Guido D. Salvucci,et al. Ieee standard for binary floating-point arithmetic , 1985 .

[5] Vivek Sarkar,et al. Baring it all to Software: The Raw Machine , 1997 .

[6] Vivek Sarkar,et al. The Raw Compiler Project , 1999 .

[7] Rajeev Barua,et al. Memory bank disambiguation using modulo unrolling for Raw machines , 1998, Proceedings. Fifth International Conference on High Performance Computing (Cat. No. 98EX238).

[8] Stuart Franklin Oberman,et al. Design issues in high performance floating point arithmetic units , 1996 .

[9] Robert E Goldschmidt,et al. Applications of division by convergence , 1964 .

[10] David A. Patterson,et al. Computer Architecture: A Quantitative Approach , 1969 .

[11] William J. Dally. Micro-optimization of floating-point operations , 1989, ASPLOS III.