Introducing Replicated VLSI to Supercomputing: the FPS-164/MAX Scientific Computer

Large-scale scientific computing is dominated by the "crunching of numbers." By performing billions of multiplies and adds on millions of simulated data points, scientists can model physical phenomena, like the flow of air around a supersonic airplane or the movement of storms in the atmosphere. The data values range over more than fifty orders of magnitude, from the inside of a quark to the circumference of the universe. Because a simulation run is made up of thousands of iterated computation steps, at least eight significant digits must be retained by each arithmetic operation to avoid excessive round-off errors in the results. The 16and 32-bit integer arithmetic common in business computing does not provide adequate range or accuracy for scientific use. Instead, scientific applications normally use 64-bit floating-point arithmetic. IEEE standard double-precision arithmetic, I for example, provides a dynamic range of 616 orders ofmagnitude and a precision of 15 significant digits. Because arithmetic speed usually dominates overall execution times, the performance of scientific computers is traditionally measured in MFLOPs, or millions of floating-point operations per second. Current processor implementations range from a speed ofone twentieth MFLOP for the best personal computer CPU to several hundred MFLOPs per supercomputer CPU. VLSI parts for number crunching

[1]  Alan E. Charlesworth,et al.  An Approach to Scientific Array Processing: The Architectural Design of the AP-120B/FPS-164 Family , 1981, Computer.

[2]  D. Stevenson A Proposed Standard for Binary Floating-Point Arithmetic , 1981, Computer.

[3]  Joseph Cavanagh,et al.  Digital Computer Arithmetic , 1983 .

[4]  Ralph Grishman,et al.  The NYU Ultracomputer—Designing an MIMD Shared Memory Parallel Computer , 1983, IEEE Transactions on Computers.

[5]  Gordon Bell,et al.  C.mmp: a multi-mini-processor , 1972, AFIPS '72 (Fall, part II).

[6]  Geoffrey C. Fox,et al.  Algorithms for concurrent processors , 1984 .

[7]  C. Norrie Supercomputers for Superproblems: An Architectural Introduction , 1984, Computer.

[8]  Jan Fandrianto,et al.  VLSI floating-point processors , 1985, 1985 IEEE 7th Symposium on Computer Arithmetic (ARITH).

[9]  A. Gottleib,et al.  The nyu ultracomputer- designing a mimd shared memory parallel computer , 1983 .

[10]  H. T. Kung,et al.  A systolic array computer , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[11]  Charles L. Seitz,et al.  Concurrent VLSI Architectures , 1984, IEEE Transactions on Computers.

[12]  Charles L. Seitz,et al.  The cosmic cube , 1985, CACM.

[13]  F. Gustavson,et al.  Implementing Linear Algebra Algorithms for Dense Matrices on a Vector Pipeline Machine , 1984 .

[14]  Jack J. Dongarra,et al.  Performance of various computers using standard linear equations software in a Fortran environment , 1987, SGNM.

[15]  Charles L. Lawson,et al.  Basic Linear Algebra Subprograms for Fortran Usage , 1979, TOMS.

[16]  V. Vemuri,et al.  Digital Computer Treatment of Partial Differential Equations , 1981 .