The Genesis distributed-memory benchmarks. Part 2: COMMS1, TRANS1, FFT1 and QCD2 benchmarks on the SUPRENUM and iPSC/860 computers

The Genesis benchmark suite has been assembled to evaluate the performance of distributed-memory MIMD systems. The problems selected all have a scientific origin (mostly from physics or theoretical chemistry), and range from synthetic code fragments designed to measure the basic hardware properties of the computer (especially communication and synchronisation overheads), through commonly used library subroutines, to full application codes. This is the second of a series of papers on the Genesis distributed-memory benchmarks, which were developed under the European ESPRIT research program. Results are presented for the SUPRENUM and iPSC/860 computers when running the following benchmarks: COMMS1 (communications), TRANS1 (matrix transpose), FFT1 (fast Fourier transform) and QCD2 (conjugate gradient kernel). The theoretical predictions are compared with, or fitted to, the measured results, and then used to predict (with due caution) how the performance might scale for larger problems and more processors than were actually available during the benchmarking.
