COTS clusters vs. the Earth simulator: an application study using IMPACT-3D

In 2002, Japan announced the Earth Simulator - a supercomputer based on low-volume vector processors and a custom network - and reported that computational scientists had used it to achieve 14.9 TFLOPS with the IMPACT-3D code, which is written in high performance Fortran (HPF). Of particular interest was that they had achieved this level of performance using a high-level parallel programming model. There has been considerable concern in the U.S. about the appropriateness of its hardware and software investments in super computing technology. To help assess the U.S. strategy of building systems from commodity-off-the-shelf (COTS) components, we explored using a combination of HPF and scalar compiler technology to tailor IMPACT-3D to microprocessor-based supercomputers and evaluated its performance and scalability on the AlphaServer-based Lemieux cluster at the Pittsburgh Supercomputer Center (PSC). On the Earth Simulator, IMPACT-3D achieved 45% of peak performance on 4096 processors; on 1024 processors of PSC's Lemieux, we achieved 17.29% of peak performance.

[1]  Ken Kennedy,et al.  Advanced optimization strategies in the Rice dHPF compiler , 2002, Concurr. Comput. Pract. Exp..

[2]  Yoshiki Seo,et al.  14.9 TFLOPS Three-Dimensional Fluid Simulation for Fusion Science with HPF on the Earth Simulator , 2002, ACM/IEEE SC 2002 Conference (SC'02).

[3]  C. Angelopoulos High resolution schemes for hyperbolic conservation laws , 1992 .

[4]  J. Mellor-Crummey,et al.  On Reducing Storage Requirement of Scientific Applications , 2003 .

[5]  Wu-chun Feng,et al.  The Quadrics Network: High-Performance Clustering Technology , 2002, IEEE Micro.

[6]  Mitsuo Yokokawa,et al.  16.4-Tflops Direct Numerical Simulation of Turbulence by a Fourier Spectral Method on the Earth Simulator , 2002, ACM/IEEE SC 2002 Conference (SC'02).

[7]  Leonid Oliker,et al.  Scientific Computations on Modern Parallel Vector Systems , 2004, Proceedings of the ACM/IEEE SC2004 Conference.

[8]  Daniel Chavarría-Miranda,et al.  Advanced data-parallel compilation , 2004 .

[9]  Apan Qasem,et al.  Improving Performance with Integrated Program Transformations , 2004 .

[10]  Adolfy Hoisie,et al.  A comparison between the Earth Simulator and AlphaServer systems using predictive application performance models , 2003, Proceedings International Parallel and Distributed Processing Symposium.

[11]  Hiroshi Takahara,et al.  A 26.58 Tflops Global Atmospheric Simulation with the Spectral Transform Method on the Earth Simulator , 2002, ACM/IEEE SC 2002 Conference (SC'02).

[12]  Vikram S. Adve,et al.  Using integer sets for data-parallel program analysis and optimization , 1998, PLDI.

[13]  Vikram S. Adve,et al.  High Performance Fortran Compilation Techniques for Parallelizing Scientific Codes , 1998, Proceedings of the IEEE/ACM SC98 Conference.

[14]  John M. Mellor-Crummey,et al.  An evaluation of data-parallel compiler support for line-sweep applications , 2002, Proceedings.International Conference on Parallel Architectures and Compilation Techniques.