Parallel execution of a Fortran 77 weather prediction model

This work presents the first MPP execution of the ARPS storm prediction code, und it is a significant step beyond the best previous runs on the fustest machines uva~lable. ARPS, a real third-party application developed in Fortran 77 by scientists for their own use, was greatly sped up though execution on a massively parallel machine. There are three notable features to our erLtry. First, the execution was made possible because of new automatic parallelizing compiler technology, a new tool called CMAX. The entire 65,000 line program was translated, compiled, and then executed on a CM-5; our results were achieved with the help of G’MAX rather than brute force rewriting. Second, we achieved good speedups as we increased the number of processors. Finally, performance on large CM-5’S is sign~ficantly better than performance on any previous ARPS platform. We obtained the following results. First, a speedup of 907 on a 1024 processor CM-5. The best previous speedup of ARPS on a parallel platform was 8..2. Second, that run was 57 times faster than the fastest previously available ARPS platform with published timings, a Cray Y-MP. Finally, we achieved at least 2 times the price/performance of a Cray Y-MP.

[1]  Gyan Bhanot,et al.  FLO67: a case study in scalable programming , 1992 .

[2]  Gary Sabot,et al.  Compiler parallelization of an elliptic grid generator for 1990 Gordon Bell prize , 1991, Proceedings of the 1991 ACM/IEEE Conference on Supercomputing (Supercomputing '91).

[3]  G. Sabot A compiler for a massively parallel distributed memory MIMD computer , 1992, [Proceedings 1992] The Fourth Symposium on the Frontiers of Massively Parallel Computation.

[4]  Gary Sabot,et al.  CMAX: a Fortran translator for the connection machine system , 1993, ICS '93.

[5]  G. Sabot Optimized CM Fortran compiler for the Connection Machine computer , 1992, Proceedings of the Twenty-Fifth Hawaii International Conference on System Sciences.

[6]  Ming Xue,et al.  Distributed Processing of a Regional Prediction Model , 1994 .

[7]  Skef Wholey Automatic data mapping for distributed-memory parallel computers , 1992, ICS '92.