Abstract We present implementation and performance issues of a data parallel version of the National Center for Atmospheric Research (NCAR) Community Climate Model (CCM2). We describe automatic conversion tools used to aid in converting a production code written for a traditional vector architecture to data parallel code suitable for the Thinking Machines Corporation CM-5. Also, we describe the 3-D transposition method used to parallelize the spherical harmonic transforms in CCM2. This method employs dynamic data mapping techniques to improve data locality and parallel efficiency of these computations. We present performance data for the 3-D transposition method on the CM-5 for machine size up to 512 processors. We conclude that the parallel performance of the 3-D transposition method is adversely affected on the CM-5 by short vector lengths and array padding. We also find that the CM-5 spherical harmonic transforms spend about 70% of their execution time in communication. We detail a transposition-based data parallel implementation of the semi-Lagrangian Transport (SLT) algorithm used in CCM2. We analyze two approaches to parallelizing the SLT, called the departure point and arrival point based methods. We develop a performance model for choosing between these methods. We present SLT performance data which shows that the localized horizontal interpolation in the SLT takes 70% of the time, while the data remapping itself only require approximately 16%. We discuss the importance of scalable I/O to CCM2, and present the I/O rates measured on the CM-5. We compare the performance of the data parallel version of CCM2 on a 32-processor CM-5 with the optimized vector code running on a single processor Cray Y-MP. We show that the CM-5 code is 75% faster. We also give the overall performance of CCM2 running at higher resolutions on different numbers of CM-5 processors. We conclude by discussing the significance of these results and their implications for data parallel climate models.
[1]
W. Washington,et al.
An Introduction to Three-Dimensional Climate Modeling
,
1986
.
[2]
Ian T. Foster,et al.
Design and Performance of a Scalable Parallel Community Climate Model
,
1995,
Parallel Comput..
[3]
Paul B. Anderson,et al.
Massive Parallization of NMC's Spectral Model
,
1994,
J. Parallel Distributed Comput..
[4]
James J. Hack,et al.
Computational Design of the NCAR Community Climate Model
,
1995,
Parallel Comput..
[5]
P. Rasch,et al.
Description of the NCAR community climate model (CCM2), June 1993. Technical note
,
1993
.
[6]
P. Rasch,et al.
Computational aspects of moisture transport in global models of the atmosphere
,
1990
.
[7]
Matthias Roth,et al.
Turbulent transfer relationships over an urban surface. I. Spectral characteristics
,
1993
.
[8]
G. Rodrigue.
Parallel Computations
,
1982
.
[9]
John M. Dennis,et al.
A Data Parallel Implementation of the NCAR Community Climate Model (CCM2)
,
1995,
PPSC.
[10]
P. Rasch,et al.
Two-dimensional semi-Lagrangian trans-port with shape-preserving interpolation
,
1989
.
[11]
Paul N. Swarztrauber,et al.
Vectorizing the FFTs
,
1982
.
[12]
James J. Hack,et al.
Climate sensitivity of the NCAR Community Climate Model (CCM2) to horizontal resolution
,
1995
.