A Performance Tool for Earth System Models Development

We present a performance tool for Earth system model development that aids in analyzing the performance of climate modeling applications. Existing tools have difficulty handling the complex, coupled structure and the long execution times of these models. Our tool provides rapid analysis based on statistical sampling and grouped aggregation of the calling relationships and of the actual computing resource consumption, excluding waiting losses. Using this tool, we study the ocean model POP through short-term sampling and analyze its scaling bottleneck and acceleration trend. Measurements of its entire execution confirm our predictions of its scaling efficiency.
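The sampling-and-aggregation idea can be illustrated with a minimal sketch. It is not the tool's implementation: the function `aggregate_samples`, the set `WAIT_ROUTINES`, the call-stack representation, and the example routine names are all assumptions made for illustration. Sampled call stacks are grouped by call path, and samples whose innermost frame is a waiting routine are counted separately so that they do not inflate the compute estimate.

```python
from collections import Counter

# Assumption: routines treated as "waiting". The abstract does not list them.
WAIT_ROUTINES = {"MPI_Wait", "MPI_Waitall", "MPI_Barrier"}


def aggregate_samples(samples, interval_s):
    """Group sampled call stacks by call path and estimate time per path.

    samples: iterable of tuples of function names, outermost caller first.
    interval_s: sampling interval in seconds (each sample stands for this much time).
    Returns (compute_seconds_by_path, wait_seconds).
    """
    compute_counts = Counter()
    wait_count = 0
    for stack in samples:
        # A sample whose innermost frame is a waiting routine is a waiting loss.
        if stack and stack[-1] in WAIT_ROUTINES:
            wait_count += 1
            continue
        compute_counts[" > ".join(stack)] += 1
    compute_seconds = {path: n * interval_s for path, n in compute_counts.items()}
    return compute_seconds, wait_count * interval_s


# Hypothetical samples; the routine names are illustrative only.
samples = [
    ("main", "step", "baroclinic"),
    ("main", "step", "baroclinic"),
    ("main", "step", "barotropic", "MPI_Barrier"),
    ("main", "step", "barotropic"),
]
by_path, waited = aggregate_samples(samples, interval_s=0.5)
```

Because each sample stands for one sampling interval, the per-path totals are statistical estimates whose accuracy improves with the number of samples collected.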
