Performance Evaluation of the SGI Origin2000: A Memory-Centric Characterization of LANL ASCI Applications

We compare the single-processor performance of the SGI Origin and PowerChallenge and use a previously reported performance model for hierarchical memory systems to explain the results. Both machines use the same microprocessor (MIPS R10000) but differ significantly in their memory subsystems. Our memory model accounts for overlap between CPU and memory operations, which allows us to infer the individual contribution of each of the three improvements in the Origin's memory architecture and to relate the effectiveness of each improvement to application characteristics.
