Enabling dual-core mode in BlueGene/L: challenges and solutions

BlueGene/L is a massively parallel computer system with 65,536 dual-processor compute nodes. Its peak performance exceeds 360 TFLOP/s when both processor cores in each node are used for computation. The main challenge in deploying this dual-core mode of operation is that the L1 caches of the two cores are not kept coherent by hardware. Coherence must therefore be managed in software, a constraint that guides our design of a programming model for dual-core mode. We describe the design, implementation, and performance evaluation of system software that enables dual-core mode on BlueGene/L. Our preliminary performance results show that this approach is effective for key numerical kernels.
