论文信息 - Compile-Time and Run-Time Issues in an Auto-Parallelisation System for the Cell BE Processor

Compile-Time and Run-Time Issues in an Auto-Parallelisation System for the Cell BE Processor

We describe compiler and run-time optimisations for effective auto-parallelisation of C++ programs on the Cell BE architecture. Auto-parallelisation is made easier by annotating sieve scopes , which abstract the "read in, compute in parallel, write out" processing paradigm. We show that the semantics of sieve scopes enables data movement optimisations, such as re-organising global memory reads to minimise DMA transfers and streaming reads from uniformly accessed arrays. We also describe run-time optimisations for committing side-effects to main memory. We provide experimental results showing the benefits of our optimisations, and compare the Sieve-Cell system with IBM's OpenMP implementation for Cell.

Paul Keir | Alastair F. Donaldson | Anton Lokhmotov

[1] Anne-Marie Kermarrec,et al. Proceedings of the 13th European international conference on Parallel Processing , 2007 .

[2] Alastair F. Donaldson,et al. Auto-parallelisation of Sieve C++ Programs , 2007, Euro-Par Workshops.

[3] H. Peter Hofstee,et al. Power efficient processor architecture and the cell processor , 2005, 11th International Symposium on High-Performance Computer Architecture.

[4] Andrew Richards,et al. Delayed Side-Effects Ease Multi-core Programming , 2007, Euro-Par.

[5] Leslie G. Valiant,et al. A bridging model for parallel computation , 1990, CACM.

[6] Rosa M. Badia,et al. CellSs: a Programming Model for the Cell BE Architecture , 2006, ACM/IEEE SC 2006 Conference (SC'06).

[7] L. Bougé. Euro-Par 2007 Workshops: Parallel Processing, HPPC 2007, UNICORE Summit 2007, and VHPC 2007, Rennes, France, August 28-31, 2007, Revised Selected Papers , 2008, Euro-Par Workshops.

[8] William J. Dally,et al. Sequoia: Programming the Memory Hierarchy , 2006, International Conference on Software Composition.