论文信息 - Binding Nested OpenMP Programs on Hierarchical Memory Architectures

Binding Nested OpenMP Programs on Hierarchical Memory Architectures

In this work we discuss the performance problems of nested OpenMP programs concerning thread and data locality particularly on cc-NUMA architectures. We provide a user friendly solution and demonstrate its benefits by comparing the performance of some kernel benchmarks and some real-world applications with and without applying our affinity optimizations.

Dirk Schmidl | Christian Terboven | Dieter an Mey | H. Martin Bücker

[1] Dieter an Mey,et al. Nested Parallelization of the Flow Solver TFS Using the ParaWise Parallelization Environment , 2006, IWOMP.

[2] Bronis R. de Supinski,et al. OpenMP Shared Memory Parallel Programming - International Workshops, IWOMP 2005 and IWOMP 2006, Eugene, OR, USA, June 1-4, 2005, Reims, France, June 12-15, 2006. Proceedings , 2008, IWOMP.

[3] Samuel Thibault,et al. An Efficient OpenMP Runtime System for Hierarchical Arch , 2007, IWOMP.

[4] Barbara Chapman. A Practical Programming Model for the Multi-Core Era, 3rd International Workshop on OpenMP, IWOMP 2007, Beijing, China, June 3-7, 2007, Proceedings , 2008, IWOMP.

[5] Christoph Clauser,et al. Numerical simulation of reactive flow in hot aquifers : SHEMAT and processing SHEMAT , 2003 .

[6] Dirk Schmidl,et al. Data and thread affinity in openmp programs , 2008, MAW '08.

[7] J. M. Bull,et al. Measuring Synchronisation and Scheduling Overheads in OpenMP , 2007 .

[8] Bernd Mohr,et al. Design and Prototype of a Performance Tool Interface for OpenMP , 2002, The Journal of Supercomputing.

[9] Eduard Ayguadé,et al. Exploiting multiple levels of parallelism in OpenMP: a case study , 1999, Proceedings of the 1999 International Conference on Parallel Processing.

[10] Guansong Zhang. Extending the OpenMP Standard for Thread Mapping and Grouping , 2006, IWOMP.

[11] Wolfgang Schröder,et al. Numerical simulation of the flow field in a model of the nasal cavity , 2003 .