Hybrid MPI and OpenMP Parallel Programming

Most HPC systems are clusters of shared memory nodes. Such systems can be PC clusters with dual or quad boards, but also "constellation" type systems with large SMP nodes. Parallel programming must combine the distributed memory parallelization on the node interconnect with the shared memory parallelization inside each node.
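The two-level structure described above can be illustrated with a small sketch. This is only a structural analogy written with Python's standard library, not MPI or OpenMP code: the names `split`, `rank_main`, and `hybrid_sum` are hypothetical, the simulated "ranks" are ordinary threads that each receive a private copy of their slice, and a `queue.Queue` stands in for message passing over the node interconnect. A real hybrid program would use MPI calls between nodes and OpenMP directives inside each node.

```python
import queue
import threading
from concurrent.futures import ThreadPoolExecutor


def split(seq, parts):
    """Split seq into `parts` contiguous chunks of near-equal size."""
    base, extra = divmod(len(seq), parts)
    chunks, start = [], 0
    for i in range(parts):
        end = start + base + (1 if i < extra else 0)
        chunks.append(seq[start:end])
        start = end
    return chunks


def rank_main(rank, local_data, n_threads, outbox):
    """One simulated rank: owns a private slice and uses threads inside it."""
    # Inner level (shared memory, OpenMP-like): all threads of this rank
    # read the same local_data, and each one sums its own chunk.
    with ThreadPoolExecutor(max_workers=n_threads) as pool:
        partials = list(pool.map(sum, split(local_data, n_threads)))
    # Outer level (distributed memory, MPI-like): the rank's result leaves
    # the rank only as an explicit message.
    outbox.put((rank, sum(partials)))


def hybrid_sum(data, n_ranks=2, n_threads=4):
    """Sum `data` with a two-level decomposition: ranks, then threads."""
    outbox = queue.Queue()
    # Distribute: every rank gets its own copy of one slice, so no data
    # is shared between ranks (assumption: this mimics separate nodes).
    slices = [list(chunk) for chunk in split(data, n_ranks)]
    ranks = [
        threading.Thread(target=rank_main,
                         args=(r, slices[r], n_threads, outbox))
        for r in range(n_ranks)
    ]
    for t in ranks:
        t.start()
    for t in ranks:
        t.join()
    # Reduce: the "root" collects one message per rank and combines them.
    return sum(outbox.get()[1] for _ in range(n_ranks))


if __name__ == "__main__":
    print(hybrid_sum(list(range(1000)), n_ranks=2, n_threads=4))
```

The sketch shows the shape of the decomposition rather than any speedup: in standard CPython builds, threads do not execute Python bytecode in parallel, so the point is only which data is private to a rank, which is shared among its threads, and where explicit communication happens.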
