Parallel FEM Adaptation on Hierarchical Architectures

The parallel FEM package NuscaS allows us to solve adaptive FEM problems with 3D unstructured meshes on distributed-memory parallel computers such as PC-clusters. In our previous works, a new method for parallelizing the FEM adaptation was presented, based on using the 8-tetrahedra longest-edge partition. This method relies on a decentralized approach, and is more scalable in comparison to previous implementations requiring a centralized synchronizing node. At present nodes of clusters contain more and more processing cores. Their efficient utilization is crucial for providing high performance of numerical codes. In this paper, different schemes of mapping the mesh adaptation algorithm on such hierchical architectures are presented and compared. These schemes use either the pure message-passing model, or the hybrid approach which combines shared-memory and message-passing models. Also, we investigate an approach for adapting the pure MPI model to hierarchical topology of clusters with multi-core nodes.

[1]  Roman Wyrzykowski,et al.  FEM Computations on Clusters Using Different Models of Parallel Programming , 2001, PPAM.

[2]  Angel Plaza,et al.  Mesh Refinement Based on the 8-Tetrahedra Longest- Edge Partition , 2003, IMR.

[3]  Roman Wyrzykowski,et al.  Performance of different communication mechanisms for FEM computations on PC-based cluster with SMP nodes , 2002, Proceedings. International Conference on Parallel Computing in Electrical Engineering.

[4]  Ali E. Yilmaz,et al.  A hybrid message passing/shared memory parallelization of the adaptive integral method for multi-core clusters , 2011, Parallel Comput..

[5]  Roman Wyrzykowski,et al.  Object-Oriented Approach to Finite Element Modeling on Clusters , 2000, PARA.

[6]  Roman Wyrzykowski,et al.  Adaptive FEM Package with Decentralized Parallel Adaptation of Tetrahedral Meshes , 2011, LSSC.

[7]  David W. Walker,et al.  Performance analysis of a hybrid MPI/OpenMP application on multi-core clusters , 2010, J. Comput. Sci..

[8]  Jack Dongarra,et al.  Recent Advances in Parallel Virtual Machine and Message Passing Interface, 15th European PVM/MPI Users' Group Meeting, Dublin, Ireland, September 7-10, 2008. Proceedings , 2008, PVM/MPI.

[9]  Guillaume Mercier,et al.  Towards an Efficient Process Placement Policy for MPI Applications in Multicore Environments , 2009, PVM/MPI.

[10]  Mehmet Balman,et al.  Tetrahedral mesh refinement in distributed environments , 2006, 2006 International Conference on Parallel Processing Workshops (ICPPW'06).

[11]  Salvatore Venticinque,et al.  Performance prediction through simulation of a hybrid MPI/OpenMP application , 2005, Parallel Comput..

[12]  Yuefan Deng,et al.  Applied Parallel Computing , 2012 .

[13]  Rupak Biswas,et al.  High performance computing using MPI and OpenMP on multi-core parallel systems , 2011, Parallel Comput..

[14]  Daniel Pizarro-Perez,et al.  Parallel Refinement of Tetrahedral Meshes Using Terminal-Edge Bisection Algorithm , 2004, IMR.

[15]  Pawel Gepner,et al.  Parallel Adaptive Finite Element Package with Dynamic Load Balancing for 3D Thermo-Mechanical Problems , 2009, PPAM.