Hybrid MIC/CPU Parallel Implementation of MoM on MIC Cluster for Electromagnetic Problems

In this paper, a Many Integrated Core Architecture (MIC) accelerated parallel method of moment (MoM) algorithm is proposed to solve electromagnetic problems in practical applications, where MIC means a kind of coprocessor or accelerator in computer systems which is used to accelerate the computation performed by Central Processing Unit (CPU). Three critical points are introduced in this paper in detail. The first one is the design of the parallel framework, which ensures that the algorithm can run on distributed memory platform with multiple nodes. The hybrid Message Passing Interface (MPI) and Open Multi-Processing (OpenMP) programming model is designed to achieve the purposes. The second one is the out-of-core algorithm, which greatly breaks the restriction of MIC memory. The third one is the pipeline algorithm which overlaps the data movement with MIC computation. The pipeline algorithm successfully hides the communication and thus greatly enhances the performance of hybrid MIC/CPU MoM. Numerical result indicates that the proposed algorithm has good parallel efficiency and scalability, and twice faster performance when compared with the corresponding CPU algorithm. key words: MIC accelerating MoM, MPI and OpenMP parallel programming mode, multiple nodes, out-of-core, pipeline

[1]  Wei Hong,et al.  Higher Order Method of Moments With a Parallel Out-of-Core LU Solver on GPU/CPU Platform , 2014, IEEE Transactions on Antennas and Propagation.

[2]  Ying Yan,et al.  RCS computation of engine by parallel higher-order MoM with out-of- core technique , 2013 .

[3]  Yan Chen,et al.  MIC accelerated LU decomposition for method of moments , 2015, 2015 IEEE International Symposium on Antennas and Propagation & USNC/URSI National Radio Science Meeting.

[4]  Lixin Guo,et al.  EM Scattering From a Target Above a 1-D Randomly Rough Sea Surface Using GPU-Based Parallel FDTD , 2015, IEEE Antennas and Wireless Propagation Letters.

[5]  Barbara Chapman,et al.  Using OpenMP - portable shared memory parallel programming , 2007, Scientific and engineering computation.

[6]  Roger F. Harrington,et al.  Field computation by moment methods , 1968 .

[7]  D. Wilton,et al.  Electromagnetic scattering by surfaces of arbitrary shape , 1980 .

[8]  Yu Zhang,et al.  RCS Computation by Parallel MoM Using Higher-Order Basis Functions , 2012 .

[9]  Robert A. van de Geijn,et al.  Parallel Solution of Integral Equation-Based EM Problems in the Frequency Domain , 2009 .

[10]  Zhongchao Lin,et al.  An Optimized Parallel FDTD Topology for Challenging Electromagnetic Simulations on Supercomputers , 2015 .

[11]  James Reinders,et al.  Intel Xeon Phi Coprocessor High Performance Programming , 2013 .

[12]  Robert M. Farber,et al.  CUDA Application Design and Development , 2011 .

[13]  Tapan K. Sarkar,et al.  Performance of a Massively Parallel Higher-Order Method of Moments Code Using Thousands of CPUs and Its Applications , 2014, IEEE Transactions on Antennas and Propagation.