A More Scalable and Efficient Parallelization of the Adaptive Integral Method—Part I: Algorithm

A more effective parallelization of the adaptive integral method (AIM) is proposed for solving the 3-D volume electric field integral equation. The AIM computations are distributed among processes by employing two workload decomposition strategies: 1) a novel 2-D pencil decomposition of the auxiliary regular grid is used to parallelize the anterpolation, interpolation, and 3-D FFT-accelerated propagation steps; and 2) a balanced 3-D block decomposition of the scattering volume is used to parallelize the correction step. The scalability of the proposed parallelization method is investigated theoretically. To compare it with competing methods, the concepts of resource constraints, parallel efficiency constraints, scalability limits, and acceptable parallelization regions are introduced by using N-P plots, where N and P denote the number of unknowns and processes, respectively. Using these concepts, it is shown that the proposed parallelization of AIM has better weak and strong scalability compared to the traditional ones, i.e., it enables not only larger problems to be solved but also faster solution of a given problem by increasing P.

[1]  L. Gurel,et al.  Efficient Parallelization of the Multilevel Fast Multipole Algorithm for the Solution of Large-Scale Scattering Problems , 2008, IEEE Transactions on Antennas and Propagation.

[2]  S. Velamparambil,et al.  10 million unknowns: is it that big? [computational electromagnetics] , 2003, IEEE Antennas and Propagation Magazine.

[3]  Alan H. Karp,et al.  Measuring parallel processor performance , 1990, CACM.

[4]  Samuel Williams,et al.  Optimization of sparse matrix-vector multiplication on emerging multicore platforms , 2009, Parallel Comput..

[5]  Ali E. Yilmaz,et al.  A hybrid message passing/shared memory parallelization of the adaptive integral method for multi-core clusters , 2011, Parallel Comput..

[6]  Ali E. Yilmaz,et al.  A 2-D decomposition based parallelization of AIM for 3-D BIOEM problems , 2011, 2011 IEEE International Symposium on Antennas and Propagation (APSURSI).

[7]  A. Brandt Multilevel computations of integral transforms and particle interactions with oscillatory kernels , 1991 .

[8]  Chris H. Q. Ding,et al.  A Portable 3D FFT Package for Distributed-Memory Parallel Architectures , 1995, PPSC.

[9]  Qing Huo Liu,et al.  A volume adaptive integral method (VAIM) for 3-D inhomogeneous objects , 2002 .

[10]  Ali E. Yilmaz,et al.  Truncated multigrid versus pre-corrected FFT/AIM for bioelectromagnetics: When is O(N) better than O(NlogN)? , 2011, CEM'11 Computational Electromagnetics International Workshop.

[11]  Jan Fostier,et al.  An Open-Source Implementation for Full-Wave 2D Scattering by Million-Wavelength-Size Objects , 2010, IEEE Antennas and Propagation Magazine.

[12]  Vipin Kumar,et al.  Parallel Multilevel series k-Way Partitioning Scheme for Irregular Graphs , 1999, SIAM Rev..

[13]  Ali E. Yilmaz,et al.  A More Scalable and Efficient Parallelization of the Adaptive Integral Method—Part II: BIOEM Application , 2014, IEEE Transactions on Antennas and Propagation.

[14]  Boris Livshitz,et al.  FastMag: Fast micromagnetic simulator for complex magnetic structures , 2011 .

[15]  J. White,et al.  A parallel precorrected FFT based capacitance extraction program for signal integrity analysis , 1996, 33rd Design Automation Conference Proceedings, 1996.

[16]  Caicheng Lu A fast algorithm based on volume integral equation for analysis of arbitrarily shaped dielectric radomes , 2003 .

[17]  Brendan Vastenhouw,et al.  A Two-Dimensional Data Distribution Method for Parallel Sparse Matrix-Vector Multiplication , 2005, SIAM Rev..

[18]  W. Chew,et al.  A coupled surface-volume integral equation approach for the calculation of electromagnetic scattering from composite metallic and material targets , 2000 .

[19]  D. Geer,et al.  Chip makers turn to multicore processors , 2005, Computer.

[20]  M. Bleszynski,et al.  AIM: Adaptive integral method for solving large‐scale electromagnetic scattering and radiation problems , 1996 .

[21]  Mark D. Hill,et al.  What is scalability? , 1990, CARN.

[22]  Robert S. Germain,et al.  Scalable framework for 3D FFTs on the Blue Gene/L supercomputer: Implementation and early performance measurements , 2005, IBM J. Res. Dev..

[23]  B. Shanker,et al.  A Scalable Parallel Wideband MLFMA for Efficient Electromagnetic Simulations on Large Scale Clusters , 2011, IEEE Transactions on Antennas and Propagation.

[24]  D. Wilton,et al.  A tetrahedral modeling method for electromagnetic scattering by arbitrarily shaped inhomogeneous dielectric bodies , 1984 .

[25]  Vipin Kumar,et al.  Parallel Multilevel k-way Partitioning Scheme for Irregular Graphs , 1996, Proceedings of the 1996 ACM/IEEE Conference on Supercomputing.

[26]  G. Amdhal,et al.  Validity of the single processor approach to achieving large scale computing capabilities , 1967, AFIPS '67 (Spring).

[27]  Jiming Song,et al.  Multilevel fast‐multipole algorithm for solving combined field integral equations of electromagnetic scattering , 1995 .

[28]  K. Sertel,et al.  Multilevel fast multipole method solution of volume integral equations using parametric geometry modeling , 2001, IEEE Transactions on Antennas and Propagation.

[29]  E. Michielssen,et al.  Time domain adaptive integral method for surface integral equations , 2004, IEEE Transactions on Antennas and Propagation.

[30]  Ramani Duraiswami,et al.  Fast multipole methods on graphics processors , 2008, J. Comput. Phys..

[31]  Luis Landesa,et al.  MLFMA-FFT PARALLEL ALGORITHM FOR THE SO- LUTION OF LARGE-SCALE PROBLEMS IN ELECTRO- MAGNETICS (INVITED PAPER) , 2010 .

[32]  W.C. Chew,et al.  A portable parallel multilevel fast multipole solver for scattering from perfectly conducting bodies , 1999, IEEE Antennas and Propagation Society International Symposium. 1999 Digest. Held in conjunction with: USNC/URSI National Radio Science Meeting (Cat. No.99CH37010).

[33]  Jacob K. White,et al.  A precorrected-FFT method for electrostatic analysis of complicated 3-D structures , 1997, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..

[34]  Ning Li,et al.  2DECOMP&FFT - A Highly Scalable 2D Decomposition Library and FFT Interface , 2010 .

[35]  George Karypis,et al.  Multilevel k-way Partitioning Scheme for Irregular Graphs , 1998, J. Parallel Distributed Comput..

[36]  Kai Yang,et al.  A Three-Dimensional Adaptive Integral Method for Scattering From Structures Embedded in Layered Media , 2012, IEEE Transactions on Geoscience and Remote Sensing.

[37]  Tat Soon Yeo,et al.  Precorrected-FFT solution of the volume Integral equation for 3-D inhomogeneous dielectric objects , 2005 .

[38]  John L. Gustafson,et al.  Reevaluating Amdahl's law , 1988, CACM.

[39]  John D. Owens,et al.  GPU Computing , 2008, Proceedings of the IEEE.