Computational Co-design of a Multiscale Plasma Application: A Process and Initial Results

As computer architectures become increasingly heterogeneous the need for algorithms and applications that can exploit these new architectures grows more pressing. This paper demonstrates that co-designing a multi-architecture, multi-scale, highly optimized framework with its associated plasma-physics application can provide both portability across CPUs and accelerators and high performance. Our framework utilizes multiple abstraction layers in order to maximize code reuse between architectures while providing low-level abstractions to incorporate architecture-specific optimizations such as vectorization or hardware fused multiply-add. We describe a co-design process used to enable a plasma physics application to scale well to large systems while also improving on both the accuracy and speed of the simulations. Optimized multi-core results will be presented to demonstrate ability to isolate large amounts of computational work with minimal communication.

[1]  William Daughton,et al.  Advances in petascale kinetic plasma simulation with VPIC and Roadrunner , 2009 .

[2]  J. Payne Implementation and performance evaluation of a GPU particle-in-cell code , 2012 .

[3]  Aaftab Munshi,et al.  The OpenCL specification , 2009, 2009 IEEE Hot Chips 21 Symposium (HCS).

[4]  Luis Chacón,et al.  An energy- and charge-conserving, implicit, electrostatic particle-in-cell algorithm , 2011, J. Comput. Phys..

[5]  J. Brackbill,et al.  An implicit particle-in-cell method for granular materials , 2002 .

[6]  Xianglong Kong,et al.  Preliminary results on GPU Acceleration of the PIC Simulation Code OSIRIS Using CUDA , 2009 .

[7]  R W Hockney,et al.  Computer Simulation Using Particles , 1966 .

[8]  Luis Chacón,et al.  An efficient mixed-precision, hybrid CPU-GPU implementation of a nonlinearly implicit one-dimensional particle-in-cell algorithm , 2011, J. Comput. Phys..

[9]  Scott Pakin Compiler-based application analysis , 2012 .

[10]  H Burau,et al.  PIConGPU: A Fully Relativistic Particle-in-Cell Code for a GPU Cluster , 2010, IEEE Transactions on Plasma Science.

[11]  Dana A Knoll Plasma Paradigm Shift Document for the CoCoMANS Project , 2013 .

[12]  Tamara G. Kolda,et al.  An overview of the Trilinos project , 2005, TOMS.

[13]  C. Birdsall,et al.  Plasma Physics via Computer Simulation , 2018 .

[14]  Luis Chacón,et al.  Development of a Consistent and Stable Fully Implicit Moment Method for Vlasov-Ampère Particle in Cell (PIC) System , 2013, SIAM J. Sci. Comput..

[15]  L Chacón,et al.  Flux pileup in collisionless magnetic reconnection: bursty interaction of large flux ropes. , 2011, Physical review letters.