GPU parallelization of an object-oriented nonlinear dynamic structural analysis platform

Abstract This work parallelized a widely used structural analysis platform called OpenSees using graphical processing units (GPU). This paper presents task decomposition diagrams with data flow and the sequential and parallel flowcharts for element matrix/vector calculations. It introduces a Bulk Model to ease the parallelization of the element matrix/vector calculations. An implementation of this model for shell elements is presented. Three versions of the Bulk Model—sequential, OpenMP multi-threaded, and CUDA GPU parallelized—were implemented in this work. Nonlinear dynamic analyses of two building models subjected to a tri-axial earthquake were tested. The results demonstrate speedups higher than four on a 4-core system, while the GPU parallelism achieves speedups higher than 7.6 on a single GPU device in comparison to the original sequential implementation.

[1]  Shuenn-Yih Chang,et al.  A family of explicit algorithms for general pseudodynamic testing , 2011 .

[2]  Michael H. Scott,et al.  Nonlinear Finite-Element Analysis Software Architecture Using Object Composition , 2010, J. Comput. Civ. Eng..

[3]  Marcelo J. Vénere,et al.  A Lattice-Boltzmann solver for 3D fluid simulation on GPU , 2012, Simul. Model. Pract. Theory.

[4]  Yuan-Sen Yang,et al.  ISEE: Internet‐based Simulation for Earthquake Engineering—Part I: Database approach , 2007 .

[5]  Gregory L. Fenves,et al.  Object-oriented finite element programming: frameworks for analysis, algorithms and parallel computing , 1997 .

[6]  Fatemeh Jalayer,et al.  Alternative non‐linear demand estimation methods for probability‐based seismic assessments , 2009 .

[7]  Gordon Erlebacher,et al.  Porting a high-order finite-element earthquake modeling application to NVIDIA graphics cards using CUDA , 2009, J. Parallel Distributed Comput..

[8]  Amr S. Elnashai,et al.  An online optimization method for bridge dynamic hybrid simulations , 2012, Simul. Model. Pract. Theory.

[9]  R. Cook,et al.  Concepts and Applications of Finite Element Analysis , 1974 .

[10]  Gregory L. Fenves,et al.  Software framework for distributed experimental–computational simulation of structural systems , 2006 .

[11]  Timothy A. Davis,et al.  Algorithm 832: UMFPACK V4.3---an unsymmetric-pattern multifrontal method , 2004, TOMS.

[12]  Shang-Hsien Hsieh,et al.  Improving Parallel Substructuring Efficiency by Using a Multilevel Approach , 2012, J. Comput. Civ. Eng..

[13]  Dimitrios Vamvatsikos,et al.  Incremental dynamic analysis , 2002 .

[14]  Xiaoye S. Li,et al.  An overview of SuperLU: Algorithms, implementation, and user interface , 2003, TOMS.

[15]  Yi-Lung Mo,et al.  Unified Theory of Concrete Structures , 2010 .

[16]  Makoto Ohsaki,et al.  High‐precision finite element analysis of elastoplastic dynamic responses of super‐high‐rise steel frames , 2009 .

[17]  L. Dagum,et al.  OpenMP: an industry standard API for shared-memory programming , 1998 .