GPU-Accelerated Batch-ACPF Solution for N-1 Static Security Analysis

Graphics processing unit (GPU) has been applied successfully in many scientific computing realms due to its superior performances on float-pointing calculation and memory bandwidth, and has great potential in power system applications. The N-1 static security analysis (SSA) appears to be a candidate application in which massive alternating current power flow (ACPF) problems need to be solved. However, when applying existing GPU-accelerated algorithms to solve N-1 SSA problem, the degree of parallelism is limited because existing researches have been devoted to accelerating the solution of a single ACPF. This paper therefore proposes a GPU-accelerated solution that creates an additional layer of parallelism among batch ACPFs and consequently achieves a much higher level of overall parallelism. First, this paper establishes two basic principles for determining well-designed GPU algorithms, through which the limitation of GPU-accelerated sequential-ACPF solution is demonstrated. Next, being the first of its kind, this paper proposes a novel GPU-accelerated batch-QR solver, which packages massive number of QR tasks to formulate a new larger-scale problem and then achieves higher level of parallelism and better coalesced memory accesses. To further improve the efficiency of solving SSA, a GPU-accelerated batch-Jacobian-Matrix generating and contingency screening is developed and carefully optimized. Lastly, the complete process of the proposed GPU-accelerated batch-ACPF solution for SSA is presented. Case studies on an 8503-bus system show dramatic computation time reduction is achieved compared with all reported existing GPU-accelerated methods. In comparison to UMFPACK-library-based single-CPU counterpart using Intel Xeon E5-2620, the proposed GPU-accelerated SSA framework using NVIDIA K20C achieves up to 57.6 times speedup. It can even achieve four times speedup when compared to one of the fastest multi-core CPU parallel computing solution using KLU library. The proposed batch-solving method is practically very promising and lays a critical foundation for many other power system applications that need to deal with massive subtasks, such as Monte-Carlo simulation and probabilistic power flow.

[1]  Anjan Bose,et al.  On-line power system security analysis , 1992, Proc. IEEE.

[2]  Erik Lindholm,et al.  NVIDIA Tesla: A Unified Graphics and Computing Architecture , 2008, IEEE Micro.

[3]  Fangxing Li,et al.  GPU-based power flow analysis with Chebyshev preconditioner and conjugate gradient method , 2014 .

[4]  Iain S. Duff,et al.  The Design and Use of Algorithms for Permuting Large Entries to the Diagonal of Sparse Matrices , 1999, SIAM J. Matrix Anal. Appl..

[5]  Timothy A. Davis,et al.  Algorithm 832: UMFPACK V4.3---an unsymmetric-pattern multifrontal method , 2004, TOMS.

[6]  Chen Deyan A Parallel Power Flow Algorithm for Large-scale Grid Based on Stratified Path Trees and Its Implementation on GPU , 2014 .

[7]  Fangxing Li,et al.  Exploration of multifrontal method with GPU in power flow computation , 2013, 2013 IEEE Power & Energy Society General Meeting.

[8]  Rui Bo,et al.  A novel GPU-accelerated strategy for contingency screening of static security analysis , 2016 .

[9]  Venkata Dinavahi,et al.  Large-Scale Transient Stability Simulation of Electrical Power Systems on Parallel GPUs , 2012, IEEE Transactions on Parallel and Distributed Systems.

[10]  Timothy A. Davis,et al.  Direct methods for sparse linear systems , 2006, Fundamentals of algorithms.

[11]  J. Pasciak,et al.  Computer solution of large sparse positive definite systems , 1982 .

[12]  Robert C. Green,et al.  Applications and Trends of High Performance Computing for Electric Power Systems: Focusing on Smart Grid , 2013, IEEE Transactions on Smart Grid.

[13]  O. Alsaç,et al.  DC Power Flow Revisited , 2009, IEEE Transactions on Power Systems.

[14]  J.D. McCalley,et al.  Multifrontal Solver for Online Power System Time-Domain Simulation , 2008, IEEE Transactions on Power Systems.

[15]  Olaf Schenk,et al.  Solving unsymmetric sparse systems of linear equations with PARDISO , 2002, Future Gener. Comput. Syst..

[16]  Wei Wu,et al.  An EScheduler-Based Data Dependence Analysis and Task Scheduling for Parallel Circuit Simulation , 2011, IEEE Transactions on Circuits and Systems II: Express Briefs.

[17]  Timothy A. Davis,et al.  Algorithm 836: COLAMD, a column approximate minimum degree ordering algorithm , 2004, TOMS.

[18]  Timothy A. Davis,et al.  Algorithm 907 , 2010 .