Massive parallelization of SPICE device model evaluation on GPU-based SIMD architectures

Device model evaluation is one of the most time-consuming tasks in analog simulators such as SPICE. Graphics Processing Unit (GPU) architectures allow massive utilization of vector data on SIMD architectures. In this paper, the formulation of double precision device model equations into a form compatible with stream computing is presented. We show data on isolating typical bottlenecks, especially the communication and kernel call overheads. Our results indicate speedup of up to 20X when counting overheads, and up to 50X when using techniques to overcome these overheads. In particular, we show that our techniques are valid for small device counts, which is typically a well known problem for accelerated parallel computing with communications overheads.

[1]  P. Sadayappan,et al.  Efficient sparse matrix factorization for circuit simulation on vector supercomputers , 1989, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..

[2]  P. Sadayappan,et al.  Efficient Sparse Matrix Factorization for Circuit Simulation on Vector Supercomputers , 1989, 26th ACM/IEEE Design Automation Conference.

[3]  Ping Yang,et al.  Direct circuit simulation algorithms for parallel processing [VLSI] , 1991, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..

[4]  Lawrence Pillage,et al.  Electronic Circuit & System Simulation Methods (SRE) , 1998 .

[5]  Michael Garland,et al.  Sparse matrix computations on manycore GPU’s , 2008, 2008 45th ACM/IEEE Design Automation Conference.

[6]  Ronald A. Rohrer,et al.  Electronic Circuit and System Simulation Methods , 1994 .