Parallel GPU Implementation of Iterative PCA Algorithms
暂无分享,去创建一个
Principal component analysis (PCA) is a key statistical technique for multivariate data analysis. For large data sets, the common approach to PCA computation is based on the standard NIPALS-PCA algorithm, which unfortunately suffers from loss of orthogonality, and therefore its applicability is usually limited to the estimation of the first few components. Here we present an algorithm based on Gram-Schmidt orthogonalization (called GS-PCA), which eliminates this shortcoming of NIPALS-PCA. Also, we discuss the GPU (Graphics Processing Unit) parallel implementation of both NIPALS-PCA and GS-PCA algorithms. The numerical results show that the GPU parallel optimized versions, based on CUBLAS (NVIDIA), are substantially faster (up to 12 times) than the CPU optimized versions based on CBLAS (GNU Scientific Library).
[1] J. E. Jackson. A User's Guide to Principal Components , 1991 .
[2] Paul Geladi,et al. Principal Component Analysis , 1987, Comprehensive Chemometrics.
[3] F. J. Lingen. Efficient Gram–Schmidt orthonormalisation on parallel computers , 2000 .
[4] Christopher C. Paige,et al. Loss and Recapture of Orthogonality in the Modified Gram-Schmidt Algorithm , 1992, SIAM J. Matrix Anal. Appl..