Fast Circulant Tensor Power Method for High-Order Principal Component Analysis

To understand high-order intrinsic key patterns in high-dimensional data, tensor decomposition is a more versatile tool for data analysis than standard flat-view matrix models. Several existing tensor models aim to achieve rapid computation of high-order principal components based on the tensor power method. However, since a tensor power method does not enforce orthogonality in subsequently calculated decomposition components, it causes far more challenges on principal component analysis of high-order tensors. To address this problem, several tensor power method variant algorithms incorporating sparsity into decomposition factors have been proposed. However, because these variant algorithms require additional procedures based on data-driven hyper-parameter optimization algorithms, a trade-off between computational cost and convergence exists. In this paper, a novel tensor power method called the fast circulant tensor power method is proposed. The proposed algorithm combines tensor-train decomposition and the power method. Tensor-train decomposition is a high-order tensor decomposition method based on auxiliary unfolding matrix decomposition. Thus, the power method can be embedded into our methodology without any additional processes. Notably, a simple combination of these two methods may cause a local optima problem because the power method only guarantees convergence on each unfolding matrix in tensor-train decomposition. To solve this problem, the circulant updating method is proposed, which globally optimizes all factor vectors by reordering some steps of the factor vector updates. It is experimentally demonstrated that, compared to state-of-the-art tensor power method variant methodologies, the proposed algorithm achieves the lowest computational complexity and quantitatively good performance in various applications including large-scale color image decomposition and convolutional neural network compression.

[1]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Pierre Comon,et al.  Enhanced Line Search: A Novel Method to Accelerate PARAFAC , 2008, SIAM J. Matrix Anal. Appl..

[3]  Shinichi Nakajima,et al.  Global analytic solution of fully-observed variational Bayesian matrix factorization , 2013, J. Mach. Learn. Res..

[4]  Richard A. Harshman,et al.  Foundations of the PARAFAC procedure: Models and conditions for an "explanatory" multi-model factor analysis , 1970 .

[5]  Maja Pantic,et al.  TensorLy: Tensor Learning in Python , 2016, J. Mach. Learn. Res..

[6]  Liqun Qi,et al.  New ALS Methods With Extrapolating Search Directions and Optimal Step Size for Complex-Valued Tensor Decompositions , 2011, IEEE Transactions on Signal Processing.

[7]  Anima Anandkumar,et al.  Tensor decompositions for learning latent variable models , 2012, J. Mach. Learn. Res..

[8]  Ngai Wong,et al.  A Constructive Algorithm for Decomposing a Tensor into a Finite Sum of Orthonormal Rank-1 Terms , 2014, SIAM J. Matrix Anal. Appl..

[9]  Haiping Lu,et al.  A survey of multilinear subspace learning for tensor data , 2011, Pattern Recognit..

[10]  Hanqing Lu,et al.  Recent advances in efficient computation of deep convolutional neural networks , 2018, Frontiers of Information Technology & Electronic Engineering.

[11]  P. Comon,et al.  Tensor decompositions, alternating least squares and other tales , 2009 .

[12]  Anima Anandkumar,et al.  Analyzing Tensor Power Method Dynamics in Overcomplete Regime , 2014, J. Mach. Learn. Res..

[13]  Andrzej Cichocki,et al.  Fast Alternating LS Algorithms for High Order CANDECOMP/PARAFAC Tensor Factorizations , 2013, IEEE Transactions on Signal Processing.

[14]  Eunhyeok Park,et al.  Compression of Deep Convolutional Neural Networks for Fast and Low Power Mobile Applications , 2015, ICLR.

[15]  P. Paatero A weighted non-negative least squares algorithm for three-way ‘PARAFAC’ factor analysis , 1997 .

[16]  Nikos D. Sidiropoulos,et al.  Tensor Decomposition for Signal Processing and Machine Learning , 2016, IEEE Transactions on Signal Processing.

[17]  Alex Krizhevsky,et al.  Learning Multiple Layers of Features from Tiny Images , 2009 .

[18]  A. H. Bentbib,et al.  Block Power Method for SVD Decomposition , 2015 .

[19]  Andrzej Cichocki,et al.  Low Complexity Damped Gauss-Newton Algorithms for CANDECOMP/PARAFAC , 2012, SIAM J. Matrix Anal. Appl..

[20]  Morten Mørup,et al.  Applications of tensor (multiway array) factorizations and decompositions in data mining , 2011, WIREs Data Mining Knowl. Discov..

[21]  Han Liu,et al.  Provable sparse tensor decomposition , 2015, 1502.01425.

[22]  Seung-Ik Lee,et al.  CP-decomposition with Tensor Power Method for Convolutional Neural Networks compression , 2017, 2017 IEEE International Conference on Big Data and Smart Computing (BigComp).

[23]  Andrzej Cichocki,et al.  Nonnegative Matrix and Tensor Factorization T , 2007 .

[24]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[25]  Zbynek Koldovský,et al.  Simultaneous search for all modes in multilinear models , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[26]  Bülent Yener,et al.  Unsupervised Multiway Data Analysis: A Literature Survey , 2009, IEEE Transactions on Knowledge and Data Engineering.

[27]  A. D. de Almeida,et al.  High-order tensor estimation via trains of coupled third-order CP and Tucker decompositions , 2020, Linear Algebra and its Applications.

[28]  Andrzej Cichocki,et al.  Fast damped gauss-newton algorithm for sparse and nonnegative tensor factorization , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[29]  Pierre Comon,et al.  Handbook of Blind Source Separation: Independent Component Analysis and Applications , 2010 .

[30]  Ivan Oseledets,et al.  Tensor-Train Decomposition , 2011, SIAM J. Sci. Comput..

[31]  Martin J. Mohlenkamp Musings on multilinear fitting , 2013 .

[32]  B. Sturmfels,et al.  The number of eigenvalues of a tensor , 2010, 1004.4953.

[33]  Genevera I. Allen,et al.  Sparse Higher-Order Principal Components Analysis , 2012, AISTATS.

[34]  Jieun Lee,et al.  Bayesian Optimization-Based Global Optimal Rank Selection for Compression of Convolutional Neural Networks , 2020, IEEE Access.

[35]  Henk A. L. Kiers,et al.  A three–step algorithm for CANDECOMP/PARAFAC analysis of large data sets with multicollinearity , 1998 .

[36]  R. Mises,et al.  Praktische Verfahren der Gleichungsauflösung . , 1929 .

[37]  J. Chang,et al.  Analysis of individual differences in multidimensional scaling via an n-way generalization of “Eckart-Young” decomposition , 1970 .

[38]  J. Landsberg Tensors: Geometry and Applications , 2011 .

[39]  Ivan V. Oseledets,et al.  Speeding-up Convolutional Neural Networks Using Fine-tuned CP-Decomposition , 2014, ICLR.

[40]  Shuzhong Zhang,et al.  Maximum Block Improvement and Polynomial Optimization , 2012, SIAM J. Optim..

[41]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[42]  Andrzej Cichocki,et al.  CANDECOMP/PARAFAC Decomposition of High-Order Tensors Through Tensor Reshaping , 2012, IEEE Transactions on Signal Processing.

[43]  Zhi-Quan Luo,et al.  A Unified Convergence Analysis of Block Successive Minimization Methods for Nonsmooth Optimization , 2012, SIAM J. Optim..

[44]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[45]  Rasmus Bro,et al.  The N-way Toolbox for MATLAB , 2000 .

[46]  David E. Booth,et al.  Multi-Way Analysis: Applications in the Chemical Sciences , 2005, Technometrics.

[47]  B. Khoromskij Tensors-structured Numerical Methods in Scientific Computing: Survey on Recent Advances , 2012 .

[48]  Tamara G. Kolda,et al.  Tensor Decompositions and Applications , 2009, SIAM Rev..

[49]  Seung-Ik Lee,et al.  Rank selection of CP-decomposed convolutional layers with variational Bayesian matrix factorization , 2018, 2018 20th International Conference on Advanced Communication Technology (ICACT).

[50]  Andrzej Cichocki,et al.  Tensor Decompositions for Signal Processing Applications: From two-way to multiway component analysis , 2014, IEEE Signal Processing Magazine.