Efficient Dynamic Latent Variable Analysis for High-Dimensional Time Series Data

Dynamic-inner canonical correlation analysis (DiCCA) extracts dynamic latent variables from high-dimensional time series data with a descending order of predictability in terms of $R^2$. The reduced dimensional latent variables with rank-ordered predictability capture the dynamic features in the data, leading to easy interpretation and visualization. In this article, numerically efficient algorithms for DiCCA are developed to extract dynamic latent components from high-dimensional time series data. The numerically improved DiCCA algorithms avoid repeatedly inverting a covariance matrix inside the iteration loop of the numerical DiCCA algorithms. A further improvement using singular value decomposition converts the generalized eigenvector problem into a standard eigenvector problem for the DiCCA solution. Another improvement in model efficiency in this article is the dynamic model compaction of the extracted latent scores using autoregressive integrated moving average (ARIMA) models. Integrating factors, if existed in the latent variable scores, are made explicit in the ARIMA models. Numerical tests on two industrial datasets are provided to illustrate the improvements.

[1]  S. Joe Qin,et al.  Regression on dynamic PLS structures for supervised learning of dynamic data , 2018, Journal of Process Control.

[2]  Donghua Zhou,et al.  Dynamic latent variable modeling for statistical process monitoring , 2011 .

[3]  S. Joe Qin,et al.  A novel dynamic PCA algorithm for dynamic data modeling and process monitoring , 2017 .

[4]  S. Joe Qin,et al.  Statistical process monitoring: basics and beyond , 2003 .

[5]  Asuman E. Ozdaglar,et al.  Convergence rate of block-coordinate maximization Burer–Monteiro method for solving large SDPs , 2018, Mathematical Programming.

[6]  S. Joe Qin,et al.  Dynamic-Inner Canonical Correlation and Causality Analysis for High Dimensional Time Series Data , 2018 .

[7]  George Athanasopoulos,et al.  Forecasting: principles and practice , 2013 .

[8]  H. Akaike Canonical Correlation Analysis of Time Series and the Use of an Information Criterion , 1976 .

[9]  H. Bourlard,et al.  Auto-association by multilayer perceptrons and singular value decomposition , 1988, Biological Cybernetics.

[10]  Pablo Basanta-Val,et al.  An Efficient Industrial Big-Data Engine , 2018, IEEE Transactions on Industrial Informatics.

[11]  S. Qin,et al.  Detection and identification of faulty sensors in dynamic processes , 2001 .

[12]  Victor M. Zavala,et al.  On the Convergence of the Dynamic Inner PCA Algorithm , 2020, ArXiv.

[13]  A. Basilevsky,et al.  Karhunen-Loeve analysis of historical time series with an application to plantation births in Jamaica. , 1979, Journal of the American Statistical Association.

[14]  Nathalie Japkowicz,et al.  Nonlinear Autoassociation Is Not Equivalent to PCA , 2000, Neural Computation.

[15]  Anima Anandkumar,et al.  Tensor decompositions for learning latent variable models , 2012, J. Mach. Learn. Res..

[16]  Zhihuan Song,et al.  Autoregressive Dynamic Latent Variable Models for Process Monitoring , 2017, IEEE Transactions on Control Systems Technology.

[17]  Tiago J. Rato,et al.  Defining the structure of DPCA models and its impact on process monitoring and prediction activities , 2013 .

[18]  Thomas J. McAvoy,et al.  Nonlinear PLS Modeling Using Neural Networks , 1992 .

[19]  S. Joe Qin,et al.  DiCCA with Discrete-Fourier Transforms for Power System Events Detection and Localization , 2018 .

[20]  Lennart Ljung,et al.  System Identification: Theory for the User , 1987 .

[21]  Mohammad Soleymani,et al.  A Multimodal Database for Affect Recognition and Implicit Tagging , 2012, IEEE Transactions on Affective Computing.

[22]  R. Shumway,et al.  AN APPROACH TO TIME SERIES SMOOTHING AND FORECASTING USING THE EM ALGORITHM , 1982 .

[23]  Zhiqiang Ge,et al.  Data‐based linear Gaussian state‐space model for dynamic process monitoring , 2012 .

[24]  Wallace E. Larimore,et al.  Statistical optimality and canonical variate analysis system identification , 1996, Signal Process..

[25]  Christos Georgakis,et al.  Disturbance detection and isolation by dynamic principal component analysis , 1995 .

[26]  Donghua Zhou,et al.  A New Method of Dynamic Latent-Variable Modeling for Process Monitoring , 2014, IEEE Transactions on Industrial Electronics.

[27]  M. Kramer Nonlinear principal component analysis using autoassociative neural networks , 1991 .

[28]  Si-Zhao Joe Qin,et al.  Dynamic latent variable analytics for process operations and control , 2017, Comput. Chem. Eng..

[29]  M. Kulahci,et al.  On the structure of dynamic principal component analysis used in statistical process monitoring , 2017 .

[30]  J. Edward Jackson,et al.  A User's Guide to Principal Components. , 1991 .