On the equivalence between total least squares and maximum likelihood PCA

The maximum likelihood PCA (MLPCA) method has been devised in chemometrics as a generalization of the well-known PCA method in order to derive consistent estimators in the presence of errors with known error distribution. For similar reasons, the total least squares (TLS) method has been generalized in the field of computational mathematics and engineering to maintain consistency of the parameter estimates in linear models with measurement errors of known distribution. The basic motivation for TLS is the following. Let a set of multidimensional data points (vectors) be given. How can one obtain a linear model that explains these data? The idea is to modify all data points in such a way that some norm of the modification is minimized subject to the constraint that the modified vectors satisfy a linear relation. Although the name “total least squares” appeared in the literature only 25 years ago, this method of fitting is certainly not new and has a long history in the statistical literature, where the method is known as “orthogonal regression”, “errors-in-variables regression” or “measurement error modeling”. The purpose of this paper is to explore the tight equivalences between MLPCA and element-wise weighted TLS (EW-TLS). Despite their seemingly different problem formulation, it is shown that both methods can be reduced to the same mathematical kernel problem, i.e. finding the closest (in a certain sense) weighted low rank matrix approximation where the weight is derived from the distribution of the errors in the data. Different solution approaches, as used in MLPCA and EW-TLS, are discussed. In particular, we will discuss the weighted low rank approximation (WLRA), the MLPCA, the EW-TLS and the generalized TLS (GTLS) problems. These four approaches tackle an equivalent weighted low rank approximation problem, but different algorithms are used to come up with the best approximation matrix. We will compare their computation times on chemical data and discuss their convergence behavior.

[1]  Sabine Van Huffel,et al.  The element-wise weighted total least-squares problem , 2006, Comput. Stat. Data Anal..

[2]  Robert E. Mahony,et al.  The geometry of weighted low-rank approximations , 2003, IEEE Trans. Signal Process..

[3]  Sabine Van Huffel,et al.  Consistency of elementwise-weighted total least squares estimator in a multivariate errors-in-variables model AX=B , 2004 .

[4]  James Durbin,et al.  Errors in variables , 1954 .

[5]  Sabine Van Huffel,et al.  Total least squares problem - computational aspects and analysis , 1991, Frontiers in applied mathematics.

[6]  I. Markovsky,et al.  Consistency of the structured total least squares estimator in a multivariate errors-in-variables model , 2005 .

[7]  Gene H. Golub,et al.  Some modified matrix eigenvalue problems , 1973, Milestones in Matrix Computation.

[8]  L. Gleser Estimation in a Multivariate "Errors in Variables" Regression Model: Large Sample Results , 1981 .

[9]  Jerry M. Mendel,et al.  The constrained total least squares technique and its applications to harmonic superresolution , 1991, IEEE Trans. Signal Process..

[10]  G. Golub,et al.  Regularized Total Least Squares Based on Quadratic Eigenvalue Problem Solvers , 2004 .

[11]  Sijmen de Jong,et al.  Regression coefficients in multilinear PLS , 1998 .

[12]  Robert L. Mason,et al.  A Comparison of Least Squares and Latent Root Regression Estimators , 1976 .

[13]  Darren T. Andrews,et al.  Maximum Likelihood Multivariate Calibration , 2022 .

[14]  Edward V. Thomas,et al.  Errors-in-variables estimation in multivariate calibration , 1991 .

[15]  Sudhir Gupta,et al.  Statistical Regression With Measurement Error , 1999, Technometrics.

[16]  Rik Pintelon,et al.  A Gauss-Newton-like optimization algorithm for "weighted" nonlinear least-squares problems , 1996, IEEE Trans. Signal Process..

[17]  Eric M. Dowling,et al.  The Data Least Squares Problem and Channel Equalization , 1993, IEEE Trans. Signal Process..

[18]  Gene H. Golub,et al.  An analysis of the total least squares problem , 1980, Milestones in Matrix Computation.

[19]  Maria Luisa Rastello,et al.  The Parametric Quadratic Form Method for Solving TLS Problems with Elementwise Weighting , 2002 .

[20]  S. Wold,et al.  The Collinearity Problem in Linear Regression. The Partial Least Squares (PLS) Approach to Generalized Inverses , 1984 .

[21]  Peter D. Wentzell,et al.  Maximum likelihood principal component analysis with correlated measurement errors: theoretical and practical considerations , 1999 .

[22]  R. J. Adcock Note on the Method of Least Squares , 1877 .

[23]  A. Phatak,et al.  The geometry of partial least squares , 1997 .

[24]  J. T. Webster,et al.  Latent Root Regression Analysis , 1974 .

[25]  S. Joe Qin,et al.  Consistent dynamic PCA based on errors-in-variables subspace identification , 2001 .

[26]  M. Peruggia Total Least Squares and Errors-in-Variables Modeling: Analysis, Algorithms and Applications , 2003 .

[27]  Gene H. Golub,et al.  Regularization by Truncated Total Least Squares , 1997, SIAM J. Sci. Comput..

[28]  M. Forina,et al.  Multivariate calibration. , 2007, Journal of chromatography. A.

[29]  S. Huffel,et al.  Total Least Squares and Errors-in-Variables Modeling : Analysis, Algorithms and Applications , 2002 .

[30]  Darren T. Andrews,et al.  Maximum likelihood principal component analysis , 1997 .

[31]  Sabine Van Huffel,et al.  Fast regularized structured total least squares algorithm for solving the basic deconvolution problem , 2005, Numer. Linear Algebra Appl..

[32]  R. J. Adcock A Problem in Least Squares , 1878 .

[33]  Alison J. Burnham,et al.  Frameworks for latent variable multivariate regression , 1996 .

[34]  K. S. Arun,et al.  A Unitarily Constrained Total Least Squares Problem in Signal Processing , 1992, SIAM J. Matrix Anal. Appl..

[35]  J. Vandewalle,et al.  Analysis and properties of the generalized total least squares problem AX≈B when some or all columns in A are subject to error , 1989 .

[36]  Sabine Van Huffel,et al.  Recent advances in total least squares techniques and errors-in-variables modeling , 1997 .

[37]  S. D. Jong PLS fits closer than PCR , 1993 .