A Probabilistic Derivation of the Partial Least-Squares Algorithm

Traditionally the partial least-squares (PLS) algorithm, commonly used in chemistry for ill-conditioned multivariate linear regression, has been derived (motivated) and presented in terms of data matrices. In this work the PLS algorithm is derived probabilistically in terms of stochastic variables where sample estimates calculated using data matrices are employed at the end. The derivation, which offers a probabilistic motivation to each step of the PLS algorithm, is performed for the general multiresponse case and without reference to any latent variable model of the response variable and also without any so-called "inner relation". On the basis of the derivation, some theoretical issues of the PLS algorithm are briefly considered: the complexity of the original motivation of PLS regression which involves an "inner relation"; the original motivation behind the prediction stage of the PLS algorithm; the relationship between uncorrelated and orthogonal latent variables; the limited possibilities to make natural interpretations of the latent variables extracted.

[1]  Svante Wold Discussion: PLS in Chemical Practice , 1993 .

[2]  Alison J. Burnham,et al.  Frameworks for latent variable multivariate regression , 1996 .

[3]  I. Helland ON THE STRUCTURE OF PARTIAL LEAST SQUARES REGRESSION , 1988 .

[4]  Agnar Höskuldsson,et al.  A combined theory for PCA and PLS , 1995 .

[5]  R. Manne Analysis of two partial-least-squares algorithms for multivariate calibration , 1987 .

[6]  J. Friedman,et al.  [A Statistical View of Some Chemometrics Regression Tools]: Response , 1993 .

[7]  Bhupinder S. Dayal,et al.  Improved PLS algorithms , 1997 .

[8]  Christian Jutten,et al.  Blind separation of sources, part I: An adaptive algorithm based on neuromimetic architecture , 1991, Signal Process..

[9]  S. Wold,et al.  The Collinearity Problem in Linear Regression. The Partial Least Squares (PLS) Approach to Generalized Inverses , 1984 .

[10]  M. Stone Continuum regression: Cross-validated sequentially constructed prediction embracing ordinary least s , 1990 .

[11]  Olav M. Kvalheim,et al.  Latent-structure decompositions (projections) of multivariate data , 1987 .

[12]  Thomas J. McAvoy,et al.  Nonlinear PLS Modeling Using Neural Networks , 1992 .

[13]  J. Freischlag,et al.  Response , 1994, Science.

[14]  Petre Stoica,et al.  Partial Least Squares: A First‐order Analysis , 1998 .

[15]  S. Wold,et al.  The multivariate calibration problem in chemistry solved by the PLS method , 1983 .

[16]  Terrence J. Sejnowski,et al.  An Information-Maximization Approach to Blind Separation and Blind Deconvolution , 1995, Neural Computation.

[17]  Hagai Attias,et al.  Independent Factor Analysis , 1999, Neural Computation.

[18]  William J. Egan,et al.  Measurement of Carboxyhemoglobin in Forensic Blood Samples Using UV-Visible Spectrometry and Improved Principal Component Regression , 1999 .

[19]  A. Höskuldsson PLS regression methods , 1988 .

[20]  S. D. Jong SIMPLS: an alternative approach to partial least squares regression , 1993 .

[21]  L. E. Wangen,et al.  A theoretical foundation for the PLS algorithm , 1987 .

[22]  B. Kowalski,et al.  Partial least-squares regression: a tutorial , 1986 .

[23]  O. Kvalheim Latent Variable , 1992, The SAGE Encyclopedia of Research Design.

[24]  S. Wold,et al.  PLS: Partial Least Squares Projections to Latent Structures , 1993 .

[25]  J. Friedman,et al.  A Statistical View of Some Chemometrics Regression Tools , 1993 .

[26]  S. Wold,et al.  Nonlinear PLS modeling , 1989 .

[27]  Edward C. Malthouse,et al.  Nonlinear partial least squares , 1997 .

[28]  A. J. Morris,et al.  Non-linear projection to latent structures revisited (the neural network PLS algorithm) , 1999 .

[29]  J. Mandel Use of the Singular Value Decomposition in Regression Analysis , 1982 .