Regression of a data matrix on descriptors of both its rows and of its columns via latent variables: L-PLSR

Abstract A new approach is described, for extracting and visualising structures in a data matrix Y in light of additional information BOTH about the ROWS in Y, given in matrix X, AND about the COLUMNS in Y, given in matrix Z. The three matrices Z–Y–X may be envisioned as an “L-shape”; X(I×K) and Z(J×L) share no matrix size dimension, but are connected via Y(I×J). A few linear combinations (components) are extracted from X and from Z, and their interactions are used for bi-linear modelling of Y, as well as for bi-linear modelling of X and Z themselves. The components are defined by singular value decomposition (SVD) of X′YZ. Two versions of the L-PLSR are described—using one single SVD for all components, or component-wise SVDs after deflation. The method is applied to the analysis of consumer liking data Y of six products assessed by 125 persons, in light of 10 other product descriptors X and 15 other person descriptors Z. Its performance is also checked on artificial data.

[1]  Lars Eldén,et al.  Partial least-squares vs. Lanczos bidiagonalization - I: analysis of a projection method for multiple regression , 2004, Comput. Stat. Data Anal..

[2]  Herman Wold,et al.  Soft modelling: The Basic Design and Some Extensions , 1982 .

[3]  Danh V. Nguyen,et al.  On partial least squares dimension reduction for microarray-based classification: a simulation study , 2004, Comput. Stat. Data Anal..

[4]  Anette Kistrup Thybo,et al.  The influence of sensory and physiochemical quality on Danish children's preferences for apples , 2001 .

[5]  Frank Westad,et al.  Gender specific preferences and attitudes towards meat , 2002 .

[6]  Anette Kistrup Thybo,et al.  Explaining Danish children's preferences for apples using instrumental, sensory and demographic/behavioural data , 2004 .

[7]  Charles K. Bayne,et al.  Multivariate Analysis of Quality. An Introduction , 2001 .

[8]  Danh V. Nguyena,et al.  On partial least squares dimension reduction for microarray-based classi'cation: a simulation study , 2004 .

[9]  ScienceDirect Computational statistics & data analysis , 1983 .

[10]  H. Martens,et al.  Multivariate analysis of quality , 2000 .

[11]  Harald Martens,et al.  Reliable and relevant modelling of real world data: a personal account of the development of PLS Regression , 2001 .

[12]  A. Höskuldsson PLS regression methods , 1988 .

[13]  Fred L. Bookstein,et al.  Exploiting Redundant Measurement of Dose and Developmental Outcome: New Methods from the Behavioral Teratology of Alcohol. , 1996 .

[14]  Agnar Höskuldsson,et al.  Causal and path modelling , 2001 .

[15]  S. Wold,et al.  The multivariate calibration problem in chemistry solved by the PLS method , 1983 .