The informative converse paradox: Windows into the unknown

Abstract Model-based interpretation of empirical data is useful. But unanticipated phenomena (interferences) can give erroneous model parameter estimates, leading to wrong interpretation. However, for multi-channel data, interference phenomena may be discovered, described and corrected for, by analysis of the lack-of-fit residual table — although with a strange limitation, which is here termed the Informative Converse paradox: When a data table (rows × columns) is approximated by a linear model, and the model-fitting is done by row-wise regression, it means that only the column-wise interference information can be correctly obtained, and vice versa. These “windows into the unknown” are here explained mathematically. They are then applied to multi-channel mixture data — artificial simulations as well as spectral NIR powder measurements — to demonstrate discovery after incomplete row-wise curve fitting and column-wise multivariate regression. The analysis shows how the Informative Converse paradox is the basis for selectivity enhancement in multivariate calibration. Data-driven model expansion for statistical multi-response analyses (ANOVA, N-way models etc.) is proposed.

[1]  B. Kowalski,et al.  Multivariate curve resolution applied to spectral data from multiple runs of an industrial process , 1993 .

[2]  E. J. Lowe,et al.  The four-category ontology : a metaphysical foundation for natural science , 2006 .

[3]  H. Kaiser The varimax criterion for analytic rotation in factor analysis , 1958 .

[4]  Harald Martens,et al.  Factor analysis of chemical mixtures , 1979 .

[5]  Harald Martens,et al.  Regression of a data matrix on descriptors of both its rows and of its columns via latent variables: L-PLSR , 2005, Comput. Stat. Data Anal..

[6]  H. Martens,et al.  Light scattering and light absorbance separated by extended multiplicative signal correction. application to near-infrared transmission analysis of powder mixtures. , 2003, Analytical chemistry.

[7]  E. Oja,et al.  Independent Component Analysis , 2013 .

[8]  Harald Martens,et al.  LPLS-regression: a method for prediction and classification under the influence of background information on predictor variables , 2008 .

[9]  Nouna Kettaneh-Wold,et al.  Analysis of mixture data with partial least squares , 1992 .

[10]  Pierre Comon,et al.  Independent component analysis, A new concept? , 1994, Signal Process..

[11]  H. Martens,et al.  Multivariate analysis of quality , 2000 .

[12]  S. Wold,et al.  The multivariate calibration problem in chemistry solved by the PLS method , 1983 .

[13]  B. Sheldrick Computer analysis of protein mixtures. , 1971, The Biochemical journal.

[14]  Harald Martens,et al.  The senses linking mind and matter , 2008 .

[15]  Charles K. Bayne,et al.  Multivariate Analysis of Quality. An Introduction , 2001 .