Learning from Procrustes analysis to improve multivariate calibration

For the multivariate calibration model y = Xb + e, y and X are typically mean centered to respective column means to remove offsets before estimating b. Often X is obtained under different experimental conditions (temperature, days, etc.). In this case, mean centering across all the conditions (global mean centering) assumes a common offset for the different conditions. Additionally, after a calibration model has been developed for process analysis, it is often necessary to update the model later in time to new conditions. In this case, new measurements are made. These new X and y values can be augmented to the original X and y arrays; mean centered to new column means, that is global mean centering; and a new calibration model is then obtained. The established method of Procrustes analysis (PA) is used in many spectral preprocessing and calibration transfer methods and local mean centering is a key processing element in PA. Rather than global mean centering in multivariate calibration, this paper looks at local mean centering where X and y are locally mean centered relative to respective experimental conditions, that is mean centering data groups with common offsets. When this approach is used, improved results are possible as demonstrated with two spectroscopic data sets where prediction errors and uncertainties reduce. Other regression diagnostics are also shown to improve. Copyright © 2008 John Wiley & Sons, Ltd.

[1]  A K Smilde,et al.  Influence of temperature on vibrational spectra and consequences for the predictive ability of multivariate models. , 1998, Analytical chemistry.

[2]  T. Fearn Standardisation and Calibration Transfer for near Infrared Instruments: A Review , 2001 .

[3]  Edward V. Thomas,et al.  Development of Robust Multivariate Calibration Models , 2000, Technometrics.

[4]  John H. Kalivas,et al.  Impartial graphical comparison of multivariate calibration methods and the harmony/parsimony tradeoff , 2006 .

[5]  Bruce R. Kowalski,et al.  Propagation of measurement errors for the validation of predictions obtained by principal component regression and partial least squares , 1997 .

[6]  Tormod Næs,et al.  Related versions of the multiplicative scatter correction method for preprocessing spectroscopic data , 1995 .

[7]  Steven D. Brown,et al.  Transfer of multivariate calibration models: a review , 2002 .

[8]  B. Kowalski,et al.  Multivariate instrument standardization , 1991 .

[9]  Onno E. de Noord,et al.  Multivariate calibration standardization , 1994 .

[10]  Thomas D. Wickens,et al.  The geometry of multivariate statistics , 1994 .

[11]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[12]  José Manuel Andrade,et al.  Procrustes rotation in analytical chemistry, a tutorial , 2004 .

[13]  Avraham Lorber,et al.  Estimation of prediction error for multivariate calibration , 1988 .

[14]  Stephen T. Balke,et al.  Mid-Infrared Spectra from Near-Infrared Spectra Using Partial Least-Squares , 1993 .

[15]  Huwei Tan,et al.  Improvement of a Standard-Free Method for Near-Infrared Calibration Transfer , 2002 .

[16]  H. Martens,et al.  Near-Infrared Absorption and Scattering Separated by Extended Inverted Signal Correction (EISC): Analysis of Near-Infrared Transmittance Spectra of Single Wheat Seeds , 2002 .

[17]  Bruce R. Kowalski,et al.  The effect of mean centering on prediction in multivariate calibration , 1992 .

[18]  Tormod Næs,et al.  A user-friendly guide to multivariate calibration and classification , 2002 .

[19]  Tormod Næs,et al.  Comparison of Methods for Transfer of Calibration Models in Near-Infared Spectroscopy: A Case Study Based on Correcting Path Length Differences Using Fiber-Optic Transmittance Probes in In-Line Near-Infrared Spectroscopy , 2005, Applied spectroscopy.

[20]  John H. Kalivas,et al.  Multivariate Calibration, an Overview , 2005 .

[21]  Bruce R. Kowalski,et al.  Additive Background Correction in Multivariate Instrument Standardization , 1995 .

[22]  R. Bro,et al.  Centering and scaling in component analysis , 2003 .

[23]  Improvement of the partial least squares model performance for oral glucose intake experiments by inside mean centering and inside multiplicative signal correction. , 2005, Analytical sciences : the international journal of the Japan Society for Analytical Chemistry.

[24]  John H. Kalivas,et al.  Fundamentals of Calibration Transfer through Procrustes Analysis , 1999 .