Sample and feature augmentation strategies for calibration updating

Calibration updating—transfer and/or maintenance—has historically been implemented using a simple but effective technique: in addition to primary samples, include a small number of secondary samples and weight them. It would be beneficial if these classical weighting techniques could be enhanced. Moreover, it would be ideal if we could only use secondary spectra without reference measurements. In this paper, we examine multiple calibration updating scenarios involving unlabeled and labeled secondary spectra. First, we propose three new updating approaches involving sample augmentation whereby unlabeled secondary spectra are used to construct an “undesirable” subspace. This subspace is then used to steer the model vector away from a spectroscopically undesirable solution. Second, we propose two new feature augmentation approaches using labeled secondary samples. These three approaches involves the sum of two model vectors, a dedicated primary model vector plus a perturbation vector, that can accommodate new secondary samples. We rigorously vet the proposed approaches across two near‐infrared (NIR) data sets and across multiple data splits. Out of all of the approaches examined, one feature augmentation approach provides improved results compared with existing approaches, and one sample augmentation approach utilizing only unlabeled secondary spectra appears promising.

[1]  Charles L. Lawson,et al.  Solving least squares problems , 1976, Classics in applied mathematics.

[2]  D. Longsine,et al.  Simultaneous rayleigh-quotient minimization methods for Ax=λBx , 1980 .

[3]  John H. Kalivas,et al.  Fusion strategies for selecting multiple tuning parameters for multivariate calibration and other penalty based processes: A model updating application for pharmaceutical analysis. , 2016, Analytica chimica acta.

[4]  S. Wold,et al.  Orthogonal signal correction of near-infrared spectra , 1998 .

[5]  Charles R. Hurburgh,et al.  Standardisation of near Infrared Spectrometers: Evaluation of Some Common Techniques for Intra- and Inter-Brand Calibration Transfer , 2008 .

[6]  Muhammad Ghifary,et al.  Domain Adaptation and Domain Generalization with Representation Learning , 2016 .

[7]  David M. Haaland,et al.  Concentration Residual Augmented Classical Least Squares (CRACLS): A Multivariate Calibration Method with Advantages over Partial Least Squares , 2002 .

[8]  J. Kalivas,et al.  Penalty processes for combining roughness and smoothness in spectral multivariate calibration , 2016 .

[9]  K. Helland,et al.  Recursive algorithm for partial least squares regression , 1992 .

[10]  Steven D. Brown,et al.  Transfer of multivariate calibration models: a review , 2002 .

[11]  Desire L. Massart,et al.  Standardization of near-infrared spectra in the wavelet domain , 1997 .

[12]  P. Hansen Rank-Deficient and Discrete Ill-Posed Problems: Numerical Aspects of Linear Inversion , 1987 .

[13]  Erik Andries,et al.  Penalized eigendecompositions: motivations from domain adaptation for calibration transfer , 2017 .

[14]  Tom Fearn,et al.  Collaborative evaluation of universal calibrations for the measurement of protein and moisture in flour by near infrared reflectance , 2007 .

[15]  Julianne Chung,et al.  Optimal Regularization Parameters for General-Form Tikhonov Regularization , 2014, 1407.1911.

[16]  D. Massart,et al.  Standardization of near-infrared spectrometric instruments , 1996 .

[17]  J. Shenk,et al.  Calibration Transfer Between near Infrared Reflectance Spectrophotometers , 1985 .

[18]  Erik Andries,et al.  Model updating for spectral calibration maintenance and transfer using 1-norm variants of Tikhonov regularization. , 2010, Analytical chemistry.

[19]  M. Forina,et al.  Transfer of calibration function in near-infrared spectroscopy , 1995 .

[20]  T. Fearn Standardisation and Calibration Transfer for near Infrared Instruments: A Review , 2001 .

[21]  Charles R. Hurburgh,et al.  Improving the transfer of near infrared prediction models by orthogonal methods , 2009 .

[22]  J. Kalivas,et al.  Sample‐wise spectral multivariate calibration desensitized to new artifacts relative to the calibration data using a residual penalty , 2017 .

[23]  David M. Haaland,et al.  New augmented classical least squares methods for improved quantitative spectral analyses , 2002 .

[24]  John H. Kalivas,et al.  Learning from Procrustes analysis to improve multivariate calibration , 2008 .

[25]  Bruce R. Kowalski,et al.  Weighting schemes for updating regression models—a theoretical approach , 1999 .

[26]  Xiaojin Zhu,et al.  Semi-Supervised Learning , 2010, Encyclopedia of Machine Learning.

[27]  G. W. Small,et al.  Spectral Simulation Methodology for Calibration Transfer of Near-Infrared Spectra , 2007, Applied spectroscopy.

[28]  E. V. Thomas Incorporating auxiliary predictor variation in principal component regression models , 1995 .

[29]  Onno E. de Noord,et al.  Multivariate calibration standardization , 1994 .

[30]  Tormod Næs,et al.  Selecting the number of factors in principal component analysis by permutation testing—Numerical and practical aspects , 2017 .

[31]  Steven D. Brown,et al.  Wavelet analysis applied to removing non‐constant, varying spectroscopic background in multivariate calibration , 2002 .

[32]  Yehuda Koren,et al.  Ieee Transactions on Visualization and Computer Graphics Robust Linear Dimensionality Reduction , 2022 .

[33]  J. Kalivas,et al.  Spectral multivariate calibration without laboratory prepared or determined reference analyte values. , 2013, Analytical chemistry.

[34]  Tom Fearn,et al.  COLLABORATIVE EVALUATION OF NEAR-INFRARED REFLECTANCE ANALYSIS FOR THE DETERMINATION OF PROTEIN, MOISTURE AND HARDNESS IN WHEAT , 1983 .

[35]  Erik Andries,et al.  Calibration Maintenance and Transfer Using Tikhonov Regularization Approaches , 2009, Applied spectroscopy.

[36]  J. Kalivas,et al.  Interrelationships between generalized Tikhonov regularization, generalized net analyte signal, and generalized least squares for desensitizing a multivariate calibration to interferences , 2013 .

[37]  John H. Kalivas,et al.  Fundamentals of Calibration Transfer through Procrustes Analysis , 1999 .

[38]  Onno E. de Noord,et al.  The influence of data preprocessing on the robustness and parsimony of multivariate calibration models , 1994 .

[39]  David M. Haaland,et al.  New Hybrid Algorithm for Maintaining Multivariate Quantitative Calibrations of a Near-Infrared Spectrometer , 2002 .

[40]  Aiyou Chen,et al.  Data enriched linear regression , 2013, 1304.1837.

[41]  Tom Fearn,et al.  Transfer by orthogonal projection: making near-infrared calibrations robust to between-instrument variation , 2004 .

[42]  Olof Svensson,et al.  An evaluation of orthogonal signal correction applied to calibration transfer of near infrared spectra , 1998 .

[43]  David M. Haaland,et al.  New Prediction-Augmented Classical Least-Squares (PACLS) Methods: Application to Unmodeled Interferents , 2000 .

[44]  E. R. Malinowski Determination of rank by augmentation (DRAUG) , 2011 .

[45]  C. Lupton,et al.  Effects of breed, sex, and age on the variation and ability of fecal near-infrared reflectance spectra to predict the composition of goat diets. , 2007, Journal of animal science.

[46]  Bruce R. Kowalski,et al.  Temperature-compensating calibration transfer for near-infrared filter instruments , 1993 .

[47]  B. Kowalski,et al.  Multivariate instrument standardization , 1991 .

[48]  M. P. Callao,et al.  Multivariate standardization techniques using UV-Vis data , 1997 .

[49]  Jieping Ye,et al.  Generalized Linear Discriminant Analysis: A Unified Framework and Efficient Model Selection , 2008, IEEE Transactions on Neural Networks.

[50]  Christopher D. Brown,et al.  Discordance between net analyte signal theory and practical multivariate calibration. , 2004, Analytical chemistry.

[51]  Eric O. Postma,et al.  Dimensionality Reduction: A Comparative Review , 2008 .

[52]  Ahmed H. Sameh,et al.  Trace Minimization Algorithm for the Generalized Eigenvalue Problem , 1982, PPSC.

[53]  David M. Haaland,et al.  New Classical Least-Squares/Partial Least-Squares Hybrid Algorithm for Spectral Analyses , 2001 .

[54]  Desire L. Massart,et al.  Improvement of the piecewise direct standardisation procedure for the transfer of NIR spectra for multivariate calibration , 1996 .

[55]  H. Swierenga,et al.  Comparison of Two Different Approaches toward Model Transferability in NIR Spectroscopy , 1998 .