Two-level independent component regression model for multivariate spectroscopic calibration

Abstract In this paper, a two-level independent component regression (ICR) model is developed for multivariate spectroscopic calibration. Compared to the traditionally used principal component regression and partial least squared regression model, the ICR model is more efficient to extract high order statistical information from the spectra data. To improve the calibration performance, an ensemble form of the ICR model is proposed. In the first level of the method, various subspaces are constructed based on the independent component decomposition of the original data space. Meanwhile, by defining a related index, the most important variables in each subspace are selected for ICR modeling, which form the second level of the proposed method. A Bayesian inference strategy is further developed for probabilistic combination of calibration results obtained from different subspaces. For performance evaluation, two case studies are carried out on a benchmark spectra dataset.

[1]  Richard G. Brereton,et al.  A short history of chemometrics: a personal view , 2014 .

[2]  Elaine Martin,et al.  Bayesian linear regression and variable selection for spectroscopic calibration. , 2009, Analytica chimica acta.

[3]  Tin Kam Ho,et al.  The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Chih-Chou Chiu,et al.  Integration of nonlinear independent component analysis and support vector regression for stock price forecasting , 2013, Neurocomputing.

[5]  Zhiqiang Ge,et al.  Bagging support vector data description model for batch process monitoring , 2013 .

[6]  ChangKyoo Yoo,et al.  Statistical process monitoring with independent component analysis , 2004 .

[7]  X. Z. Wang,et al.  A New Approach to Near-Infrared Spectral Data Analysis Using Independent Component Analysis , 2001, J. Chem. Inf. Comput. Sci..

[8]  Stephen R. Delwiche,et al.  Integration of independent component analysis with near infrared spectroscopy for evaluation of rice freshness , 2014 .

[9]  Youngjo Lee,et al.  Sparse partial least-squares regression and its applications to high-throughput data analysis , 2011 .

[10]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[11]  R. Brereton Pattern recognition in chemometrics , 2015 .

[12]  Erkki Oja,et al.  Independent component analysis: algorithms and applications , 2000, Neural Networks.

[13]  Lei Xie,et al.  Statistical‐based monitoring of multivariate non‐Gaussian systems , 2008 .

[14]  P. H. Araújo,et al.  Spectroscopic on-line monitoring of reactions in dispersed medium: chemometric challenges. , 2007, Analytica chimica acta.

[15]  Richard G. Brereton A short history of chemometrics: a personal view , 2014 .

[16]  J. Suykens,et al.  A tutorial on support vector machine-based methods for classification problems in chemometrics. , 2010, Analytica chimica acta.

[17]  M Lopatka,et al.  Probabilistic peak detection for first-order chromatographic data. , 2014, Analytica chimica acta.

[18]  Calibration transfer of partial least squares jet fuel property models using a segmented virtual standards slope-bias correction method , 2012 .

[19]  M. Strlič,et al.  Non-destructive dating of fiber-based gelatin silver prints using near-infrared spectroscopy and multivariate analysis , 2012, Analytical and Bioanalytical Chemistry.

[20]  David I. Ellis,et al.  A comparative investigation of modern feature selection and classification approaches for the analysis of mass spectrometry data. , 2014, Analytica chimica acta.

[21]  Ingoo Han,et al.  Extracting underlying meaningful features and canceling noise using independent component analysis for direct marketing , 2007, Expert Syst. Appl..

[22]  Manabu Kano,et al.  Monitoring independent components for fault detection , 2003 .

[23]  Ke Wang,et al.  Bagging for robust non-linear multivariate calibration of spectroscopy , 2011 .

[24]  Smilde,et al.  Spectroscopic monitoring of batch reactions for on-line fault detection and diagnosis , 2000, Analytical chemistry.

[25]  Biao Huang,et al.  A unified recursive just-in-time approach with industrial near infrared spectroscopy application , 2014 .

[26]  Amos Ben-Zvi,et al.  Online sensor for monitoring a microalgal bioreactor system using support vector regression , 2012 .

[27]  Rolf Ergon,et al.  Reduced PCR/PLSR models by subspace projections , 2006 .

[28]  Roger C. Wiens,et al.  Independent component analysis classification of laser induced breakdown spectroscopy spectra , 2013 .

[29]  W. Cai,et al.  An improved boosting partial least squares method for near-infrared spectroscopic quantitative analysis. , 2010, Analytica chimica acta.

[30]  Frank Westad,et al.  Independent component analysis and regression applied on sensory data , 2005 .

[31]  Parviz Shahbazikhah,et al.  A consensus modeling approach to update a spectroscopic calibration , 2013 .

[32]  Hiromasa Kaneko,et al.  Development of a New Regression Analysis Method Using Independent Component Analysis , 2008, J. Chem. Inf. Model..

[33]  Dong-Sheng Cao,et al.  The boosting: A new idea of building models , 2010 .

[34]  Mats G. Gustafsson,et al.  Independent Component Analysis Yields Chemically Interpretable Latent Variables in Multivariate Regression , 2005, J. Chem. Inf. Model..

[35]  X. Wang,et al.  Dimension reduction of process dynamic trends using independent component analysis , 2002 .

[36]  G. Guiochon,et al.  Polynomial multivariate least‐squares regression for modeling nonlinear data applied to in‐depth characterization of chromatographic resolution , 2011 .

[37]  Xuelong Li,et al.  Asymmetric bagging and random subspace for support vector machines-based relevance feedback in image retrieval , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38]  Randy J. Pell,et al.  Process analytical chemistry and chemometrics, Bruce Kowalski's legacy at The Dow Chemical Company , 2014 .