Incremental and decremental active learning for optimized self-adaptive calibration in viscose production

Abstract In viscose production, it is important to monitor three process parameters as part of the spin bath in order to assure a high quality of the final product: the concentrations of H 2 SO 4 , Na 2 SO 4 and ZnSO 4 . NIR-spectroscopy is a fast analytical method applicable to conditions of industrial production and is capable of determining those concentrations. The collective composition of the spin bath varies in the industrial process, which implies changes in the matrix of the aforementioned analytes. Thus, conventional static chemometric models, which are trained based on collected calibration spectra from Fourier transform near infrared (FT-NIR) measurements, show a quite imprecise behavior when predicting the concentrations of new on-line data. In this paper, we are presenting a methodology which is able to cope with on-line self-calibration and -adaptation demands in order to compensate high system dynamics, reflected in conceptual changes in the mappings between NIR spectra and target concentrations. The methodology includes intelligent strategies for actively selecting those samples which should be accumulated into and excluded from the current data window in order to optimize the generalization performance of calibration models (thus termed as incremental and decremental active learning stages ) while keeping the number of update cycles (and thus required target measurements) as low as possible. This follows the company requirements in terms of necessary cost reduction . Experiments on real-world data streams from viscose production process show that the new self-calibration methods are able to significantly reduce the number of update cycles while still keeping the predictive quality of the calibration models high (below 5% errors) for H 2 SO 4 and Na 2 SO 4 . Incremental active learning is able to smoothen and improve the overall quality of the predictions, while decremental active learning achieves a lower number of medium to large prediction errors.

[1]  Edwin Lughofer,et al.  Learning in Non-Stationary Environments: Methods and Applications , 2012 .

[2]  Elke Bach,et al.  Using chemometric methods and NIR spectrophotometry in the textile industry , 2000 .

[3]  David E. Booth,et al.  Chemometrics: Data Analysis for the Laboratory and Chemical Plant , 2004, Technometrics.

[4]  Edwin Lughofer,et al.  Self-adaptive and local strategies for a smooth treatment of drifts in data streams , 2014, Evol. Syst..

[5]  Feng Gan,et al.  A spectral similarity measure using Bayesian statistics. , 2009, Analytica chimica acta.

[6]  Edwin Lughofer,et al.  On-line assurance of interpretability criteria in evolving fuzzy systems - Achievements, new concepts and open issues , 2013, Inf. Sci..

[7]  Burr Settles,et al.  Active Learning Literature Survey , 2009 .

[8]  I. Jolliffe Principal Component Analysis , 2002 .

[9]  Plamen P. Angelov,et al.  Handling drifts and shifts in on-line data streams with evolving fuzzy systems , 2011, Appl. Soft Comput..

[10]  I. Talavera,et al.  Support vector regression for functional data in multivariate calibration problems. , 2009, Analytica chimica acta.

[11]  Grzegorz Zadora,et al.  Information-theoretical feature selection using data obtained by scanning electron microscopy coupled with and energy dispersive X-ray spectrometer for the classification of glass traces. , 2011, Analytica chimica acta.

[12]  T. Hirschfeld,et al.  Unique-sample selection via near-infrared spectral subtraction. , 1985, Analytical chemistry.

[13]  T. Næs,et al.  Principal component regression in NIR analysis: Viewpoints, background details and selection of components , 1988 .

[14]  Naif Alajlan,et al.  Active learning for spectroscopic data regression , 2012 .

[15]  Edwin Lughofer,et al.  NIR-based quantification of process parameters in polyetheracrylat (PEA) production using flexible non-linear fuzzy systems , 2011 .

[16]  Michio Sugeno,et al.  Fuzzy identification of systems and its applications to modeling and control , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[17]  Xueguang Shao,et al.  Representative subset selection in modified iterative predictor weighting (mIPW) — PLS models for parsimonious multivariate calibration , 2007 .

[18]  Bahram Hemmateenejad,et al.  QSPR models for half-wave reduction potential of steroids: a comparative study between feature selection and feature extraction from subsets of or entire set of descriptors. , 2009, Analytica chimica acta.

[19]  John H. Kalivas,et al.  Generalized simulated annealing for calibration sample selection from an existing set and orthogonalization of undesigned experiments , 1991 .

[20]  M. Stone Cross‐Validatory Choice and Assessment of Statistical Predictions , 1976 .

[21]  David A. Cohn,et al.  Improving generalization with active learning , 1994, Machine Learning.

[22]  P. Hintenaus,et al.  An Infrared Spectrometer for Process Monitoring I, Spectroscopy , 2007, IECON 2007 - 33rd Annual Conference of the IEEE Industrial Electronics Society.

[23]  Edwin Lughofer,et al.  Hybrid adaptive calibration methods and ensemble strategy for prediction of cloud point in melamine resin production , 2013 .

[24]  H. B. Mann,et al.  On a Test of Whether one of Two Random Variables is Stochastically Larger than the Other , 1947 .

[25]  K. Bellgardt,et al.  On-line fault detection of flow-injection analysis systems based on recursive parameter estimation , 1995 .

[26]  Klaus Nordhausen,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Second Edition by Trevor Hastie, Robert Tibshirani, Jerome Friedman , 2009 .

[27]  Peter Filzmoser,et al.  Introduction to Multivariate Statistical Analysis in Chemometrics , 2009 .

[28]  L. Brás,et al.  A bootstrap‐based strategy for spectral interval selection in PLS regression , 2008 .

[29]  H. Hyötyniemi,et al.  Recursive multimodel partial least squares estimation of mineral flotation slurry contents using optical reflectance spectra. , 2009, Analytica chimica acta.

[30]  J. Miller,et al.  Statistics and chemometrics for analytical chemistry , 2005 .

[31]  Wei Hu,et al.  Unsupervised Active Learning Based on Hierarchical Graph-Theoretic Clustering , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[32]  C. Ruckebusch,et al.  Genetic algorithm optimisation combined with partial least squares regression and mutual information variable selection procedures in near-infrared quantitative analysis of cotton-viscose textiles. , 2007, Analytica chimica acta.

[33]  Edwin Lughofer,et al.  On-line elimination of local redundancies in evolving fuzzy systems , 2011, Evol. Syst..

[34]  Bao Lin,et al.  Observer and Data-Driven-Model-Based Fault Detection in Power Plant Coal Mills , 2008, IEEE Transactions on Energy Conversion.

[35]  Haitao Zhao,et al.  A novel incremental principal component analysis and its application for face recognition , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[36]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[37]  Edwin Lughofer,et al.  FLEXFIS: A Robust Incremental Learning Approach for Evolving Takagi–Sugeno Fuzzy Models , 2008, IEEE Transactions on Fuzzy Systems.

[38]  Sundaram Suresh,et al.  A Meta-cognitive Interval Type-2 fuzzy inference system classifier and its projection based learning algorithm , 2013, 2013 IEEE Conference on Evolving and Adaptive Intelligent Systems (EAIS).

[39]  Matthias Otto,et al.  Chemometrics: Statistics and Computer Application in Analytical Chemistry , 1999 .

[40]  Edwin Lughofer,et al.  Evolving chemometric models for predicting dynamic process parameters in viscose production. , 2012, Analytica chimica acta.

[41]  Jesús S. Aguilar-Ruiz,et al.  Knowledge discovery from data streams , 2009, Intell. Data Anal..

[42]  Edwin Lughofer,et al.  Single-pass active learning with conflict and ignorance , 2012, Evolving Systems.

[43]  Edwin Lughofer,et al.  Flexible Evolving Fuzzy Inference Systems from Data Streams (FLEXFIS , 2012 .

[44]  T. Næs,et al.  Locally weighted regression and scatter correction for near-infrared reflectance data , 1990 .

[45]  Stephen R. Delwiche,et al.  SAS® Partial Least Squares Regression for Analysis of Spectroscopic Data , 2003 .

[46]  Celio Pasquini,et al.  A strategy for selecting calibration samples for multivariate modelling , 2004 .