Neural Network-Based Active Learning in Multivariate Calibration

In chemometrics, data from infrared or near-infrared (NIR) spectroscopy are often used to identify a compound or to analyze the composition of a material. This involves the calibration of models that predict the concentration of material constituents from the measured NIR spectrum. An interesting aspect of multivariate calibration is to achieve a particular accuracy level with a minimum number of training samples, as this reduces the number of laboratory tests and thus the cost of model building. In these chemometric models, the input refers to a proper representation of the spectra and the output to the concentrations of the sample constituents. The search for a most informative new calibration sample thus has to be performed in the output space of the model, rather than in the input space as in conventional modeling problems. In this paper, we propose to solve the corresponding inversion problem by utilizing the disagreements of an ensemble of neural networks to represent the prediction error in the unexplored component space. The next calibration sample is then chosen at a composition where the individual models of the ensemble disagree most. The results obtained for a realistic chemometric calibration example show that the proposed active learning can achieve a given calibration accuracy with less training samples than random sampling.

[1]  Abhisek Ukil,et al.  Improved Calibration of Near-Infrared Spectra by Using Ensembles of Neural Network Models , 2010, IEEE Sensors Journal.

[2]  Jayanta Basak,et al.  Active Evaluation and Ranking of Multiple-Attribute Items Using Feedforward Neural Networks , 2006, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[3]  Abhisek Ukil,et al.  Intelligent systems and signal processing in power engineering , 2007 .

[4]  K. Funatsu,et al.  Prediction of Polyethylene Density by Near-Infrared Spectroscopy Combined with Neural Network Analysis , 2003 .

[5]  Roman M. Balabin,et al.  Comparison of linear and nonlinear calibration models based on near infrared (NIR) spectroscopy data for gasoline properties prediction , 2007 .

[6]  David E. Booth,et al.  Chemometrics: Data Analysis for the Laboratory and Chemical Plant , 2004, Technometrics.

[7]  Michel Verleysen,et al.  Chemometric calibration of infrared spectrometers: selection and validation of variables by non-linear models , 2004 .

[8]  Xiaodong Lin,et al.  Active Learning From Stream Data Using Optimal Weight Classifier Ensemble , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[9]  Andreas Zell,et al.  Different criteria for active learning in neural networks: a comparative study , 2002, ESANN.

[10]  Anders Krogh,et al.  Neural Network Ensembles, Cross Validation, and Active Learning , 1994, NIPS.

[11]  David J. C. MacKay,et al.  Information-Based Objective Functions for Active Data Selection , 1992, Neural Computation.

[12]  Roman M. Balabin,et al.  Universal technique for optimization of neural network training parameters: gasoline near infrared data example , 2009, Neural Computing and Applications.

[13]  R. Carlson,et al.  Design of Experiments, Principles and Applications, L. Eriksson, E. Johansson, N. Kettaneh‐ Wold, C. Wikström and S. Wold, Umetrics AB, Umeå Learnways AB, Stockholm, 2000, ISBN 91‐973730‐0‐1, xii + 329 pp. , 2001 .

[14]  Marcelo Blanco,et al.  NIR spectroscopy: a rapid-response analytical tool , 2002 .

[15]  Tormod Næs,et al.  Artificial Neural Networks in Multivariate Calibration , 1993 .

[16]  S. Hyakin,et al.  Neural Networks: A Comprehensive Foundation , 1994 .

[17]  田原 Matlab语言的Neural Network Toolbox及其在同步中的应用 , 2008 .

[18]  Tetsuo Sato Application of an Artificial Neural Network to the Identification of Amino Acids from near Infrared Spectral Data , 1993 .

[19]  F Despagne,et al.  Neural networks in multivariate calibration. , 1998, The Analyst.

[20]  Wei Hu,et al.  Unsupervised Active Learning Based on Hierarchical Graph-Theoretic Clustering , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).