A case study of extrapolation in NIR modelling — A chemometric challenge at ‘Chimiométrie 2009’

Abstract Since 2004, a challenge is proposed at the “Chimiometrie” conference organized by the GFC (“Groupe Francais de Chimiometrie”). The annual congress was held in Paris 30 November and 1 December 2009. The data are still available on the GFC website ( www.chimiometrie.org/ ). The data for this session were extracted from the CRA-W (Walloon Agricultural Research Centre, www.cra.wallonie.be/ ) spectral data base. The calibration set and the validation set (the reference values for the latter were not available to the participants) were set up to have an obvious case of extrapolation. The data sets were from whole rapeseed samples and the parameter to be calibrated was the total glucosinolate content. Seven participants reported results and, according to the highest R 2 of prediction, three were asked to present their approach during the conference. These approaches and the ones of the challenge organizers are presented in this paper.

[1]  L. A. Currie,et al.  Nomenclature in evaluation of analytical methods including detection and quantification capabilities (IUPAC Recommendations 1995) , 1995 .

[2]  N. M. Faber,et al.  Uncertainty estimation and figures of merit for multivariate calibration (IUPAC Technical Report) , 2006 .

[3]  J. Riu,et al.  Assessing the accuracy of analytical methods using linear regression with errors in both axes. , 1996, Analytical chemistry.

[4]  Evelyne Vigneau,et al.  Clustering of variables to analyze spectral data , 2005 .

[5]  J. Pierna,et al.  Soil parameter quantification by NIRS as a Chemometric challenge at ‘Chimiométrie 2006’ , 2008 .

[6]  Jean-Michel Roger,et al.  How to build a robust model against perturbation factors with only a few reference values: A chemometric challenge at 'Chimiométrie 2007' , 2011 .

[7]  K. Danzer,et al.  Selectivity in analytical chemistry (IUPAC Recommendations 2001) , 2001 .

[8]  K. Danzer,et al.  SELECTIVITY IN ANALYTICAL CHEMISTRY , 2001 .

[9]  A. Lorber Error propagation and figures of merit for quantification by solving matrix equations , 1986 .

[10]  Pierre Dardenne,et al.  Chemometric contest at ‘Chimiométrie 2005’: A discrimination study , 2007 .

[11]  Ronald D. Snee,et al.  Validation of Regression Models: Methods and Examples , 1977 .

[12]  E. Vigneau,et al.  Clustering of Variables Around Latent Components , 2003 .

[13]  Ronei J. Poppi,et al.  Validation of multivariate calibration models in the determination of sugar cane quality parameters by near infrared spectroscopy , 2007 .

[14]  R. D. Jee,et al.  Meeting the International Conference on Harmonisation's Guidelines on Validation of Analytical Procedures: quantification as exemplified by a near-infrared reflectance assay of paracetamol in intact tablets. , 2000, The Analyst.

[15]  Ulf G. Indahl,et al.  A twist to partial least squares regression , 2005 .

[16]  Pierre Dardenne,et al.  A NIR data set is the object of a chemometric contest at ‘Chimiométrie 2004’ , 2006 .

[17]  P. Dardenne,et al.  Analysis of Quality Parameters of Whole Rapeseed by N.I.R.S. , 1987 .

[18]  P. Dardenne,et al.  Fast analysis of rapeseed glucosinolates by near infrared reflectance spectroscopy , 1988 .

[19]  S. Engelsen,et al.  Interval Partial Least-Squares Regression (iPLS): A Comparative Chemometric Study with an Example from Near-Infrared Spectroscopy , 2000 .

[20]  Goldberg,et al.  Genetic algorithms , 1993, Robust Control Systems with Genetic Algorithms.

[21]  Patrícia Valderrama,et al.  Variable selection, outlier detection, and figures of merit estimation in a partial least-squares regression multivariate calibration model. A case study for the determination of quality parameters in the alcohol industry by near-infrared spectroscopy. , 2007, Journal of agricultural and food chemistry.