Improvement of near infrared spectroscopic (NIRS) analysis of caffeine in roasted Arabica coffee by variable selection method of stability competitive adaptive reweighted sampling (SCARS).

Coffee is the most heavily consumed beverage in the world after water, for which quality is a key consideration in commercial trade. Therefore, caffeine content which has a significant effect on the final quality of the coffee products requires to be determined fast and reliably by new analytical techniques. The main purpose of this work was to establish a powerful and practical analytical method based on near infrared spectroscopy (NIRS) and chemometrics for quantitative determination of caffeine content in roasted Arabica coffees. Ground coffee samples within a wide range of roasted levels were analyzed by NIR, meanwhile, in which the caffeine contents were quantitative determined by the most commonly used HPLC-UV method as the reference values. Then calibration models based on chemometric analyses of the NIR spectral data and reference concentrations of coffee samples were developed. Partial least squares (PLS) regression was used to construct the models. Furthermore, diverse spectra pretreatment and variable selection techniques were applied in order to obtain robust and reliable reduced-spectrum regression models. Comparing the respective quality of the different models constructed, the application of second derivative pretreatment and stability competitive adaptive reweighted sampling (SCARS) variable selection provided a notably improved regression model, with root mean square error of cross validation (RMSECV) of 0.375 mg/g and correlation coefficient (R) of 0.918 at PLS factor of 7. An independent test set was used to assess the model, with the root mean square error of prediction (RMSEP) of 0.378 mg/g, mean relative error of 1.976% and mean relative standard deviation (RSD) of 1.707%. Thus, the results provided by the high-quality calibration model revealed the feasibility of NIR spectroscopy for at-line application to predict the caffeine content of unknown roasted coffee samples, thanks to the short analysis time of a few seconds and non-destructive advantages of NIRS.

[1]  W. Cai,et al.  A variable selection method based on uninformative variable elimination for multivariate calibration of near-infrared spectra , 2008 .

[2]  C. Pizarro,et al.  Prediction of Roasting Colour and other Quality Parameters of Roasted Coffee Samples by near Infrared Spectroscopy. A Feasibility Study , 2004 .

[3]  Israel Schechter,et al.  Wavelength Selection for Simultaneous Spectroscopic Analysis. Experimental and Theoretical Study , 1996 .

[4]  S. Engelsen,et al.  Interval Partial Least-Squares Regression (iPLS): A Comparative Chemometric Study with an Example from Near-Infrared Spectroscopy , 2000 .

[5]  C. Spiegelman,et al.  Theoretical Justification of Wavelength Selection in PLS Calibration:  Development of a New Algorithm. , 1998, Analytical Chemistry.

[6]  S. Lanteri,et al.  Selection of useful predictors in multivariate calibration , 2004, Analytical and bioanalytical chemistry.

[7]  Ronald R. Coifman,et al.  The prediction error in CLS and PLS: the importance of feature selection prior to multivariate calibration , 2005 .

[8]  J. Lane,et al.  Effects of hot tea, coffee and water ingestion on physiological responses and mood: the role of caffeine, water and beverage type , 1997, Psychopharmacology.

[9]  M. C. U. Araújo,et al.  The successive projections algorithm for variable selection in spectroscopic multicomponent analysis , 2001 .

[10]  Santina Romani,et al.  Near infrared spectroscopy: an analytical tool to predict coffee roasting degree. , 2008, Analytica chimica acta.

[11]  P. D. Tzanavaras,et al.  Development and validation of a high-throughput high-performance liquid chromatographic assay for the determination of caffeine in food samples using a monolithic column. , 2007, Analytica chimica acta.

[12]  Authentication of Whole and Ground Coffee Beans by near Infrared Reflectance Spectroscopy , 1994 .

[13]  A. García-Lafuente,et al.  Fast and simultaneous determination of phenolic compounds and caffeine in teas, mate, instant coffee, soft drink and energetic drink by high-performance liquid chromatography using a fused-core column. , 2011, Analytica chimica acta.

[14]  Hongdong Li,et al.  Key wavelengths screening using competitive adaptive reweighted sampling method for multivariate calibration. , 2009, Analytica chimica acta.

[15]  G. Alpdoğan,et al.  Derivative Spectrophotometric Determination of Caffeine in Some Beverages , 2002 .

[16]  Consuelo Pizarro,et al.  Influence of data pre-processing on the quantitative determination of the ash content and lipids in roasted coffee by near infrared spectroscopy , 2004 .

[17]  Kaiyi Zheng,et al.  Stability competitive adaptive reweighted sampling (SCARS) and its applications to multivariate calibration of NIR spectra , 2012 .

[18]  R. Teófilo,et al.  Sorting variables by using informative vectors as a strategy for feature selection in multivariate regression , 2009 .

[19]  Yvan Vander Heyden,et al.  Improved variable reduction in partial least squares modelling based on predictive-property-ranked variables and adaptation of partial least squares complexity. , 2011, Analytica chimica acta.

[20]  B. Strukelj,et al.  Determination of caffeine and associated compounds in food, beverages, natural products, pharmaceuticals, and cosmetics by micellar electrokinetic capillary chromatography. , 2008, Journal of chromatographic science.

[21]  A. Savitzky,et al.  Smoothing and Differentiation of Data by Simplified Least Squares Procedures. , 1964 .

[22]  C. Pizarro,et al.  Coffee varietal differentiation based on near infrared spectroscopy. , 2007, Talanta.

[23]  Consuelo Pizarro,et al.  Prediction of sensory properties of espresso from roasted coffee samples by near-infrared spectroscopy , 2004 .

[24]  Christian W. Huck,et al.  Analysis of caffeine, theobromine and theophylline in coffee by near infrared spectroscopy (NIRS) compared to high-performance liquid chromatography (HPLC) coupled to mass spectrometry , 2005 .

[25]  M. Gallignani,et al.  Determination of theobromine, theophylline and caffeine in cocoa samples by a high-performance liquid chromatographic method with on-line sample cleanup in a switching-column system , 2007 .

[26]  Xueguang Shao,et al.  Application of latent projective graph in variable selection for near infrared spectral analysis , 2012 .

[27]  M A Arnold,et al.  Genetic algorithm-based method for selecting wavelengths and model size for use with partial least-squares regression: application to near-infrared spectroscopy. , 1996, Analytical chemistry.

[28]  Authentication of Coffee Bean Variety by Near-infrared Reflectance Spectroscopy of Dried Extract , 1996 .

[29]  M. Forina,et al.  Use of near-infrared spectroscopy and feature selection techniques for predicting the caffeine content and roasting color in roasted coffees. , 2007, Journal of agricultural and food chemistry.

[30]  E. K. Kemsley,et al.  Near- and Mid-Infrared Spectroscopies in Food Authentication: Coffee Varietal Identification , 1997 .

[31]  Xueguang Shao,et al.  A wavelength selection method based on randomization test for near-infrared spectral analysis , 2009 .

[32]  S. Garrigues,et al.  Solid-phase FT-Raman determination of caffeine in energy drinks , 2005 .

[33]  C. Máguas,et al.  Application of solid-phase extraction to brewed coffee caffeine and organic acid determination by UV/HPLC , 2007 .

[34]  E. Abourashed,et al.  HPTLC determination of caffeine in stimulant herbal products and power drinks. , 2004, Journal of pharmaceutical and biomedical analysis.

[35]  Xueguang Shao,et al.  Multivariate calibration of near-infrared spectra by using influential variables , 2012 .

[36]  C. Pizarro,et al.  Mixture resolution according to the percentage of robusta variety in order to detect adulteration in roasted coffee by near infrared spectroscopy. , 2007, Analytica chimica acta.

[37]  J. Dórea,et al.  Is coffee a functional food? , 2005, British Journal of Nutrition.

[38]  Takayuki Shibamoto,et al.  Chlorogenic acid and caffeine contents in various commercial brewed coffees , 2008 .

[39]  D. Massart,et al.  Elimination of uninformative variables for multivariate calibration. , 1996, Analytical chemistry.

[40]  Draženka Komes,et al.  Comparative study of polyphenols and caffeine in different coffee varieties affected by the degree of roasting. , 2011, Food chemistry.

[41]  T. Næs,et al.  The Effect of Multiplicative Scatter Correction (MSC) and Linearity Improvement in NIR Spectroscopy , 1988 .

[42]  J. Irudayaraj,et al.  Rapid determination of caffeine content in soft drinks using FTIR-ATR spectroscopy , 2002 .

[43]  J. M. González-Sáiz,et al.  An evaluation of orthogonal signal correction methods for the characterisation of arabica and robusta coffee varieties by NIRS , 2004 .

[44]  G. Morlock,et al.  Simultaneous determination of riboflavin, pyridoxine, nicotinamide, caffeine and taurine in energy drinks by planar chromatography-multiple detection with confirmation by electrospray ionization mass spectrometry. , 2006, Journal of chromatography. A.