Near infrared reflectance spectrometry classification of cigarettes using the successive projections algorithm for variable selection.

This paper proposes a methodology for cigarette classification employing Near Infrared Reflectance spectrometry and variable selection. For this purpose, the Successive Projections Algorithm (SPA) is employed to choose an appropriate subset of wavenumbers for a Linear Discriminant Analysis (LDA) model. The proposed methodology is applied to a set of 210 cigarettes of four different brands. For comparison, Soft Independent Modelling of Class Analogy (SIMCA) is also employed for full-spectrum classification. The resulting SPA-LDA model successfully classified all test samples with respect to their brands using only two wavenumbers (5058 and 4903 cm(-1)). In contrast, the SIMCA models were not able to achieve 100% of classification accuracy, regardless of the significance level adopted for the F-test. The results obtained in this investigation suggest that the proposed methodology is a promising alternative for assessment of cigarette authenticity.

[1]  G. Downey,et al.  Detecting and quantifying sunflower oil adulteration in extra virgin olive oils from the eastern mediterranean by visible and near-infrared spectroscopy. , 2002, Journal of agricultural and food chemistry.

[2]  T. B. Murphy,et al.  A comparison of model-based and regression classification techniques applied to near infrared spectroscopic data in food authentication studies , 2007 .

[3]  David G. Stork,et al.  Pattern Classification (2nd ed.) , 1999 .

[4]  D. Paschal,et al.  Cadmium, lead, and thallium in mainstream tobacco smoke particulate. , 2006, Food and chemical toxicology : an international journal published for the British Industrial Biological Research Association.

[5]  Xin Qin,et al.  Study of the feasibility of distinguishing cigarettes of different brands using an Adaboost algorithm and near-infrared spectroscopy , 2007, Analytical and bioanalytical chemistry.

[6]  Dong Wang,et al.  Successive projections algorithm combined with uninformative variable elimination for spectral variable selection , 2008 .

[7]  M. C. U. Araújo,et al.  The successive projections algorithm for variable selection in spectroscopic multicomponent analysis , 2001 .

[8]  Celio Pasquini,et al.  Assessment of infrared spectroscopy and multivariate techniques for monitoring the service condition of diesel-engine lubricating oils. , 2006, Talanta.

[9]  Panmanas Sirisomboon,et al.  Study on non-destructive evaluation methods for defect pods for green soybean processing by near-infrared spectroscopy. , 2009 .

[10]  Roberto Kawakami Harrop Galvão,et al.  A variable elimination method to improve the parsimony of MLR models using the successive projections algorithm , 2008 .

[11]  T. Streibel,et al.  Discrimination of three tobacco types (Burley, Virginia and Oriental) by pyrolysis single-photon ionisation–time-of-flight mass spectrometry and advanced statistical methods , 2005, Analytical and bioanalytical chemistry.

[12]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[13]  D. Massart,et al.  Near-infrared spectroscopy applications in pharmaceutical analysis. , 2007, Talanta.

[14]  M. C. U. Araújo,et al.  Classification of edible vegetable oils using square wave voltammetry with multivariate data analysis. , 2009, Talanta.

[15]  Chonghun Han,et al.  Real-time classification of petroleum products using near-infrared spectra , 2000 .

[16]  Yizeng Liang,et al.  Comparative analysis of the volatile components in cut tobacco from different locations with gas chromatography-mass spectrometry (GC-MS) and combined chemometric methods. , 2006, Analytica chimica acta.

[17]  Venkata Radhakrishna Kondepati,et al.  Application of near-infrared spectroscopy for the diagnosis of colorectal cancer in resected human tissue specimens , 2007 .

[18]  Randall D. Tobias,et al.  Chemometrics: A Practical Guide , 1998, Technometrics.

[19]  S. Furlanetto,et al.  Identification and determination of mainstream and sidestream smoke components in different brands and types of cigarettes by means of solid-phase microextraction-gas chromatography-mass spectrometry. , 2008, Journal of chromatography. A.

[20]  Roman M. Balabin,et al.  Gasoline classification by source and type based on near infrared (NIR) spectroscopy data , 2008 .

[21]  E. Smidt,et al.  Classification of waste materials using Fourier transform infrared spectroscopy and soft independent modeling of class analogy. , 2008, Waste management.

[22]  L. A. Stone,et al.  Computer Aided Design of Experiments , 1969 .

[23]  D. Steinberg,et al.  Technometrics , 2008 .

[24]  D. Coomans,et al.  Recent developments in discriminant analysis on high dimensional spectral data , 1996 .

[25]  Márcio José Coelho Pontes,et al.  Classification of distilled alcoholic beverages and verification of adulteration by near infrared spectrometry , 2006 .

[26]  C. Pasquini Near Infrared Spectroscopy: fundamentals, practical aspects and analytical applications , 2003 .

[27]  Desire L. Massart,et al.  Comparison of regularized discriminant analysis linear discriminant analysis and quadratic discriminant analysis applied to NIR data , 1996 .

[28]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[29]  David G. Stork,et al.  Pattern Classification , 1973 .

[30]  Celio Pasquini,et al.  Classification of Brazilian soils by using LIBS and variable selection in the wavelet domain. , 2009, Analytica chimica acta.

[32]  Beata Walczak,et al.  Comprehensive Chemometrics: Set: Chemical and Biochemical Data Analysis , 2009 .

[33]  M. Wright Real Time Imaging , 2005 .

[34]  A. Savitzky,et al.  Smoothing and Differentiation of Data by Simplified Least Squares Procedures. , 1964 .

[35]  Y. Ling,et al.  Determination of Δ9-tetrahydrocannabinol in indoor air as an indicator of marijuana cigarette smoking using adsorbent sampling and in-injector thermal desorption gas chromatography–mass spectrometry , 2007 .

[36]  K. Héberger,et al.  Supervised pattern recognition in food analysis. , 2007, Journal of chromatography. A.

[37]  Mariko Ishiwatari,et al.  J. Anal. Appl. Pyrolysis , 1995 .

[38]  Roberto Kawakami Harrop Galvão,et al.  The successive projections algorithm for spectral variable selection in classification problems , 2005 .

[39]  J. Nóbrega,et al.  Multivariate classification of cigarettes according to their elemental content determined by inductively coupled plasma optical emission spectrometry. , 2007, Analytical sciences : the international journal of the Japan Society for Analytical Chemistry.