Brand classification of detergent powder using near-infrared spectroscopy and extreme learning machines

Abstract Detergent powder is a necessity in daily life and has a wide range of sales worldwide. To obtain illegal benefits, immoral merchants often sell inferior detergent powder labeled as famous brands. Thus, there is a pressing need for developing convenient methods able to identify different brands of detergent powder. In this study, near-infrared (NIR) spectroscopy is applied for the identification of 6 different brands of detergent powder. Both extreme learning machine (ELM) and its ensemble (EELM) are used for constructing predictive models. A total 180 samples belonged to 6 different brands are prepared for experiment. A simple filter algorithm, i.e., Relief, was applied to select informative features/variables. Principal component analysis (PCA) is used for preliminary exploratory analysis. All samples were divided into subsets with equal size for training, optimizing and testing the predictive model, respectively. The model from the EELM algorithm achieved 100% accuracy on both the training and test sets, which is superior to the model from the ELM algorithm. It indicates that NIR spectroscopy combined with feature compression and the EELM algorithm is a potential tool for identifying the brands of detergent powder. .

[1]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[2]  Hongming Zhou,et al.  Extreme Learning Machine for Regression and Multiclass Classification , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[3]  Ferat Sahin,et al.  A survey on feature selection methods , 2014, Comput. Electr. Eng..

[4]  Jing J. Liang,et al.  Two-hidden-layer extreme learning machine for regression and classification , 2016, Neurocomputing.

[5]  D. Massart,et al.  Near-infrared spectroscopy applications in pharmaceutical analysis. , 2007, Talanta.

[6]  Hui Chen,et al.  Non-destructive identification of native egg by near-infrared spectroscopy and data driven-based class-modeling. , 2019, Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy.

[7]  O. S. Toker,et al.  Rapid detection of adulteration of cold pressed sesame oil adultered with hazelnut, canola, and sunflower oils using ATR-FTIR spectroscopy combined with chemometric , 2017 .

[8]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[9]  A. B. Garmarudi,et al.  Simultaneous Determination of Sodium Tripolyphosphate, Sodium Sulfate and Linear Alkylbenzensulfonate in Washing Powder by Attenuated Total Reflectance: Fourier Transform Infrared Spectrometry , 2007 .

[10]  Muchenxuan Tong,et al.  An ensemble of SVM classifiers based on gene pairs , 2013, Comput. Biol. Medicine.

[11]  José Manuel Amigo,et al.  Identification and quantification of turkey meat adulteration in fresh, frozen-thawed and cooked minced beef by FT-NIR spectroscopy and chemometrics. , 2016, Meat science.

[12]  David H. Wolpert,et al.  No free lunch theorems for optimization , 1997, IEEE Trans. Evol. Comput..

[13]  Xihui Bian,et al.  A boosting extreme learning machine for near-infrared spectral quantitative analysis of diesel fuel and edible blend oil samples , 2017 .

[14]  Mohammadreza Khanmohammadi,et al.  Infrared spectroscopy provides a green analytical chemistry tool for direct diagnosis of cancer , 2011 .

[15]  Chee Kheong Siew,et al.  Extreme learning machine: Theory and applications , 2006, Neurocomputing.

[16]  Qinmin Yang,et al.  Classification of washing powder brands using near-infrared spectroscopy combined with chemometric calibrations. , 2014, Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy.

[17]  Pedro M. Domingos Knowledge Discovery Via Multiple Models , 1998, Intell. Data Anal..

[18]  L. Breiman Population theory for boosting ensembles , 2003 .

[19]  Zou Xiaobo,et al.  Variables selection methods in near-infrared spectroscopy. , 2010, Analytica chimica acta.

[20]  Thomas G. Dietterich An Experimental Comparison of Three Methods for Constructing Ensembles of Decision Trees: Bagging, Boosting, and Randomization , 2000, Machine Learning.

[21]  Hongming Zhou,et al.  Optimization method based extreme learning machine for classification , 2010, Neurocomputing.

[22]  O. G. Meza-Márquez,et al.  Application of mid-infrared spectroscopy with multivariate analysis and soft independent modeling of class analogies (SIMCA) for the detection of adulterants in minced beef. , 2010, Meat science.

[23]  M. Barker,et al.  Partial least squares for discrimination , 2003 .

[24]  K. Héberger,et al.  Quantitative determination and classification of energy drinks using near-infrared spectroscopy , 2016, Analytical and Bioanalytical Chemistry.

[25]  William Perrizo,et al.  Comprehensive vertical sample-based KNN/LSVM classification for gene expression analysis , 2004, J. Biomed. Informatics.

[26]  Peter Tiño,et al.  Managing Diversity in Regression Ensembles , 2005, J. Mach. Learn. Res..

[27]  Marko Robnik-Sikonja,et al.  Theoretical and Empirical Analysis of ReliefF and RReliefF , 2003, Machine Learning.

[28]  Comparative artificial neural network and partial least squares models for analysis of Metronidazole, Diloxanide, Spiramycin and Cliquinol in pharmaceutical preparations. , 2014, Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy.

[29]  Hui Chen,et al.  Express detection of expired drugs based on near-infrared spectroscopy and chemometrics: A feasibility study. , 2019, Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy.

[30]  Hui Chen,et al.  Ensemble of extreme learning machines for multivariate calibration of near-infrared spectroscopy. , 2019, Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy.