Classification of oolong tea varieties based on hyperspectral imaging technology and BOSS‐LightGBM model

A fast and nondestructive detection method based on hyperspectral imaging technology (HSI) was investigated in this study to discriminate different oolong tea varieties. Five varieties of oolong tea were taken as the research object. Multiplicative scatter correction was used to reduce the influence of noise in the raw spectra. Then competitive adaptive reweighted sampling and bootstrapping soft shrinkage (BOSS) were applied, respectively, to select characteristic wavelengths. Extreme gradient boosting (XGBoost) and light gradient boosting machine (LightGBM) were individually utilized to establish classification models. Finally, the BOSS‐LightGBM model for discriminating tea varieties achieved the best performance, with the accuracy of 100% in the training set and 97.33% in the prediction set. Therefore, it is feasible to use HSI technology coupled with the BOSS‐LightGBM model for the classification of oolong tea varieties. PRACTICAL APPLICATIONS: Tieguanyin tea is a high value commodity in the tea market. Replacing Tieguanyin tea with cheaper oolong tea varieties is a common way utilized by illegal traders to maximize profit. Traditional methods for identifying tea varieties are time‐consuming and destructive, and are thus unable to meet the requirements of modern agriculture. In this study, hyperspectral imaging technology (HSI) was applied to realize the fast and nondestructive detection of tea varieties. The final results show that using HSI technology to discriminate different oolong tea varieties is feasible, and also provide a theoretical basis for the design of a portable tea variety detection device.

[1]  Jiangbing Li,et al.  Oolong tea made from tea plants from different locations in Yunnan and Fujian, China showed similar aroma but different taste characteristics , 2016, SpringerPlus.

[2]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[3]  Z. Pan,et al.  Discrimination of oolong tea (Camellia sinensis) varieties based on feature extraction and selection from aromatic profiles analysed by HS-SPME/GC-MS. , 2013, Food chemistry.

[4]  Jifeng Shen,et al.  Nondestructive identification of green tea varieties based on hyperspectral imaging technology , 2018 .

[5]  Shigeki Nakauchi,et al.  Noninvasive sensing of thermal treatments of Japanese seafood products using imaging spectroscopy , 2015 .

[6]  Dong-Sheng Cao,et al.  A bootstrapping soft shrinkage approach for variable selection in chemical modeling. , 2016, Analytica chimica acta.

[7]  Jen-kun Lin,et al.  Fermentation process enhanced production and bioactivities of oolong tea polysaccharides , 2012 .

[8]  Hongdong Li,et al.  Key wavelengths screening using competitive adaptive reweighted sampling method for multivariate calibration. , 2009, Analytica chimica acta.

[9]  Yong He,et al.  [Identification of green tea brand based on hyperspectra imaging technology]. , 2014, Guang pu xue yu guang pu fen xi = Guang pu.

[10]  Jun Sun,et al.  Visualizing distribution of moisture content in tea leaves using optimization algorithms and NIR hyperspectral imaging , 2019, Comput. Electron. Agric..

[11]  J. Tzen,et al.  Effects of baking and aging on the changes of phenolic and volatile compounds in the preparation of old Tieguanyin oolong teas , 2013 .

[12]  Roger W. Johnson,et al.  An Introduction to the Bootstrap , 2001 .

[13]  Quansheng Chen,et al.  Recent developments of green analytical techniques in analysis of tea's quality and nutrition , 2015 .

[14]  Dong-Sheng Cao,et al.  Model-population analysis and its applications in chemical and biological modeling , 2012 .

[15]  Jianan Huang,et al.  Comparison of catechins and volatile compounds among different types of tea using high performance liquid chromatograph and gas chromatograph mass spectrometer , 2011 .

[16]  Jiewen Zhao,et al.  Nondestructive quantifying total volatile basic nitrogen (TVB-N) content in chicken using hyperspectral imaging (HSI) technique combined with different data dimension reduction algorithms. , 2016, Food chemistry.