Rapid determination of Panax notoginseng origin by terahertz spectroscopy combined with the machine learning method

Abstract Panax notoginseng is a valuable herb with geographical indication, and the quality and price of P. notoginseng from different origins are very different. Therefore, this paper proposes a rapid and accurate method for identifying the origins of P. notoginseng by collecting the roots of P. notoginseng. This paper improves the whale optimization algorithm in terms of global convergence and convergence speed, introduces the Levy flight strategy and reconstructed whale synergy factor A, and applies it to the parameter optimization of support vector machines, to obtain a high-performance classification model. The improved whale optimization algorithm model identifies the origin of P. notoginseng by discriminating their terahertz spectra. Compared with the commonly used genetic algorithm and the original whale optimization algorithm, improvement in the whale optimization algorithm was able to avoid falling into local optimum solutions more effectively while having a high convergence rate. Accordingly, the improved whale optimization algorithm optimized support vector machine model obtained an overall accuracy of 98.44%, which was significantly higher than the 95.31% overall accuracy of the genetic algorithm optimized support vector machine model and the 96.88% overall accuracy of the whale optimization algorithm optimized support vector machine model. It was concluded that terahertz spectroscopy together with machine learning would be a promising technique for identifying the origins of P. notoginseng.

[1]  Jia-ling Wang,et al.  Identification of Panax Notoginseng Powder in Different Parts Based on the Electronic Nose and Time-Domain Feature Extraction , 2022, Journal of the Electrochemical Society.

[2]  Lei Shi,et al.  Research on Classification Model of Panax notoginseng Taproots Based on Machine Vision Feature Fusion , 2021, Sensors.

[3]  Guihua Jiang,et al.  Discrimination of authenticity of Fritillariae Cirrhosae Bulbus based on terahertz spectroscopy and chemometric analysis , 2021 .

[4]  Wei Wang,et al.  Application of terahertz dielectric constant spectroscopy for discrimination of oxidized coal and unoxidized coal by machine learning algorithms , 2021, Fuel.

[5]  Huo Zhang,et al.  Classification of Ginseng with different growth ages based on terahertz spectroscopy and machine learning algorithm , 2021 .

[6]  Zhihua Zhou,et al.  High-level sustainable production of the characteristic protopanaxatriol-type saponins from Panax species in engineered Saccharomyces cerevisiae. , 2021, Metabolic engineering.

[7]  Fang Wang,et al.  Identification of geographical origins of Panax notoginseng based on HPLC multi-wavelength fusion profiling combined with average linear quantitative fingerprint method , 2021, Scientific Reports.

[8]  Zexuan Zhu,et al.  Quantitative analysis of bisphenol analogue mixtures by terahertz spectroscopy using machine learning method. , 2021, Food chemistry.

[9]  Shiping Zhu,et al.  Terahertz spectroscopy combined with data dimensionality reduction algorithms for quantitative analysis of protein content in soybeans. , 2021, Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy.

[10]  Kexin Xu,et al.  Discrimination of corn variety using Terahertz spectroscopy combined with chemometrics methods. , 2021, Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy.

[11]  Shiping Zhu,et al.  Application of terahertz spectrum and interval partial least squares method in the identification of genetically modified soybeans. , 2020, Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy.

[12]  Yu Zhou,et al.  Origin identification of Panax notoginseng by multi-sensor information fusion strategy of infrared spectra combined with random forest. , 2020, Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy.

[13]  I. Mercuriani,et al.  Molecular Identification of Several Orchid Species Based on OPA10 and OPA18 RAPD Marker , 2019, Journal of Physics: Conference Series.

[14]  Haiying Wang,et al.  Combining DNA Barcoding and HPLC Fingerprints to Trace Species of an Important Traditional Chinese Medicine Fritillariae Bulbus , 2019, Molecules.

[15]  Z. Cantillo-Ciau,et al.  HPLC profile and simultaneous quantitative analysis of tingenone and pristimerin in four Celastraceae species using HPLC-UV-DAD-MS , 2019, Revista Brasileira de Farmacognosia.

[16]  Vijander Singh,et al.  A novel nature-inspired algorithm for optimization: Squirrel search algorithm , 2019, Swarm Evol. Comput..

[17]  Jie Liang,et al.  Reliable origin identification of Scutellaria baicalensis based on terahertz time-domain spectroscopy and pattern recognition , 2018, Optik.

[18]  Fangrong Hu,et al.  Sensitive distinction between herbs by terahertz spectroscopy and a metamaterial resonator , 2018 .

[19]  Huo Zhang,et al.  Discrimination of traditional herbal medicines based on terahertz spectroscopy , 2017 .

[20]  Wei Liu,et al.  Application of terahertz spectroscopy imaging for discrimination of transgenic rice seeds with chemometrics. , 2016, Food chemistry.

[21]  Andrew Lewis,et al.  The Whale Optimization Algorithm , 2016, Adv. Eng. Softw..

[22]  Fangrong Hu,et al.  Discrimination of genetically modified sugar beets based on terahertz spectroscopy. , 2016, Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy.

[23]  Jie Wang,et al.  Protective Effects of Panax Notoginseng Saponins on Cardiovascular Diseases: A Comprehensive Overview of Experimental Studies , 2014, Evidence-based complementary and alternative medicine : eCAM.

[24]  H. Paik,et al.  Morphological Characterization, Chemical Components, and Biofunctional Activities of Panax ginseng, Panax quinquefolium, and Panax notoginseng Roots: A Comparative Study , 2014 .

[25]  Qin Li,et al.  Morphological and microscopic identification studies of Cordyceps and its counterfeits , 2011 .

[26]  Li-Jun Ni,et al.  Pattern recognition of Chinese flue-cured tobaccos by an improved and simplified K-nearest neighbors classification algorithm on near infrared spectra. , 2009, Analytica chimica acta.

[27]  Hua Li,et al.  Total saponins of Panax notoginseng modulate the expression of caspases and attenuate apoptosis in rats following focal cerebral ischemia-reperfusion. , 2009, Journal of ethnopharmacology.

[28]  T B Ng,et al.  Pharmacological activity of sanchi ginseng (Panax notoginseng) , 2006, The Journal of pharmacy and pharmacology.

[29]  Chong-Zhi Wang,et al.  Phytochemical and analytical studies of Panax notoginseng (Burk.) F.H. Chen , 2006, Journal of Natural Medicines.

[30]  B. Tomlinson,et al.  Protective effects of trilinolein extracted from panax notoginseng against cardiovascular disease. , 2002, Acta pharmacologica Sinica.

[31]  G. Bao,et al.  Modeling the effects of biodiesel chemical composition on iodine value using novel machine learning algorithm , 2022, Fuel.

[32]  T. Ng,et al.  Bioactive Proteins in Panax notoginseng Roots and Other Panax Species. , 2019, Current protein & peptide science.

[33]  Yingjin Yuan,et al.  Antioxidant activities of Salvia miltiorrhiza and Panax notoginseng , 2006 .

[34]  A. V.DavidSánchez,et al.  Advanced support vector machines and kernel methods , 2003, Neurocomputing.