Method for Classifying a Noisy Raman Spectrum Based on a Wavelet Transform and a Deep Neural Network

Because it is relatively difficult in practice to classify the Raman spectrum under baseline noise and additive white Gaussian noise environments, this paper proposes a new framework based on a wavelet transform and deep neural network for identification of noisy Raman spectra. The framework consists of two main engines. Wavelet transform is proposed as the framework front end for transforming the 1-D noise Raman spectrum to two-dimensional data. The two-dimensional data are fed to the framework back end, which is a classifier. The optimum classifier is chosen by implementing several traditional machine learning (ML) and deep learning (DL) algorithms, and we investigate their classification accuracy and robustness performances. The four chosen MLs are naive Bayes (NB), a support vector machine (SVM), a random forest (RF) and a k-nearest neighbor (KNN), and a deep convolution neural network (DCNN) was chosen as a DL classifier. Noise-free, Gaussian noise, baseline noise, and mixed-noise Raman spectra were applied to train and validate the ML and DCNN models. The optimum back-end classifier was obtained by testing the ML and DCNN models with several noisy Raman spectra (10-30 dB noise power). Based on the simulation, the accuracy of the DCNN classifier is 9% higher than that of the NB classifier, 3.5% higher than the RF classifier, 1% higher than the KNN classifier, and 0.5% higher than the SVM classifier. In terms of robustness to mixed noise scenarios, the framework with the DCNN back end showed superior performance compared with the other ML back ends. The DCNN back end achieved 90% accuracy at 3 dB SNR, while the NB, SVM, RF, and K-NN back ends required 27 dB, 22 dB, 27 dB, and 23 dB SNR, respectively. In addition, in the low-noise test dataset, the F-measure score of the DCNN back end exceeded 99.1%, and the F-measure scores of the other ML engines were below 98.7%.

[1]  D. A. Stuart,et al.  Surface-enhanced Raman spectroscopy of half-mustard agent. , 2006, The Analyst.

[2]  P. Vandenabeele,et al.  Reference database of Raman spectra of biological molecules , 2007 .

[3]  Yoshihiro Maruyama,et al.  Feature visualization of Raman spectrum analysis with deep convolutional neural network , 2019, Analytica chimica acta.

[4]  F. Tuinstra,et al.  Raman Spectrum of Graphite , 1970 .

[5]  F Ehrentreich,et al.  Spike removal and denoising of Raman spectra by wavelet transform methods. , 2001, Analytical chemistry.

[6]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[7]  Kun Liu,et al.  A Probabilistic Process Neural Network and Its Application in ECG Classification , 2019, IEEE Access.

[8]  Nataliia Kussul,et al.  Deep Learning Classification of Land Cover and Crop Types Using Remote Sensing Data , 2017, IEEE Geoscience and Remote Sensing Letters.

[9]  Stephen T. C. Wong,et al.  Combining deep learning and coherent anti-Stokes Raman scattering imaging for automated differential diagnosis of lung cancer , 2017, Journal of biomedical optics.

[10]  Jianyu Long,et al.  A Novel Sparse Echo Autoencoder Network for Data-Driven Fault Diagnosis of Delta 3-D Printers , 2020, IEEE Transactions on Instrumentation and Measurement.

[11]  Jun-Qyu Park,et al.  Fast and sensitive recognition of various explosive compounds using Raman spectroscopy and principal component analysis , 2013, Defense, Security, and Sensing.

[12]  Ling Jian,et al.  A weighted SVM ensemble predictor based on AdaBoost for blast furnace Ironmaking process , 2020, Applied Intelligence.

[13]  Bin Yao,et al.  ECG Arrhythmia Classification Using STFT-Based Spectrogram and Convolutional Neural Network , 2019, IEEE Access.

[14]  Hajar Mousannif,et al.  A real-time crash prediction fusion framework: An imbalance-aware strategy for collision avoidance systems , 2020 .

[15]  Muhammad Ghulam,et al.  Deep Learning for EEG motor imagery classification based on multi-layer CNNs feature fusion , 2019, Future Gener. Comput. Syst..

[16]  Qing-Jiang Chen,et al.  Numerical solution of differential equations by using Haar wavelets , 2007, 2007 International Conference on Wavelet Analysis and Pattern Recognition.

[17]  Ricardo Mendoza-González,et al.  Improving Sensitivity and Specificity in Breast Cancer Detection Using Raman Spectroscopy and Bayesian Classification , 2015 .

[18]  Wei Zheng,et al.  Classification of colonic tissues using near-infrared Raman spectroscopy and support vector machines. , 2008, International journal of oncology.

[19]  K. Castro,et al.  On-line FT-Raman and dispersive Raman spectra database of artists’ materials (e-VISART database) , 2005, Analytical and bioanalytical chemistry.

[20]  David A. Landgrebe,et al.  A survey of decision tree classifier methodology , 1991, IEEE Trans. Syst. Man Cybern..

[21]  Abien Fred Agarap Deep Learning using Rectified Linear Units (ReLU) , 2018, ArXiv.

[22]  Xiangquan Zheng,et al.  A practical convolutional neural network model for discriminating Raman spectra of human and animal blood , 2019, Journal of Chemometrics.

[23]  Vineet Kansal,et al.  A benchmarking framework using nonlinear manifold detection techniques for software defect prediction , 2020, Int. J. Comput. Sci. Eng..

[24]  Jianyu Long,et al.  Attitude data-based deep hybrid learning architecture for intelligent fault diagnosis of multi-joint industrial robots , 2020 .

[25]  Yi-Zeng Liang,et al.  Baseline correction using adaptive iteratively reweighted penalized least squares. , 2010, The Analyst.

[26]  Haoyan Guo,et al.  The Application of Mexican Hat Wavelet Filtering and Averaging Algorithm in Raman Spectra Denoising , 2008, 2008 Congress on Image and Signal Processing.

[27]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  H. Edwards,et al.  Raman spectra of carotenoids in natural products. , 2003, Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy.

[29]  Yoshua Bengio,et al.  Why Does Unsupervised Pre-training Help Deep Learning? , 2010, AISTATS.

[30]  W. Mansor,et al.  Reduced Featured k-NN Classifier Model Optimal for Classification of Dengue Fever from Salivary Raman Spectra , 2019, 2019 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[31]  O. Nielsen,et al.  Near‐infrared Fourier transform Raman, surface‐enhanced Raman scattering and Fourier transform infrared spectra and ab initio calculations of the natural product nodakenetin angelate , 2005 .

[32]  K. Poulose Jacob,et al.  COMBINED FEATURE EXTRACTION TECHNIQUES AND NAIVE BAYES CLASSIFIER FOR SPEECH RECOGNITION , 2013 .

[33]  R. Bonner,et al.  Application of wavelet transforms to experimental spectra : Smoothing, denoising, and data set compression , 1997 .

[34]  R. Sharma,et al.  Signal processing of Raman signatures and realtime identification of hazardous molecules using continuous wavelet transformation (CWT) , 2015, 2015 International Conference on Signal Processing and Communication Engineering Systems.

[35]  Wei Zhang,et al.  Baseline correction for Raman spectra using an improved asymmetric least squares method , 2014 .

[36]  Miroslav Kvassay,et al.  Application of Fuzzy Decision Tree for Signal Classification , 2019, IEEE Transactions on Industrial Informatics.

[37]  Aaron Park,et al.  Baseline correction using asymmetrically reweighted penalized least squares smoothing. , 2015, The Analyst.

[38]  Rodrigo Stephani,et al.  Fourier-transform Raman analysis of milk powder: a potential method for rapid quality screening , 2011 .

[39]  Feng Xu,et al.  E-commerce product review sentiment classification based on a naïve Bayes continuous learning framework , 2020, Inf. Process. Manag..

[40]  Margarita Osadchy,et al.  Deep Convolutional Neural Networks for Raman Spectrum Recognition: A Unified Solution , 2017, The Analyst.

[41]  Mwaffaq Otoom,et al.  An IoT-based framework for early identification and monitoring of COVID-19 cases , 2020, Biomedical Signal Processing and Control.

[42]  Stephan Seifert,et al.  Application of random forest based approaches to surface-enhanced Raman scattering data , 2020, Scientific Reports.

[43]  Ajay Rana,et al.  A Nonlinear Manifold Detection based Model for Software Defect Prediction , 2018 .