End‐to‐end analysis modeling of vibrational spectroscopy based on deep learning approach

The characteristics of the spectral data are essential for the qualitative analysis of substances. Traditional classification models often need to preprocess the data. However, misuse of preprocessing may change the characteristic information carried by the original data which result in poor model performance. This paper proposes an end‐to‐end deep learning method that combines residual modules to learn features from raw data to improve model performance, which called ResidualSpectra. ResidualSpectra model is compared with three convolutional neural network (CNN) models on the original data. The 15 preprocessing approaches are used to evaluate the preprocessing impact by testing five open‐access mid‐infrared, near‐infrared, and Raman spectra datasets (fruits, meats, olive_oils, Tablets_Nir, Tablets_Raman). In most cases, the ResidualSpectra model performs better than the other three CNN models on five datasets and obtains better results in original data than in preprocessed data. The model is compared with linear discriminant analysis (LDA), nonlinear artificial neural network (ANN), support vector machines (SVM) for original and preprocessed data. The results show that the ResidualSpectra method provides improved results over traditional classification methods in most scenarios.

[1]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Meijun Sun,et al.  How to predict the sugariness and hardness of melons: A near-infrared hyperspectral imaging method. , 2017, Food chemistry.

[3]  E. K. Kemsley,et al.  Use of Fourier transform infrared spectroscopy and partial least squares regression for the detection of adulteration of strawberry purées , 1998 .

[4]  Hossein Baharvand,et al.  Automatic white blood cell classification using pre-trained deep learning models: ResNet and Inception , 2018, International Conference on Machine Vision.

[5]  Chuyi Wang,et al.  Attention-based multi-scale transfer ResNet for skull fracture image classification , 2019, International Workshop on Pattern Recognition.

[6]  Margarita Osadchy,et al.  Deep Convolutional Neural Networks for Raman Spectrum Recognition: A Unified Solution , 2017, The Analyst.

[7]  D. A. Stuart,et al.  Surface Enhanced Raman Spectroscopy: New Materials, Concepts, Characterization Tools, and Applications , 2005 .

[8]  E. K. Kemsley,et al.  FTIR spectroscopy and multivariate analysis can distinguish the geographic origin of extra virgin olive oils. , 2003, Journal of agricultural and food chemistry.

[9]  Andrew Jirasek,et al.  Investigation of Selected Baseline Removal Techniques as Candidates for Automated Implementation , 2005, Applied spectroscopy.

[10]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[11]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[12]  D. E. Aston,et al.  Multivariate Analysis of Micro-Raman Spectra of Thermoplastic Polyurethane Blends Using Principal Component Analysis and Principal Component Regression , 2012, Applied spectroscopy.

[13]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[14]  Jens Petter Wold,et al.  Raman Spectra of Biological Samples: A Study of Preprocessing Methods , 2006, Applied spectroscopy.

[15]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[16]  Yuan‐Yuan Chen,et al.  End‐to‐end quantitative analysis modeling of near‐infrared spectroscopy based on convolutional neural network , 2019, Journal of Chemometrics.

[17]  Jan Gerretzen,et al.  Simple and Effective Way for Data Preprocessing Selection Based on Design of Experiments. , 2015, Analytical chemistry.

[18]  Zhenqi Shi,et al.  Development of Near Infrared Spectroscopy-based Process Monitoring Methodology for Pharmaceutical Continuous Manufacturing Using an Offline Calibration Approach. , 2017, Analytical chemistry.

[19]  Manel Alcalà,et al.  Calibration sets selection strategy for the construction of robust PLS models for prediction of biodiesel/diesel blends physico-chemical properties using NIR spectroscopy. , 2017, Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy.

[20]  Lutgarde M. C. Buydens,et al.  Breaking with trends in pre-processing? , 2013 .

[21]  M. Dyrby,et al.  Chemometric Quantitation of the Active Substance (Containing C≡N) in a Pharmaceutical Tablet Using Near-Infrared (NIR) Transmittance and NIR FT-Raman Spectra , 2002 .

[22]  Elena Marchiori,et al.  Convolutional neural networks for vibrational spectroscopic data analysis. , 2017, Analytica chimica acta.

[23]  R. Dasari,et al.  Prospects for in vivo Raman spectroscopy , 2000 .

[24]  Mike Preuss,et al.  Planning chemical syntheses with deep neural networks and symbolic AI , 2017, Nature.

[25]  Farid Melgani,et al.  One‐dimensional convolutional neural networks for spectroscopic signal regression , 2018 .

[26]  Kushtrim Bresilla,et al.  Single-Shot Convolution Neural Networks for Real-Time Fruit Detection Within the Tree , 2019, Front. Plant Sci..

[27]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Frans van den Berg,et al.  Review of the most common pre-processing techniques for near-infrared spectra , 2009 .

[29]  Wenyong Wang,et al.  An efficient instance selection algorithm to reconstruct training set for support vector machine , 2017, Knowl. Based Syst..

[30]  Yibin Ying,et al.  DeepSpectra: An end-to-end deep learning approach for quantitative spectral analysis. , 2019, Analytica chimica acta.

[31]  Tao Zhang,et al.  Tongue squamous cell carcinoma discrimination with Raman spectroscopy and convolutional neural networks , 2019, Vibrational Spectroscopy.

[32]  Guoyan Zheng,et al.  Holistic Decomposition Convolution for Effective Semantic Segmentation of 3D MR Images , 2018, ArXiv.

[33]  E. K. Kemsley,et al.  Mid-infrared spectroscopy and authenticity problems in selected meats: a feasibility study , 1997 .