Convolutional neural networks for vibrational spectroscopic data analysis.

In this work we show that convolutional neural networks (CNNs) can be efficiently used to classify vibrational spectroscopic data and identify important spectral regions. CNNs are the current state-of-the-art in image classification and speech recognition and can learn interpretable representations of the data. These characteristics make CNNs a good candidate for reducing the need for preprocessing and for highlighting important spectral regions, both of which are crucial steps in the analysis of vibrational spectroscopic data. Chemometric analysis of vibrational spectroscopic data often relies on preprocessing methods involving baseline correction, scatter correction and noise removal, which are applied to the spectra prior to model building. Preprocessing is a critical step because even in simple problems using 'reasonable' preprocessing methods may decrease the performance of the final model. We develop a new CNN based method and provide an accompanying publicly available software. It is based on a simple CNN architecture with a single convolutional layer (a so-called shallow CNN). Our method outperforms standard classification algorithms used in chemometrics (e.g. PLS) in terms of accuracy when applied to non-preprocessed test data (86% average accuracy compared to the 62% achieved by PLS), and it achieves better performance even on preprocessed test data (96% average accuracy compared to the 89% achieved by PLS). For interpretability purposes, our method includes a procedure for finding important spectral regions, thereby facilitating qualitative interpretation of results.

[1]  Léon Bottou,et al.  Large-Scale Machine Learning with Stochastic Gradient Descent , 2010, COMPSTAT.

[2]  Yoshua Bengio,et al.  Random Search for Hyper-Parameter Optimization , 2012, J. Mach. Learn. Res..

[3]  R. A. van den Berg,et al.  Centering, scaling, and transformations: improving the biological information content of metabolomics data , 2006, BMC Genomics.

[4]  Nikolaos Doulamis,et al.  Deep supervised learning for hyperspectral data classification through convolutional neural networks , 2015, 2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[5]  N. Meinshausen,et al.  Stability selection , 2008, 0809.2932.

[6]  Rohit Bhargava,et al.  Using Fourier transform IR spectroscopy to analyze biological materials , 2014, Nature Protocols.

[7]  Laxman Yetukuri,et al.  Algorithms and tools for the preprocessing of LC–MS metabolomics data , 2011 .

[8]  Landulfo Silveira,et al.  Identification of Different Forms of Cocaine and Substances Used in Adulteration Using Near‐infrared Raman Spectroscopy and Infrared Absorption Spectroscopy , 2015, Journal of forensic sciences.

[9]  Desire L. Massart,et al.  The robust normal variate transform for pattern recognition with near-infrared data , 1999 .

[10]  A. Savitzky,et al.  Smoothing and Differentiation of Data by Simplified Least Squares Procedures. , 1964 .

[11]  Andrea D. Magrì,et al.  Artificial neural networks in chemometrics: History, examples and perspectives , 2008 .

[12]  M. Barker,et al.  Partial least squares for discrimination , 2003 .

[13]  M. Baker,et al.  Developing and Understanding Biofluid Vibrational Spectroscopy: A Critical Review , 2016 .

[14]  P. Geladi,et al.  Linearization and Scatter-Correction for Near-Infrared Reflectance Spectra of Meat , 1985 .

[15]  Tso-Jung Yen,et al.  Discussion on "Stability Selection" by Meinshausen and Buhlmann , 2010 .

[16]  Romain Briandet,et al.  Discrimination of Arabica and Robusta in Instant Coffee by Fourier Transform Infrared Spectroscopy and Chemometrics , 1996 .

[17]  Fernando De la Torre,et al.  Optimal feature selection for support vector machines , 2010, Pattern Recognit..

[18]  Aaron Park,et al.  Baseline correction using asymmetrically reweighted penalized least squares smoothing. , 2015, The Analyst.

[19]  Roman M. Balabin,et al.  Near-infrared (NIR) spectroscopy for motor oil classification: From discriminant analysis to support vector machines , 2011 .

[20]  Yoshua Bengio,et al.  Convolutional networks for images, speech, and time series , 1998 .

[21]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[22]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[23]  Lutgarde M. C. Buydens,et al.  Novel unified framework for latent modeling and its interpretation , 2015 .

[24]  Roman M. Balabin,et al.  Variable selection in near-infrared spectroscopy: benchmarking of feature selection methods on biodiesel data. , 2011, Analytica chimica acta.

[25]  R. Tibshirani,et al.  Generalized Additive Models: Some Applications , 1987 .

[26]  Jasper Engel,et al.  Confirmation of brand identity of a Trappist beer by mid-infrared spectroscopy coupled with multivariate data analysis. , 2012, Talanta.

[27]  M. Baker,et al.  Developing and understanding biofluid vibrational spectroscopy: a critical review. , 2016, Chemical Society reviews.

[28]  Lutgarde M. C. Buydens,et al.  Breaking with trends in pre-processing? , 2013 .

[29]  Klaus-Robert Müller,et al.  Efficient BackProp , 2012, Neural Networks: Tricks of the Trade.

[30]  Shie Mannor,et al.  Robustness and Regularization of Support Vector Machines , 2008, J. Mach. Learn. Res..

[31]  Bryan A. Hanson,et al.  ChemoSpec : An R Package for Chemometric Analysis of Spectroscopic Data and Chromatograms ( Package Version 1 . 61-3 ) , 2013 .

[32]  Yann LeCun,et al.  Convolutional neural networks applied to house numbers digit classification , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[33]  M. Mitreva,et al.  Alpha-gliadin genes from the A, B, and D genomes of wheat contain different sets of celiac disease epitopes , 2006, BMC Genomics.

[34]  E. K. Kemsley,et al.  Mid-infrared spectroscopy and authenticity problems in selected meats: a feasibility study , 1997 .

[35]  E. K. Kemsley,et al.  FTIR spectroscopy and multivariate analysis can distinguish the geographic origin of extra virgin olive oils. , 2003, Journal of agricultural and food chemistry.

[36]  Pascal Vincent,et al.  Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[37]  R. Bro,et al.  Multiblock variance partitioning: a new approach for comparing variation in multiple data blocks. , 2008, Analytica chimica acta.

[38]  Tapani Raiko,et al.  Semi-supervised Learning with Ladder Networks , 2015, NIPS.

[39]  Jamison Greenx Centering , 2004 .

[40]  R. Barnes,et al.  Standard Normal Variate Transformation and De-Trending of Near-Infrared Diffuse Reflectance Spectra , 1989 .

[41]  M. Dyrby,et al.  Chemometric Quantitation of the Active Substance (Containing C≡N) in a Pharmaceutical Tablet Using Near-Infrared (NIR) Transmittance and NIR FT-Raman Spectra , 2002 .

[42]  Rekha Gautam,et al.  Review of multidimensional data processing approaches for Raman and infrared spectroscopy , 2015, EPJ Techniques and Instrumentation.

[43]  Takuji Nishimura,et al.  Mersenne twister: a 623-dimensionally equidistributed uniform pseudo-random number generator , 1998, TOMC.

[44]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[45]  Jan Gerretzen,et al.  Simple and Effective Way for Data Preprocessing Selection Based on Design of Experiments. , 2015, Analytical chemistry.

[46]  Tapani Raiko,et al.  Semi-supervised Learning with Ladder Networks , 2015, NIPS.

[47]  Roman M. Balabin,et al.  Gasoline classification using near infrared (NIR) spectroscopy data: comparison of multivariate techniques. , 2010, Analytica chimica acta.