Peak Detection and Baseline Correction Using a Convolutional Neural Network

Peak detection and localization in a noisy signal with an unknown baseline is a fundamental task in signal processing applications such as spectroscopy. A current trend in signal processing is to reformulate traditional processing pipelines as (deep) neural networks that can be trained end-to-end. A trainable algorithm for baseline removal and peak localization can serve as an important module in such a processing pipeline. In practical applications, one of the most successful approaches to joint baseline suppression and peak localization is based on the continuous wavelet transform: We re-formulate this as a convolutional neural network (CNN) followed by a non-linear readout layer. On a synthetic benchmark we demonstrate that with sufficient training data, the CNN approach consistently outperforms the optimized continuous wavelet method by means of adapting to the spectral peak shape, noise level, and characteristics of the baseline. The CNN approach to peak localization shows great promise, as it can more efficiently leverage data to outperform the current state of the art, and can readily be extended and incorporated as a module in a larger neural network architecture.

[1]  I. Selesnick,et al.  Chromatogram baseline estimation and denoising using sparsity (BEADS) , 2014 .

[2]  Jiangtao Peng,et al.  Baseline correction combined partial least squares algorithm and its application in on-line Fourier transform infrared quantitative analysis. , 2011, Analytica chimica acta.

[3]  Robert W. Field,et al.  Baseline subtraction using robust local regression estimation , 2001 .

[4]  Mark J. Shensa,et al.  The discrete wavelet transform: wedding the a trous and Mallat algorithms , 1992, IEEE Trans. Signal Process..

[5]  Pan Du,et al.  Bioinformatics Original Paper Improved Peak Detection in Mass Spectrum by Incorporating Continuous Wavelet Transform-based Pattern Matching , 2022 .

[6]  M. Mehicic,et al.  Practical Raman Spectroscopy : Springer Verlag, Berlin, 1989 (ISBN 3-540-50254-8). viii + 157 pp. Price DM 78.00 , 1990 .

[7]  Chao Yang,et al.  Comparison of public peak detection algorithms for MALDI mass spectrometry data analysis , 2009, BMC Bioinformatics.

[8]  Pierre Grangeat,et al.  Linear MALDI-ToF simultaneous spectrum deconvolution and baseline removal , 2018, BMC Bioinformatics.

[9]  Theo Lynn,et al.  Evaluation of peak detection algorithms for social media event detection , 2015, 2015 10th International Workshop on Semantic and Social Media Adaptation and Personalization (SMAP).

[10]  Lars Kai Hansen,et al.  Detection of skin cancer by classification of Raman spectra , 2004, IEEE Transactions on Biomedical Engineering.

[11]  Mark Girolami,et al.  Bayesian modelling and quantification of Raman spectroscopy , 2016, 1604.07299.

[12]  A. Savitzky,et al.  Smoothing and Differentiation of Data by Simplified Least Squares Procedures. , 1964 .

[13]  Mark B. Sandler,et al.  A tutorial on onset detection in music signals , 2005, IEEE Transactions on Speech and Audio Processing.