Breathing Sound Segmentation and Detection Using Transfer Learning Techniques on an Attention-Based Encoder-Decoder Architecture

This paper focuses on the use of an attention-based encoder-decoder model for the task of breathing sound segmentation and detection. This study aims to accurately segment the inspiration and expiration of patients with pulmonary diseases using the proposed model. Spectrograms of the lung sound signals and labels for every time segment were used to train the model. The model would first encode the spectrogram and then detect inspiratory or expiratory sounds using the encoded image on an attention-based decoder. Physicians would be able to make a more precise diagnosis based on the more interpretable outputs with the assistance of the attention mechanism.The respiratory sounds used for training and testing were recorded from 22 participants using digital stethoscopes or anti-noising microphone sets. Experimental results showed a high 92.006% accuracy when applied 0.5 second time segments and ResNet101 as encoder. Consistent performance of the proposed method can be observed from ten-fold cross-validation experiments.

[1]  Özkan Kiliç,et al.  Classification of lung sounds using convolutional neural networks , 2017, EURASIP Journal on Image and Video Processing.

[2]  Qiao Li,et al.  Classification of normal/abnormal heart sound recordings: The PhysioNet/Computing in Cardiology Challenge 2016 , 2016, 2016 Computing in Cardiology Conference (CinC).

[3]  Yoshua Bengio,et al.  Show, Attend and Tell: Neural Image Caption Generation with Visual Attention , 2015, ICML.

[4]  Kofi Odame,et al.  DeepCough: A deep convolutional neural network in a wearable cough detection system , 2015, 2015 IEEE Biomedical Circuits and Systems Conference (BioCAS).

[5]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Grega Vrbancic,et al.  Automatic Detection of Heartbeats in Heart Sound Signals Using Deep Convolutional Neural Networks , 2019 .

[7]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[8]  Kun Zhang,et al.  Lung sounds classification using convolutional neural networks , 2018, Artif. Intell. Medicine.

[9]  Valentyn Vaityshyn,et al.  Pre-trained Convolutional Neural Networks for the Lung Sounds Classification , 2019, 2019 IEEE 39th International Conference on Electronics and Nanotechnology (ELNANO).

[10]  Cristina Jácome,et al.  Convolutional Neural Network for Breathing Phase Detection in Lung Sounds , 2019, Sensors.

[11]  Tara N. Sainath,et al.  State-of-the-Art Speech Recognition with Sequence-to-Sequence Models , 2017, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[12]  Mounya Elhilali,et al.  Audio object classification using distributed beliefs and attention , 2020, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[13]  Annamaria Mesaros,et al.  Metrics for Polyphonic Sound Event Detection , 2016 .

[14]  三平 将彦 頭蓋内圧に及ぼすInverse Ratio Ventilationの影響 , 2000 .

[15]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.