Training one model to detect heart and lung sound events from single point auscultations

Objective: This work proposes a semi-supervised training approach for detecting lung and heart sounds simultaneously with only one trained model and in invariance to the auscultation point. Methods: We use open-access data from the 2016 Physionet/CinC Challenge, the 2022 George Moody Challenge, and from the lung sound database HF_V1. We first train specialist single-task models using foreground ground truth (GT) labels from different auscultation databases to identify background sound events in the respective lung and heart auscultation databases. The pseudo-labels generated in this way were combined with the ground truth labels in a new training iteration, such that a new model was subsequently trained to detect foreground and background signals. Benchmark tests ensured that the newly trained model could detect both, lung, and heart sound events in different auscultation sites without regressing on the original task. We also established hand-validated labels for the respective background signal in heart and lung sound auscultations to evaluate the models. Results: In this work, we report for the first time results for i) a multi-class prediction for lung sound events and ii) for simultaneous detection of heart and lung sound events and achieve competitive results using only one model. The combined multi-task model regressed slightly in heart sound detection and gained significantly in lung sound detection accuracy with an overall macro F1 score of 39.2% over six classes, representing a 6.7% improvement over the single-task baseline models. Conclusion/Significance: To the best of our knowledge, this is the first approach developed to date for measuring heart and lung sound events invariant to both, the auscultation site and capturing device. Hence, our model is capable of performing lung and heart sound detection from any auscultation location.

[1]  K. Kashino,et al.  BYOL for Audio: Exploring Pre-Trained General-Purpose Audio Representations , 2022, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[2]  Sakiko Mishima,et al.  Impact of data imbalance caused by inactive frames and difference in sound duration on sound event detection performance , 2022, Applied Acoustics.

[3]  Seong-Hu Kim,et al.  Filteraugment: An Acoustic Environmental Data Augmentation Method , 2021, ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[4]  Gari D Clifford,et al.  The CirCor DigiScope Dataset: From Murmur Detection to Murmur Classification , 2021, IEEE Journal of Biomedical and Health Informatics.

[5]  Sridha Sridharan,et al.  Robust and Interpretable Temporal Convolution Network for Event Detection in Lung Sound Recordings , 2021, IEEE Journal of Biomedical and Health Informatics.

[6]  Keisuke Imoto,et al.  Acoustic Scene Classification Using Multichannel Observation with Partially Missing Channels , 2021, 2021 29th European Signal Processing Conference (EUSIPCO).

[7]  Franz Pernkopf,et al.  Crackle Detection In Lung Sounds Using Transfer Learning And Multi-Input Convolutional Neural Networks , 2021, 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC).

[8]  L. Fraiwan,et al.  Recognition of pulmonary diseases from lung sounds using convolutional neural networks and long short-term memory , 2021, Journal of Ambient Intelligence and Humanized Computing.

[9]  F. Lai,et al.  An Update of a Progressively Expanded Database for Automated Lung Sound Analysis , 2021, ArXiv.

[10]  F. Lai,et al.  Benchmarking of eight recurrent neural network variants for breath phase and adventitious sound detection on a self-developed open-access lung sound database—HF_Lung_V1 , 2021, PloS one.

[11]  Ian McLoughlin,et al.  Inception-Based Network and Multi-Spectrogram Ensemble Applied To Predict Respiratory Anomalies and Lung Diseases , 2020, 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC).

[12]  Lam Pham,et al.  Deep Learning Framework Applied For Predicting Anomaly of Respiratory Sounds , 2020, 2021 International Symposium on Electrical and Electronics Engineering (ISEE).

[13]  Rui Pedro Paiva,et al.  Automatic Classification of Adventitious Respiratory Sounds: A (Un)Solved Problem? † , 2020, Sensors.

[14]  Christoph Schorn,et al.  SELD-TCN: Sound Event Localization & Detection via Temporal Convolutional Networks , 2020, 2020 28th European Signal Processing Conference (EUSIPCO).

[15]  Sihan Xu,et al.  Respiratory Sound Classification Based on BiGRU-Attention Network with XGBoost , 2020, 2020 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[16]  Tinoosh Mohsenin,et al.  Neural Networks for Pulmonary Disease Diagnosis using Auditory and Demographic Information , 2020, ArXiv.

[17]  Yi Ma,et al.  LungRN+NL: An Improved Adventitious Lung Sound Classification Using Non-Local Block ResNet Neural Network with Mixup Data Augmentation , 2020, INTERSPEECH.

[18]  Yibo Yin,et al.  Temporal Convolutional Network Connected with an Anti-Arrhythmia Hidden Semi-Markov Model for Heart Sound Segmentation , 2020, Applied Sciences.

[19]  Franz Pernkopf,et al.  Lung Sound Classification Using Snapshot Ensemble of Convolutional Neural Networks , 2020, 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC).

[20]  Ting-Wei Lin,et al.  Breathing Sound Segmentation and Detection Using Transfer Learning Techniques on an Attention-Based Encoder-Decoder Architecture , 2020, 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC).

[21]  Tomoki Toda,et al.  Weakly-Supervised Sound Event Detection with Self-Attention , 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[22]  Yu Wang,et al.  Few-Shot Sound Event Detection , 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[23]  Ian McLoughlin,et al.  Robust Deep Learning Framework For Predicting Respiratory Anomalies and Diseases , 2020, 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC).

[24]  Mark D. Plumbley,et al.  PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition , 2019, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[25]  Mark D. Plumbley,et al.  Sound Event Detection of Weakly Labelled Data With CNN-Transformer and Automatic Threshold Optimization , 2019, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[26]  Kai Yu,et al.  Duration Robust Weakly Supervised Sound Event Detection , 2019, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[27]  Tomoki Toda,et al.  CONVOLUTION-AUGMENTED TRANSFORMER FOR SEMI-SUPERVISED SOUND EVENT DETECTION Technical Report , 2020 .

[28]  Varun Bajaj,et al.  Convolutional neural networks based efficient approach for classification of lung diseases , 2019, Health Information Science and Systems.

[29]  Yugyung Lee,et al.  Lung Disease Classification using Deep Convolutional Neural Network , 2019, 2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[30]  Jian Zhao,et al.  LungBRN: A Smart Digital Stethoscope for Detecting Respiratory Disease Using bi-ResNet Deep Learning Algorithm , 2019, 2019 IEEE Biomedical Circuits and Systems Conference (BioCAS).

[31]  Hongxia Jin,et al.  Rare Sound Event Detection Using Deep Learning and Data Augmentation , 2019, INTERSPEECH.

[32]  Ankit Shah,et al.  Sound Event Detection in Domestic Environments with Weakly Labeled Data and Soundscape Synthesis , 2019, DCASE.

[33]  Andrea Tagarelli,et al.  Deep Auscultation: Predicting Respiratory Anomalies and Diseases via Recurrent Neural Networks , 2019, 2019 IEEE 32nd International Symposium on Computer-Based Medical Systems (CBMS).

[34]  Quoc V. Le,et al.  SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition , 2019, INTERSPEECH.

[35]  Jędrzej Kociński,et al.  Practical implementation of artificial intelligence algorithms in pulmonary auscultation examination , 2019, European Journal of Pediatrics.

[36]  Cristina Jácome,et al.  Convolutional Neural Network for Breathing Phase Detection in Lung Sounds , 2019, Sensors.

[37]  Miguel Tavares Coimbra,et al.  Deep Convolutional Neural Networks for Heart Sound Segmentation , 2019, IEEE Journal of Biomedical and Health Informatics.

[38]  Lionel Delphin-Poulat,et al.  MEAN TEACHER WITH DATA AUGMENTATION FOR DCASE 2019 TASK 4 Technical Report , 2019 .

[39]  Andrey Filchenkov,et al.  Noise Masking Recurrent Neural Network for Respiratory Sound Classification , 2018, ICANN.

[40]  Nicolas Turpault,et al.  Large-Scale Weakly Labeled Semi-Supervised Sound Event Detection in Domestic Environments , 2018, DCASE.

[41]  Franz Pernkopf,et al.  Crackle and Breathing Phase Detection in Lung Sounds with Deep Bidirectional Gated Recurrent Neural Networks , 2018, 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[42]  Franz Pernkopf,et al.  Heart Sound Segmentation—An Event Detection Approach Using Deep Recurrent Neural Networks , 2018, IEEE Transactions on Biomedical Engineering.

[43]  Kun Zhang,et al.  Lung sounds classification using convolutional neural networks , 2018, Artif. Intell. Medicine.

[44]  Tomoki Toda,et al.  Duration-Controlled LSTM for Polyphonic Sound Event Detection , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[45]  Aren Jansen,et al.  CNN architectures for large-scale audio classification , 2016, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[46]  Heikki Huttunen,et al.  Convolutional Recurrent Neural Networks for Polyphonic Sound Event Detection , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[47]  Lin Li,et al.  Classification between normal and adventitious lung sounds using deep neural network , 2016, 2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP).

[48]  Vivek Miglani,et al.  Application of semi-supervised deep learning to lung sound analysis , 2016, 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[49]  Annamaria Mesaros,et al.  Metrics for Polyphonic Sound Event Detection , 2016 .

[50]  Lionel Tarassenko,et al.  Logistic Regression-HSMM-Based Heart Sound Segmentation , 2016, IEEE Transactions on Biomedical Engineering.

[51]  M. Sarkar,et al.  Auscultation of the respiratory system , 2015, Annals of thoracic medicine.

[52]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[53]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[54]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[55]  J J Struijk,et al.  Segmentation of heart sound recordings by a duration-dependent hidden Markov model , 2010, Physiological measurement.