Robust and Interpretable Temporal Convolution Network for Event Detection in Lung Sound Recordings

OBJECTIVE: This paper proposes a novel framework for lung sound event detection that segments continuous lung sound recordings into discrete events and recognizes each event. METHODS: We propose a multi-branch temporal convolutional network (TCN) architecture and a novel fusion strategy to combine the features produced by its branches. This allows the network not only to retain the most salient information across different temporal granularities while disregarding irrelevant information, but also to process recordings of arbitrary length. RESULTS: The proposed method is evaluated on multiple public and in-house benchmarks containing irregular and noisy recordings of the respiratory auscultation process, for the identification of auscultation events including inhalation, crackles, and rhonchi. Moreover, we provide an end-to-end model interpretation pipeline. CONCLUSION: Our analysis of different feature fusion strategies shows that the proposed feature concatenation method leads to better suppression of non-informative features, which drastically reduces the classifier overhead and results in a robust, lightweight network. SIGNIFICANCE: Lung sound event detection is a primary diagnostic step for numerous respiratory diseases. The proposed method is a cost-effective and efficient alternative to exhaustive manual segmentation and yields more accurate segmentation than existing methods. The end-to-end model interpretability helps to build the trust required for use of the system in clinical settings.
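
To make the idea of multi-branch temporal convolution with concatenation-based fusion concrete, the following is a minimal sketch and not the authors' implementation. It assumes one dilated 1D convolution per branch, zero padding so that the output keeps the input length, and random (untrained) kernels. The names `dilated_conv1d`, `multi_branch_features`, `DILATIONS`, and `KERNELS`, as well as the dilation rates, are hypothetical choices made for illustration only.

```python
import random


def dilated_conv1d(x, kernel, dilation):
    """Length-preserving dilated 1D convolution followed by ReLU.

    Zero padding is applied implicitly: taps that fall outside the
    sequence contribute nothing. This is an assumed, simplified layer.
    """
    center = len(kernel) // 2
    out = []
    for t in range(len(x)):
        acc = 0.0
        for i, w in enumerate(kernel):
            idx = t + (i - center) * dilation
            if 0 <= idx < len(x):
                acc += w * x[idx]
        out.append(max(acc, 0.0))  # ReLU non-linearity
    return out


def multi_branch_features(x, kernels, dilations):
    """Run one branch per dilation rate and concatenate per time step."""
    branches = [dilated_conv1d(x, k, d) for k, d in zip(kernels, dilations)]
    # Fusion by concatenation: each frame gets one value from every branch.
    return [list(frame) for frame in zip(*branches)]


# Hypothetical dilation rates: small values see fine temporal detail,
# large values see a wider temporal context.
DILATIONS = (1, 4, 16)
rng = random.Random(0)
KERNELS = [[rng.uniform(-1, 1) for _ in range(3)] for _ in DILATIONS]

if __name__ == "__main__":
    # The same kernels apply to recordings of different lengths.
    for n_frames in (50, 333):
        signal = [rng.uniform(-1, 1) for _ in range(n_frames)]
        feats = multi_branch_features(signal, KERNELS, DILATIONS)
        print(n_frames, len(feats), len(feats[0]))
```

Because every operation is convolutional, the sketch yields one fused feature vector per input frame regardless of recording length, which is the property that permits frame-level event segmentation. A per-frame classifier over these fused vectors would follow in a full system and is omitted here.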
