论文信息 - COVID-19 Diagnosis from Cough Acoustics using ConvNets and Data Augmentation

COVID-19 Diagnosis from Cough Acoustics using ConvNets and Data Augmentation

With the periodic rise and fall of COVID-19 and countries being inflicted by its waves, an efficient, economic, and effortless diagnosis procedure for the virus has been the utmost need of the hour. COVID-19 positive individuals may even be asymptomatic making the diagnosis difficult, but amongst the infected subjects, the asymptomatic ones need not be entirely free of symptoms caused by the virus. They might not show any observable symptoms like the symptomatic subjects, but they may differ from uninfected ones in the way they cough. These differences in the coughing sounds are minute and indiscernible to the human ear, however, these can be captured using machine learning-based statistical models. In this paper, we present a deep learning approach to analyze the acoustic dataset provided in Track 1 of the DiCOVA 2021 Challenge containing cough sound recordings belonging to both COVID-19 positive and negative examples. To perform the classification on the sound recordings as belonging to a COVID-19 positive or negative examples, we propose a ConvNet model. Our model achieved an AUC score percentage of 72.23 on the blind test set provided by the same for an unbiased evaluation of the models. The ConvNet model incorporated with Data Augmentation further increased the AUC-ROC percentage from 72.23 to 87.07. It also outperformed the DiCOVA 2021 Challenge's baseline model by 23% thus, claiming the top position on the DiCOVA 2021 Challenge leaderboard. This paper proposes the use of Mel frequency cepstral coefficients as the feature input for the proposed model.

Koushik Guha | Shubham Jain | Hoang Van Truong | Darsh Kaushik | Saranga Kingkor Mahanta

[1] Cecilia Mascolo,et al. Exploring Automatic Diagnosis of COVID-19 from Crowdsourced Respiratory Sound Data , 2020, KDD.

[2] Amil Khanzada,et al. Virufy: A Multi-Branch Deep Learning Network for Automated Detection of COVID-19 , 2021, Interspeech 2021.

[3] Juliana A. Knocikova,et al. Wavelet analysis of voluntary cough sound in patients with respiratory diseases. , 2008, Journal of physiology and pharmacology : an official journal of the Polish Physiological Society.

[4] C. Dolea,et al. World Health Organization , 1949, International Organization.

[5] Srikanth Raj Chetupalli,et al. Coswara - A Database of Breathing, Cough, and Voice Sounds for COVID-19 Diagnosis , 2020, INTERSPEECH.

[6] Andrew W. Senior,et al. Long short-term memory recurrent neural network architectures for large scale acoustic modeling , 2014, INTERSPEECH.

[7] R.W. Schafer,et al. From frequency to quefrency: a history of the cepstrum , 2004, IEEE Signal Processing Magazine.

[8] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[9] Ismail Shahin,et al. COVID-19 Detection System using Recurrent Neural Networks , 2020, 2020 International Conference on Communications, Computing, Cybersecurity, and Informatics (CCCI).

[10] Hoang Van Truong,et al. Unsupervised Detection of Anomalous Sound for Machine Condition Monitoring using Fully Connected U-Net , 2021, Journal of ICT Research and Applications.

[11] Prasanta Kumar Ghosh,et al. DiCOVA Challenge: Dataset, task, and baseline system for COVID-19 diagnosis using acoustics , 2021, Interspeech.

[12] Colin Raffel,et al. librosa: Audio and Music Signal Analysis in Python , 2015, SciPy.

[13] E. B. Newman,et al. A Scale for the Measurement of the Psychological Magnitude Pitch , 1937 .

[14] Muhammad Nabeel,et al. AI4COVID-19: AI enabled preliminary diagnosis for COVID-19 from cough samples via an app , 2020, Informatics in Medicine Unlocked.

[15] Bhiksha Raj,et al. On the Origin of Deep Learning , 2017, ArXiv.

[16] Brian Subirana,et al. COVID-19 Artificial Intelligence Diagnosis Using Only Cough Recordings , 2020, IEEE Open Journal of Engineering in Medicine and Biology.

[17] Arsha Nagrani,et al. Cough Against COVID: Evidence of COVID-19 Signature in Cough Sounds , 2020, ArXiv.

[18] Thaweesak Yingthawornsuk,et al. Speech Recognition using MFCC , 2012 .

[19] Yann LeCun,et al. Convolutional networks and applications in vision , 2010, Proceedings of 2010 IEEE International Symposium on Circuits and Systems.

[20] Gunvant R. Chaudhari,et al. Virufy: Global Applicability of Crowdsourced and Clinical Datasets for AI Detection of COVID-19 from Cough , 2020, ArXiv.

[21] Tony R. Martinez,et al. Distribution-balanced stratified cross-validation for accuracy estimation , 2000, J. Exp. Theor. Artif. Intell..

[22] Andrew P. Bradley,et al. The use of the area under the ROC curve in the evaluation of machine learning algorithms , 1997, Pattern Recognit..

[23] Matthew Osborne,et al. Diagnosing COVID-19: The Disease and Tools for Detection , 2020, ACS nano.