Uncertainty-Aware COVID-19 Detection from Imbalanced Sound Data

Recently, sound-based COVID-19 detection studies have shown great promise to achieve scalable and prompt digital prescreening. However, there are still two unsolved issues hindering the practice. First, collected datasets for model training are often imbalanced, with a considerably smaller proportion of users tested positive, making it harder to learn representative and robust features. Second, deep learning models are generally overconfident in their predictions. Clinically, false predictions aggravate healthcare costs. Estimation of the uncertainty of screening would aid this. To handle these issues, we propose an ensemble framework where multiple deep learning models for sound-based COVID-19 detection are developed from different but balanced subsets from original data. As such, data are utilized more effectively compared to traditional up-sampling and down-sampling approaches: an AUC of 0.74 with a sensitivity of 0.68 and a specificity of 0.69 is achieved. Simultaneously, we estimate uncertainty from the disagreement across multiple models. It is shown that false predictions often yield higher uncertainty, enabling us to suggest the users with certainty higher than a threshold to repeat the audio test on their phones or to take clinical tests if digital diagnosis still fails. This study paves the way for a more robust sound-based COVID-19 automated screening system. Copyright © 2021 ISCA.

[1]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[2]  Siegfried Wahl,et al.  Leveraging uncertainty information from deep neural networks for disease detection , 2016, Scientific Reports.

[3]  Charles Blundell,et al.  Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles , 2016, NIPS.

[4]  Cecilia Mascolo,et al.  The INTERSPEECH 2021 Computational Paralinguistics Challenge: COVID-19 Cough, COVID-19 Speech, Escalation & Primates , 2021, Interspeech.

[5]  Srikanth Raj Chetupalli,et al.  Coswara - A Database of Breathing, Cough, and Voice Sounds for COVID-19 Diagnosis , 2020, INTERSPEECH.

[6]  Kun Qian,et al.  An Early Study on Intelligent Analysis of Speech under COVID-19: Severity, Sleep Quality, Fatigue, and Anxiety , 2020, INTERSPEECH.

[7]  Aren Jansen,et al.  CNN architectures for large-scale audio classification , 2016, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[8]  Kevin Smith,et al.  Bayesian Uncertainty Estimation for Batch Normalized Deep Networks , 2018, ICML.

[9]  Sebastian Nowozin,et al.  Can You Trust Your Model's Uncertainty? Evaluating Predictive Uncertainty Under Dataset Shift , 2019, NeurIPS.

[10]  Mark J. F. Gales,et al.  Ensemble Approaches for Uncertainty in Spoken Language Assessment , 2020, INTERSPEECH.

[11]  K. Yuen,et al.  Clinical Characteristics of Coronavirus Disease 2019 in China , 2020, The New England journal of medicine.

[12]  J. Stock,et al.  Economic Benefits of COVID-19 Screening Tests , 2020, medRxiv.

[13]  Kun Qian,et al.  COVID-19 and Computer Audition: An Overview on What Speech & Sound Analysis Could Contribute in the SARS-CoV-2 Corona Crisis , 2020, Frontiers in Digital Health.

[14]  Thomas Niesler,et al.  COVID-19 cough classification using machine learning and global smartphone recordings , 2021, Computers in Biology and Medicine.

[15]  Gadi Pinkas,et al.  SARS-CoV-2 Detection From Voice , 2020, IEEE Open Journal of Engineering in Medicine and Biology.

[16]  Allan Tucker,et al.  Estimating Uncertainty and Interpretability in Deep Learning for Coronavirus (COVID-19) Detection , 2020, ArXiv.

[17]  Wenqi Wei,et al.  A Real-time Robot-based Auxiliary System for Risk Evaluation of COVID-19 Infection , 2020, INTERSPEECH.

[18]  Maral Asiaee,et al.  Voice Quality Evaluation in Patients With COVID-19: An Acoustic Analysis , 2020, Journal of Voice.

[19]  Malik Peiris,et al.  Virology, transmission, and pathogenesis of SARS-CoV-2 , 2020, BMJ.

[20]  Cecilia Mascolo,et al.  Exploring Automatic COVID-19 Diagnosis via Voice and Symptoms from Crowdsourced Data , 2021, ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[21]  Kilian Q. Weinberger,et al.  On Calibration of Modern Neural Networks , 2017, ICML.

[22]  Bjorn W. Schuller,et al.  Detecting COVID-19 from Breathing and Coughing Sounds using Deep Neural Networks , 2020, 2021 IEEE 34th International Symposium on Computer-Based Medical Systems (CBMS).

[23]  Zoubin Ghahramani,et al.  Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning , 2015, ICML.

[24]  Nathalie Japkowicz,et al.  The class imbalance problem: A systematic study , 2002, Intell. Data Anal..

[25]  Eric Song,et al.  Analytical sensitivity and efficiency comparisons of SARS-CoV-2 RT–qPCR primer–probe sets , 2020, Nature Microbiology.

[26]  Muhammad Nabeel,et al.  AI4COVID-19: AI enabled preliminary diagnosis for COVID-19 from cough samples via an app , 2020, Informatics in Medicine Unlocked.

[27]  T. Ortmaier,et al.  Quantifying the uncertainty of deep learning-based computer-aided diagnosis for patient safety , 2019, Current Directions in Biomedical Engineering.

[28]  Brian Subirana,et al.  COVID-19 Artificial Intelligence Diagnosis Using Only Cough Recordings , 2020, IEEE Open Journal of Engineering in Medicine and Biology.

[29]  Cecilia Mascolo,et al.  The Benefit of the Doubt: Uncertainty Aware Sensing for Edge Computing Platforms , 2021, 2021 IEEE/ACM Symposium on Edge Computing (SEC).

[30]  Giorgio Valentini,et al.  Imbalance-Aware Machine Learning for Predicting Rare and Common Disease-Associated Non-Coding Variants , 2017, Scientific Reports.

[31]  Cecilia Mascolo,et al.  Exploring Automatic Diagnosis of COVID-19 from Crowdsourced Respiratory Sound Data , 2020, KDD.

[32]  Gongping Yang,et al.  On the Class Imbalance Problem , 2008, 2008 Fourth International Conference on Natural Computation.

[33]  Arsha Nagrani,et al.  Cough Against COVID: Evidence of COVID-19 Signature in Cough Sounds , 2020, ArXiv.

[34]  D. Lendrem,et al.  First experience of COVID-19 screening of health-care workers in England , 2020, The Lancet.