The Second DiCOVA Challenge: Dataset and performance analysis for COVID-19 diagnosis using acoustics

The Second Diagnosis of COVID-19 using Acoustics (DiCOVA) Challenge aimed to accelerate research in acoustics-based detection of COVID-19, a topic at the intersection of acoustics, signal processing, machine learning, and healthcare. This paper presents the details of the challenge, which was an open call for researchers to analyze a dataset of audio recordings consisting of breathing, cough, and speech signals. The data were collected from individuals with and without COVID-19 infection, and the challenge task was two-class classification. The development set contained audio recordings from 965 individuals (172 COVID-19 positive), while the evaluation set contained recordings from 471 individuals (71 COVID-19 positive). The challenge featured four tracks: one for each of the three sound categories (cough, speech, and breathing) and a fourth fusion track. A baseline system was also released as a benchmark for the participants. In this paper, we present an overview of the challenge, the rationale for the data collection, and the baseline system. We also present a performance analysis of the systems submitted to the leaderboard by the 16 participating teams.
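
The subject counts above imply that both sets are imbalanced toward COVID-19 negative individuals. As a rough illustration only, the short sketch below computes the positive-class fraction for each set from the counts quoted in the abstract; the names `splits` and `positive_fraction` are hypothetical and are not part of the challenge baseline.

```python
# Subject counts quoted in the abstract: (total individuals, COVID-19 positive).
# The dictionary and function names are illustrative assumptions.
splits = {
    "development": (965, 172),
    "evaluation": (471, 71),
}


def positive_fraction(total, positive):
    """Return the fraction of individuals labeled COVID-19 positive."""
    return positive / total


for name, (total, positive) in splits.items():
    frac = positive_fraction(total, positive)
    print(f"{name}: {positive}/{total} positive ({frac:.1%})")
```

Under these counts, roughly 18% of development subjects and 15% of evaluation subjects are positive, so a classifier that always predicts the negative class would already be correct for most subjects.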
