COVID-19 Artificial Intelligence Diagnosis Using Only Cough Recordings

Goal: We hypothesized that COVID-19 subjects, especially including asymptomatics, could be accurately discriminated only from a forced-cough cell phone recording using Artificial Intelligence. To train our MIT Open Voice model we built a data collection pipeline of COVID-19 cough recordings through our website (opensigma.mit.edu) between April and May 2020 and created the largest audio COVID-19 cough balanced dataset reported to date with 5,320 subjects. Methods: We developed an AI speech processing framework that leverages acoustic biomarker feature extractors to pre-screen for COVID-19 from cough recordings, and provide a personalized patient saliency map to longitudinally monitor patients in real-time, non-invasively, and at essentially zero variable cost. Cough recordings are transformed with Mel Frequency Cepstral Coefficient and inputted into a Convolutional Neural Network (CNN) based architecture made up of one Poisson biomarker layer and 3 pre-trained ResNet50's in parallel, outputting a binary pre-screening diagnostic. Our CNN-based models have been trained on 4256 subjects and tested on the remaining 1064 subjects of our dataset. Transfer learning was used to learn biomarker features on larger datasets, previously successfully tested in our Lab on Alzheimer's, which significantly improves the COVID-19 discrimination accuracy of our architecture. Results: When validated with subjects diagnosed using an official test, the model achieves COVID-19 sensitivity of 98.5% with a specificity of 94.2% (AUC: 0.97). For asymptomatic subjects it achieves sensitivity of 100% with a specificity of 83.2%. Conclusions: AI techniques can produce a free, non-invasive, real-time, any-time, instantly distributable, large-scale COVID-19 asymptomatic screening tool to augment current approaches in containing the spread of COVID-19. Practical use cases could be for daily screening of students, workers, and public as schools, jobs, and transport reopen, or for pool testing to quickly alert of outbreaks in groups. General speech biomarkers may exist that cover several disease categories, as we demonstrated using the same ones for COVID-19 and Alzheimer's.

[1]  Vinayak Swarnkar,et al.  Cough Sound Analysis Can Rapidly Diagnose Childhood Pneumonia , 2013, Annals of Biomedical Engineering.

[2]  Martina Capuzzo,et al.  Testing for SARS-CoV-2 (COVID-19): a systematic review and clinical guide to molecular and serological in-vitro diagnostic assays , 2020, Reproductive BioMedicine Online.

[3]  S. R. Livingstone,et al.  The Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS): A dynamic, multimodal set of facial and vocal expressions in North American English , 2018, PloS one.

[4]  Sanjay Sarma,et al.  Hi Sigma, do I have the Coronavirus?: Call for a New Artificial Intelligence Approach to Support Health Care Professionals Dealing With The COVID-19 Pandemic , 2020, ArXiv.

[5]  L. Mao,et al.  Neurological Manifestations of Hospitalized Patients with COVID-19 in Wuhan, China: a retrospective case series study , 2020, medRxiv.

[6]  Cuong Pham,et al.  MobiCough: Real-Time Cough Detection and Monitoring Using Low-Cost Mobile Devices , 2016, ACIIDS.

[7]  Sanjay E. Sarma,et al.  On the Forgetting of College Academice: at "Ebbinghaus Speed"? , 2017 .

[8]  Renard Xaviero Adhi Pramono,et al.  A Cough-Based Algorithm for Automatic Diagnosis of Pertussis , 2016, PloS one.

[9]  W. M. van der Flier,et al.  The need for harmonisation and innovation of neuropsychological assessment in neurodegenerative dementias in Europe: consensus document of the Joint Program for Neurodegenerative Diseases Working Group , 2017, Alzheimer's Research & Therapy.

[10]  V. Swarnkar,et al.  A prospective multicentre study testing the diagnostic accuracy of an automated cough sound centred analytic system for the identification of common respiratory disorders in children , 2019, Respiratory Research.

[11]  Muhammad Nabeel,et al.  AI4COVID-19: AI enabled preliminary diagnosis for COVID-19 from cough samples via an app , 2020, Informatics in Medicine Unlocked.

[12]  Richard L. Doty,et al.  Smell dysfunction: a biomarker for COVID‐19 , 2020, International forum of allergy & rhinology.

[13]  Bernard Harmegnies,et al.  Clinical and epidemiological characteristics of 1420 European patients with mild‐to‐moderate coronavirus disease 2019 , 2020, Journal of internal medicine.

[14]  Gregory Jicha,et al.  Screen and Intervene: The Importance of Early Detection and Treatment of Alzheimer's Disease , 2020 .

[15]  Sharelle Baldwin,et al.  Unit 10.3: Assessment of Cognitive Impairments in the Diagnosis of Alzheimer’s Disease” , 2009 .

[16]  Thomas F. Quatieri,et al.  A Framework for Biomarkers of COVID-19 Based on Coordination of Speech-Production Subsystems , 2020, IEEE Open Journal of Engineering in Medicine and Biology.

[17]  M. Salathé,et al.  COVID-19 epidemic in Switzerland: on the importance of testing, contact tracing and isolation. , 2020, Swiss medical weekly.

[18]  Sanjeev Khudanpur,et al.  Librispeech: An ASR corpus based on public domain audio books , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[19]  Lawrence O Gostin,et al.  Access to lifesaving medical resources for African countries: COVID-19 testing and response, ethics, and politics , 2020, The Lancet.

[20]  David J. Hunter Covid-19 and the Stiff Upper Lip - The Pandemic Response in the United Kingdom. , 2020, The New England journal of medicine.

[21]  Brian Subirana,et al.  Call for a wake standard for artificial intelligence , 2020, Commun. ACM.

[22]  C. Raji,et al.  Neurobiology of COVID-19. , 2020, Journal of Alzheimer's disease : JAD.

[23]  T. Bayer,et al.  Motor impairment in Alzheimer’s disease and transgenic Alzheimer’s disease mouse models , 2008, Genes, brain, and behavior.

[24]  Z. Fayad,et al.  Artificial intelligence–enabled rapid diagnosis of patients with COVID-19 , 2020, Nature Medicine.

[25]  R. Barro,et al.  The Coronavirus and the Great Influenza Epidemic - Lessons from the 'Spanish Flu' for the Coronavirus's Potential Effects on Mortality and Economic Activity , 2020, SSRN Electronic Journal.

[26]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  T. Higenbottam,et al.  Glottis narrowing in lung disease. , 2015, The American review of respiratory disease.

[28]  Sanjay E. Sarma,et al.  Theory of Intelligence with Forgetting: Mathematical Theorems Explaining Human Universal Forgetting using “Forgetting Neural Networks” , 2017 .

[29]  W. Reed,et al.  From gene families and genera to incomes and internet file sizes: why power laws are so common in nature. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[30]  Bruce J. Tromberg,et al.  Rapid Scaling Up of Covid-19 Diagnostic Testing in the United States — The NIH RADx Initiative , 2020, The New England journal of medicine.

[31]  D. Bub,et al.  Semantic memory loss in dementia of Alzheimer's type. What do various measures measure? , 1990, Brain : a journal of neurology.

[32]  Sanjay E. Sarma,et al.  “Wake Neutrality” of Artificial Intelligence Devices , 2020 .

[33]  K. Cao,et al.  Using Artificial Intelligence to Detect COVID-19 and Community-acquired Pneumonia Based on Pulmonary CT: Evaluation of the Diagnostic Accuracy , 2020 .

[34]  J. Dodd,et al.  Lung disease as a determinant of cognitive decline and dementia , 2015, Alzheimer's Research & Therapy.

[35]  Jos Prickaerts,et al.  From Age-Related Cognitive Decline to Alzheimer's Disease: A Translational Overview of the Potential Role for Phosphodiesterases. , 2017, Advances in neurobiology.

[36]  P D van Helden,et al.  Detection of tuberculosis by automatic cough sound analysis , 2018, Physiological measurement.

[37]  Ann Chang,et al.  Perceived phonatory effort and phonation threshold pressure across a prolonged voice loading task: a study of vocal fatigue. , 2004, Journal of voice : official journal of the Voice Foundation.