Evaluation of Speech-Based Digital Biomarkers: Review and Recommendations

Speech represents a promising novel biomarker by providing a window into brain health, as shown by its disruption in various neurological and psychiatric diseases. As with many novel digital biomarkers, however, rigorous evaluation is currently lacking and is required for these measures to be used effectively and safely. This paper outlines and provides examples from the literature of evaluation steps for speech-based digital biomarkers, based on the recent V3 framework (Goldsack et al., 2020). The V3 framework describes 3 components of evaluation for digital biomarkers: verification, analytical validation, and clinical validation. Verification includes assessing the quality of speech recordings and comparing the effects of hardware and recording conditions on the integrity of the recordings. Analytical validation includes checking the accuracy and reliability of data processing and computed measures, including understanding test-retest reliability, demographic variability, and comparing measures to reference standards. Clinical validity involves verifying the correspondence of a measure to clinical outcomes which can include diagnosis, disease progression, or response to treatment. For each of these sections, we provide recommendations for the types of evaluation necessary for speech-based biomarkers and review published examples. The examples in this paper focus on speech-based biomarkers, but they can be used as a template for digital biomarker development more generally.

[1]  Aurobinda Routray,et al.  A novel drowsiness detection scheme based on speech analysis with validation using simultaneous EEG recordings , 2010, 2010 IEEE International Conference on Automation Science and Engineering.

[2]  Sean Khozin,et al.  Developing and adopting safe and effective digital biomarkers to improve patient outcomes , 2019, npj Digital Medicine.

[3]  Chris Leptak,et al.  What evidence do we need for biomarker qualification? , 2017, Science Translational Medicine.

[4]  Kathleen C. Fraser,et al.  Automated classification of primary progressive aphasia subtypes from narrative speech transcripts , 2014, Cortex.

[5]  L S Honig,et al.  Aging and Alzheimer's disease: lessons from the Nun Study. , 1998, The Gerontologist.

[6]  Antoine Piau,et al.  Current State of Digital Biomarker Technologies for Real-Life, Home-Based Monitoring of Cognitive Function for Mild Cognitive Impairment to Mild Alzheimer Disease and Implications for Clinical Care: Systematic Review , 2019, Journal of medical Internet research.

[7]  Kathleen C. Fraser,et al.  Linguistic Features Identify Alzheimer's Disease in Narrative Speech. , 2015, Journal of Alzheimer's disease : JAD.

[8]  K. Strimbu,et al.  What are biomarkers? , 2010, Current opinion in HIV and AIDS.

[9]  Andrea Coravos,et al.  Modernizing and designing evaluation frameworks for connected sensor technologies in medicine , 2020, npj Digital Medicine.

[10]  Michal Novotný,et al.  Smartphone Allows Capture of Speech Abnormalities Associated With High Risk of Developing Parkinson’s Disease , 2018, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[11]  Serguei V. S. Pakhomov,et al.  Using automatic speech recognition to assess spoken responses to cognitive tests of semantic verbal fluency , 2015, Speech Commun..

[12]  Frank Rudzicz,et al.  On the importance of normative data in speech-based assessment , 2017, ArXiv.

[13]  M. Copelli,et al.  Thought disorder measured as random speech structure classifies negative symptoms and schizophrenia diagnosis 6 months in advance , 2017, npj Schizophrenia.

[14]  V. Puntmann How-to guide on biomarkers: biomarker definitions, validation and applications with examples from cardiovascular disease , 2009, Postgraduate Medical Journal.

[15]  Gábor Gosztolya,et al.  Detecting Mild Cognitive Impairment from Spontaneous Speech by Correlation-Based Phonetic Feature Selection , 2016, INTERSPEECH.

[16]  Michael A. Covington,et al.  Speech deficits in serious mental illness: A cognitive resource issue? , 2014, Schizophrenia Research.

[17]  Youri Maryn,et al.  Mobile Communication Devices, Ambient Noise, and Acoustic Voice Measures. , 2017, Journal of voice : official journal of the Voice Foundation.

[18]  Claudia Manfredi,et al.  Maximal Ambient Noise Levels and Type of Voice Material Required for Valid Use of Smartphones in Clinical Voice Research. , 2017, Journal of voice : official journal of the Voice Foundation.

[19]  Visar Berisha,et al.  Tracking discourse complexity preceding Alzheimer's disease diagnosis: a case study comparing the press conferences of Presidents Ronald Reagan and George Herbert Walker Bush. , 2015, Journal of Alzheimer's disease : JAD.

[20]  Felix Schaeffler,et al.  Assessing voice health using smartphones: bias and random error of acoustic voice parameters captured by different smartphone types. , 2019, International journal of language & communication disorders.

[21]  S. Cappa,et al.  Connected Speech in Neurodegenerative Language Disorders: A Review , 2017, Front. Psychol..

[22]  D R Wekstein,et al.  Linguistic ability in early life and cognitive function and Alzheimer's disease in late life. Findings from the Nun Study. , 1996, JAMA.

[23]  Ryuki Tachibana,et al.  Major depressive disorder discrimination using vocal acoustic features. , 2018, Journal of affective disorders.

[24]  Gábor Gosztolya,et al.  A Speech Recognition-based Solution for the Automatic Detection of Mild Cognitive Impairment from Spontaneous Speech , 2018, Current Alzheimer research.

[25]  Dimitra Vergyri,et al.  Learning diagnostic models using speech and language measures , 2008, 2008 30th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[26]  David A. Snowdon,et al.  Early life linguistic ability, late life cognitive function, and neuropathology: findings from the Nun Study , 2005, Neurobiology of Aging.

[27]  J. Flint,et al.  Re-examining the robustness of voice features in predicting depression: Compared with baseline of confounders , 2019, PloS one.

[28]  J. Mundt,et al.  Vocal Acoustic Biomarkers of Depression Severity and Treatment Response , 2012, Biological Psychiatry.

[29]  J. Mundt,et al.  Voice acoustic measures of depression severity and treatment response collected via interactive voice response (IVR) technology , 2007, Journal of Neurolinguistics.

[30]  Elmar Nöth,et al.  How language flows when movements don’t: An automated analysis of spontaneous discourse in Parkinson’s disease , 2016, Brain and Language.

[31]  H. Riper,et al.  Validating Automated Sentiment Analysis of Online Cognitive Behavioral Therapy Patient Texts: An Exploratory Study , 2019, Front. Psychol..

[32]  K. Forbes-McKay,et al.  Detecting subtle spontaneous language decline in early Alzheimer’s disease with a picture description task , 2005, Neurological Sciences.

[33]  Mark Liberman,et al.  Validated automatic speech biomarkers in primary progressive aphasia , 2018, Annals of clinical and translational neurology.

[34]  Dong Si,et al.  Parkinson’s Disease Diagnosis Using Machine Learning and Voice , 2018, 2018 IEEE Signal Processing in Medicine and Biology Symposium (SPMB).

[35]  Graeme Hirst,et al.  Longitudinal detection of dementia through lexical and syntactic changes in writing: a case study of three British novelists , 2011, Lit. Linguistic Comput..

[36]  Miguel Angel Ferrer-Ballester,et al.  Alzheimer's disease and automatic speech analysis: A review , 2020, Expert Syst. Appl..

[37]  Colleen Richey,et al.  Aided diagnosis of dementia type through computer-based analysis of spontaneous speech , 2014, CLPsych@ACL.

[38]  Enkelejda Miho,et al.  Traditional and Digital Biomarkers: Two Worlds Apart? , 2019, Digital Biomarkers.

[39]  S. Borson,et al.  Rationale for Early Diagnosis of Mild Cognitive Impairment (MCI) supported by Emerging Digital Technologies , 2020, The Journal of Prevention of Alzheimer's Disease.

[40]  Brita Elvevåg,et al.  Automated computerized analysis of speech in psychiatric disorders , 2014, Current opinion in psychiatry.

[41]  Michael Sedlmair,et al.  More than Bags of Words: Sentiment Analysis with Word Embeddings , 2018 .

[42]  O. Kinouchi,et al.  Speech Graphs Provide a Quantitative Measure of Thought Disorder in Psychosis , 2012, PloS one.

[43]  Jennifer C. Goldsack,et al.  Verification, analytical validation, and clinical validation (V3): the foundation of determining fit-for-purpose for Biometric Monitoring Technologies (BioMeTs) , 2020, npj Digital Medicine.

[44]  Dimitrios Kokkinakis,et al.  Voice quality and speech fluency distinguish individuals with Mild Cognitive Impairment from Healthy Controls , 2020, PloS one.

[45]  Robert T. Schultz,et al.  Automatic Detection of Autism Spectrum Disorder in Children Using Acoustic and Text Features from Brief Natural Conversations , 2019, INTERSPEECH.

[46]  Pavel Sovka,et al.  Validation of freely-available pitch detection algorithms across various noise levels in assessing speech captured by smartphone in Parkinson's disease , 2020, Biomed. Signal Process. Control..

[47]  Meysam Asgari,et al.  Predicting mild cognitive impairment from spontaneous spoken utterances , 2017, Alzheimer's & dementia.

[48]  Veronika Vincze,et al.  Speaking in Alzheimer’s Disease, is That an Early Sign? Importance of Changes in Language Abilities in Alzheimer’s Disease , 2015, Front. Aging Neurosci..

[49]  Adam P Vogel,et al.  Motor Speech Phenotypes of Frontotemporal Dementia, Primary Progressive Aphasia, and Progressive Apraxia of Speech. , 2017, Journal of speech, language, and hearing research : JSLHR.

[50]  M. Sigman,et al.  Automated analysis of free speech predicts psychosis onset in high-risk youths , 2015, npj Schizophrenia.

[51]  Oliver B. Regele,et al.  Digital biomarkers for Alzheimer’s disease: the mobile/wearable devices opportunity , 2019, npj Digital Medicine.

[52]  Antanas Verikas,et al.  Exploring the feasibility of smart phone microphone for measurement of acoustic voice parameters and voice pathology screening , 2015, European Archives of Oto-Rhino-Laryngology.

[53]  Ioannis Tarnanas,et al.  Digital technologies as biomarkers, clinical outcomes assessment, and recruitment tools in Alzheimer's disease clinical trials , 2018, Alzheimer's & dementia.

[54]  Nicholas B. Allen,et al.  Detection of Clinical Depression in Adolescents’ Speech During Family Interactions , 2011, IEEE Transactions on Biomedical Engineering.

[55]  John L Allinson Clinical biomarker validation. , 2018, Bioanalysis.

[56]  Alexandra Konig,et al.  Use of Speech Analyses within a Mobile Application for the Assessment of Cognitive Impairment in Elderly People. , 2018, Current Alzheimer research.

[57]  Jennifer C. Goldsack,et al.  Digital Medicine: A Primer on Measurement , 2019, Digital Biomarkers.

[58]  Tingshao Zhu,et al.  Acoustic differences between healthy and depressed people: a cross-situation study , 2019, BMC Psychiatry.

[59]  Gary Bishop,et al.  Automated Speech Recognition in Adult Stroke Survivors: Comparing Human and Computer Transcriptions , 2019, Folia Phoniatrica et Logopaedica.

[60]  Sheena Reilly,et al.  Comparability of Modern Recording Devices for Speech Analysis: Smartphone, Landline, Laptop, and Hard Disc Recorder , 2015, Folia Phoniatrica et Logopaedica.

[61]  G. Cecchi,et al.  Prediction of psychosis across protocols and risk cohorts using automated language analysis , 2018, World psychiatry : official journal of the World Psychiatric Association.

[62]  C Manfredi,et al.  Smartphones Offer New Opportunities in Clinical Voice Research. , 2017, Journal of voice : official journal of the Voice Foundation.

[63]  J. Willits,et al.  Measuring disorganized speech in schizophrenia: automated analysis explains variance in cognitive deficits beyond clinician-rated scales , 2018, Psychological Medicine.

[64]  J. Hodges,et al.  The effects of very early Alzheimer's disease on the characteristics of writing by a renowned author. , 2004, Brain : a journal of neurology.

[65]  C. Jack,et al.  Strategic roadmap for an early diagnosis of Alzheimer's disease based on biomarkers , 2017, The Lancet Neurology.