Predicting Intelligible Speaking Rate in Individuals with Amyotrophic Lateral Sclerosis from a Small Number of Speech Acoustic and Articulatory Samples.

Amyotrophic lateral sclerosis (ALS) is a rapidly progressive neurological disease that affects the speech motor functions, resulting in dysarthria, a motor speech disorder. Speech and articulation deterioration is an indicator of the disease progression of ALS; timely monitoring of the disease progression is critical for clinical management of these patients. This paper investigated machine prediction of intelligible speaking rate of nine individuals with ALS based on a small number of speech acoustic and articulatory samples. Two feature selection techniques - decision tree and gradient boosting - were used with support vector regression for predicting the intelligible speaking rate. Experimental results demonstrated the feasibility of predicting intelligible speaking rate from only a small number of speech samples. Furthermore, adding articulatory features to acoustic features improved prediction performance, when decision tree was used as the feature selection technique.

[1]  Yana Yunusova,et al.  Tongue Movements and Their Acoustic Consequences in Amyotrophic Lateral Sclerosis , 2012, Folia Phoniatrica et Logopaedica.

[2]  M. Strong,et al.  Amyotrophic lateral sclerosis: A review of current concepts , 2003, Amyotrophic lateral sclerosis and other motor neuron disorders : official publication of the World Federation of Neurology, Research Group on Motor Neuron Diseases.

[3]  Raymond D. Kent,et al.  Speech deterioration in amyotrophic lateral sclerosis: a case study. , 1991, Journal of speech and hearing research.

[4]  Panying Rong,et al.  Predicting Early Bulbar Decline in Amyotrophic Lateral Sclerosis: A Speech Subsystem Approach , 2015, Behavioural neurology.

[5]  Max A. Little,et al.  Suitability of Dysphonia Measurements for Telemonitoring of Parkinson's Disease , 2008, IEEE Transactions on Biomedical Engineering.

[6]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[7]  Max A. Little,et al.  Novel Speech Signal Processing Algorithms for High-Accuracy Classification of Parkinson's Disease , 2012, IEEE Transactions on Biomedical Engineering.

[8]  Jun Wang,et al.  Towards Automatic Detection of Amyotrophic Lateral Sclerosis from Speech Acoustic and Articulatory Samples , 2016, INTERSPEECH.

[9]  Jun Wang,et al.  Individual articulator's contribution to phoneme production , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[10]  Jun Wang,et al.  Parkinson's condition estimation using speech acoustic and inversely mapped articulatory data , 2015, INTERSPEECH.

[11]  Jun Wang,et al.  Determining an Optimal Set of Flesh Points on Tongue, Lips, and Jaw for Continuous Silent Speech Recognition , 2015, SLPAT@Interspeech.

[12]  Alexander J. Smola,et al.  Support Vector Regression Machines , 1996, NIPS.

[13]  Thomas F. Quatieri,et al.  A review of depression and suicide risk assessment using speech analysis , 2015, Speech Commun..

[14]  Jan Nouza,et al.  Automatic Speech Recognition Used for Intelligibility Assessment of Text-to-Speech Systems , 2007, COST 2102 Workshop.

[15]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[16]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[17]  Ashok Samal,et al.  Articulatory distinctiveness of vowels and consonants: a data-driven approach. , 2013, Journal of speech, language, and hearing research : JSLHR.

[18]  J. Friedman Stochastic gradient boosting , 2002 .

[19]  J. Cedarbaum,et al.  The ALSFRS-R: a revised ALS functional rating scale that incorporates assessments of respiratory function , 1999, Journal of the Neurological Sciences.

[20]  D R Beukelman,et al.  Frequency of word occurrence in communication samples produced by adult communication aid users. , 1984, The Journal of speech and hearing disorders.

[21]  Jennifer L. Spielman,et al.  Formant centralization ratio: a proposal for a new acoustic measure of dysarthric speech. , 2010, Journal of speech, language, and hearing research : JSLHR.

[22]  Panying Rong,et al.  Predicting Speech Intelligibility Decline in Amyotrophic Lateral Sclerosis Based on the Deterioration of Individual Speech Subsystems , 2016, PloS one.

[23]  Elmar Nöth,et al.  The INTERSPEECH 2015 computational paralinguistics challenge: nativeness, parkinson's & eating condition , 2015, INTERSPEECH.

[24]  O. Hardiman,et al.  Amyotrophic lateral sclerosis , 2011, The Lancet.

[25]  S. Skodda,et al.  Intonation and speech rate in Parkinson's disease: general and dynamic aspects and responsiveness to levodopa admission. , 2011, Journal of voice : official journal of the Voice Foundation.

[26]  Thomas F. Quatieri,et al.  Vocal-Source Biomarkers for Depression: A Link to Psychomotor Activity , 2012, INTERSPEECH.

[27]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[28]  Bernhard Schölkopf,et al.  New Support Vector Algorithms , 2000, Neural Computation.

[29]  D R Beukelman,et al.  Communication efficiency of dysarthric speakers as measured by sentence intelligibility and speaking rate. , 1981, The Journal of speech and hearing disorders.

[30]  Ashok Samal,et al.  An Optimal Set of Flesh Points on Tongue and Lips for Speech-Movement Classification. , 2016, Journal of speech, language, and hearing research : JSLHR.

[31]  Christian Poellabauer,et al.  Using isolated vowel sounds for classification of Mild Traumatic Brain Injury , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[32]  J. Green,et al.  DETECTION OF BULBAR ALS USING A COMPREHENSIVE SPEECH ASSESSMENT BATTERY , 2013 .

[33]  Elmar Nöth,et al.  Intelligibility Rating with Automatic Speech Recognition, Prosodic, and Cepstral Evaluation , 2011, TSD.

[34]  Elmar Nöth,et al.  New computer aided device for real time analysis of speech of people with Parkinson’s disease , 2014, Revista Facultad de Ingeniería Universidad de Antioquia.

[35]  Jun Wang,et al.  Sentence recognition from articulatory movements for silent speech interfaces , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[36]  Björn Schuller,et al.  Opensmile: the munich versatile and fast open-source audio feature extractor , 2010, ACM Multimedia.

[37]  S. Skodda,et al.  Progression of dysprosody in Parkinson's disease over time—A longitudinal study , 2009, Movement disorders : official journal of the Movement Disorder Society.

[38]  Jeffrey J Berry,et al.  Accuracy of the NDI wave speech research system. , 2011, Journal of speech, language, and hearing research : JSLHR.

[39]  Max A. Little,et al.  Nonlinear speech analysis algorithms mapped to a standard metric achieve clinically useful quantification of average Parkinson's disease symptom severity , 2011, Journal of The Royal Society Interface.

[40]  Gary L. Pattee,et al.  Bulbar and speech motor assessment in ALS: Challenges and future directions , 2013, Amyotrophic lateral sclerosis & frontotemporal degeneration.

[41]  E. Beghi,et al.  The epidemiology of ALS and the role of population-based registries. , 2006, Biochimica et biophysica acta.