Pathological Speech Intelligibility Assessment Based on the Short-time Objective Intelligibility Measure

Impaired speech intelligibility in motor speech disorders arising due to neurological diseases negatively affects the communication ability and quality of life of patients. Reliable and cost-effective measures to automatically assess speech intelligibility are necessary for the management of such disorders. In this paper, we propose to automatically assess the intelligibility of pathological speech based on short-time objective intelligibility measures typically used in speech enhancement, which however require a reference signal that is time-aligned to the test signal. We propose a method to create an utterance-dependent reference signal of intelligible speech from multiple healthy speakers. In order to assess intelligibility, the pathological speech signal is aligned to the created reference signal using dynamic time warping and the divergence between the two signals is quantified using either the short-time or the spectral correlation. Experiments on databases of English and French patients suffering from Cerebral Palsy and Amyotrophic Lateral Sclerosis show that the proposed intelligibility measures can obtain a high correlation with subjective intelligibility ratings, outperforming several state-of-the-art pathological speech intelligibility measures.

[1]  K. Hustad Estimating the Intelligibility of Speakers with Dysarthria , 2006, Folia Phoniatrica et Logopaedica.

[2]  Jonathan C. Kim,et al.  Speech intelligibility estimation using multi-resolution spectral features for speakers undergoing cancer treatment. , 2014, The Journal of the Acoustical Society of America.

[3]  Heidi Christensen,et al.  Dysarthria intelligibility assessment in a factor analysis total variability space , 2013, INTERSPEECH.

[4]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[5]  Fraser Shein,et al.  Characterization of atypical vocal source excitation, temporal dynamics and prosody for objective measurement of dysarthric word intelligibility , 2012, Speech Commun..

[6]  Joseph R. Duffy,et al.  Motor Speech Disorders: Clues to Neurologic Diagnosis , 2000 .

[7]  Elmar Nöth,et al.  Can you understand him? Let's look at his word accuracy-automatic evaluation of tracheoesophageal speech , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[8]  Paul Boersma,et al.  Praat, a system for doing phonetics by computer , 2002 .

[9]  Waad Ben Kheder,et al.  Automatic Prediction of Speech Evaluation Metrics for Dysarthric Speech , 2017, INTERSPEECH.

[10]  Jesper Jensen,et al.  A short-time objective intelligibility measure for time-frequency weighted noisy speech , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[11]  Tiago H. Falk,et al.  Automated Dysarthria Severity Classification for Improved Objective Intelligibility Assessment of Spastic Dysarthric Speech , 2012, INTERSPEECH.

[12]  Thomas S. Huang,et al.  Dysarthric speech database for universal access research , 2008, INTERSPEECH.

[13]  Tiago H. Falk,et al.  Quantifying perturbations in temporal dynamics for automated assessment of spastic dysarthric speech intelligibility , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[14]  Tiago H. Falk,et al.  Spectral Features for Automatic Blind Intelligibility Estimation of Spastic Dysarthric Speech , 2011, INTERSPEECH.

[15]  A. Aronson,et al.  Differential diagnostic patterns of dysarthria. , 1969, Journal of speech and hearing research.

[16]  Véronique Delvaux,et al.  The MonPaGe_HA Database for the Documentation of Spoken French Throughout Adulthood , 2018, LREC.

[17]  Raymond D. Kent,et al.  Acoustic-phonetic contrasts and intelligibility in the dysarthria associated with mixed cerebral palsy. , 1992, Journal of speech and hearing research.

[18]  Jean-Pierre Martens,et al.  Objective intelligibility assessment of pathological speakers , 2008, INTERSPEECH.

[19]  Chunying Fang,et al.  Intelligibility Evaluation of Pathological Speech through Multigranularity Feature Extraction and Optimization , 2017, Comput. Math. Methods Medicine.

[20]  Richard Heusdens,et al.  Matching pursuit for channel selection in cochlear implants based on an intelligibility metric , 2012, 2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO).

[21]  Nick Miller,et al.  Association between objective measurement of the speech intelligibility of young people with dysarthria and listener ratings of ease of understanding , 2014, International journal of speech-language pathology.

[22]  Elmar Nöth,et al.  Automatic scoring of the intelligibility in patients with cancer of the oral cavity , 2007, INTERSPEECH.

[23]  E. Nöth,et al.  Automatic Quantification of Speech Intelligibility of Adults with Oral Squamous Cell Carcinoma , 2008, Folia Phoniatrica et Logopaedica.

[24]  Tiago H. Falk,et al.  A Non-Intrusive Quality and Intelligibility Measure of Reverberant and Dereverberated Speech , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[25]  Elmar Nöth,et al.  Intelligibility of laryngectomees’ substitute speech: automatic speech recognition and subjective rating , 2005, European Archives of Oto-Rhino-Laryngology and Head & Neck.

[26]  Jesper Jensen,et al.  An Algorithm for Intelligibility Prediction of Time–Frequency Weighted Noisy Speech , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[27]  Elmar Nöth,et al.  Automatic intelligibility assessment of speakers after laryngeal cancer by means of acoustic modeling. , 2012, Journal of voice : official journal of the Voice Foundation.

[28]  Jesper Jensen,et al.  An Algorithm for Predicting the Intelligibility of Speech Masked by Modulated Noise Maskers , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[29]  P. Enderby Disorders of communication: dysarthria. , 2013, Handbook of clinical neurology.

[30]  Elmar Nöth,et al.  Automatic Speech Recognition Systems for the Evaluation of Voice and Speech Disorders in Head and Neck Cancer , 2010, EURASIP J. Audio Speech Music. Process..