Objective intelligibility assessment of pathological speakers

Intelligibility is a primary measure for the assessment of pathological speech. Traditionally, it is measured using a perceptual test, which is by definition subjective in nature. Consequently, there is a great interest in reliable, automatic and therefore objective methods. This paper presents such a method that incorporates an automatic speech recognizer (ASR) for producing features that characterize the pronunciations of a speaker and an intelligibility prediction model (IPM) for converting these features into an intelligibility score. High correlations (about 0.90) between objective and perceptual scores are obtained with a system comprising two different speech recognizers: one with traditional acoustic models relating acoustical observations to triphone states and one using phonological features as an intermediate layer between the acoustical observations and the phonetic states.

[1]  Paul Van de Heyning,et al.  Reliability and Clinical Relevance of Segmental Analysis Based on Intelligibility Assessment , 2008, Folia Phoniatrica et Logopaedica.

[2]  Jean-Pierre Martens,et al.  On The Use of Phonological Features for Pronunciation Scoring , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[3]  J. Martens,et al.  Speech technology-based assessment of phoneme intelligibility in dysarthria. , 2009, International journal of language & communication disorders.

[4]  Jean-Pierre Martens,et al.  Speech recognition with phonological features: some issues to attend , 2006, INTERSPEECH.

[5]  Raymond D. Kent,et al.  Toward phonetic intelligibility testing in dysarthria. , 1989, The Journal of speech and hearing disorders.

[6]  Kris Demuynck,et al.  Extracting, modelling and combining information in speech recognition , 2001 .

[7]  Elmar Nöth,et al.  Towards robust automatic evaluation of pathologic telephone speech , 2007, 2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU).

[8]  Kathryn M. Yorkston,et al.  Comprehensibility of Dysarthric Speech , 1996 .

[9]  Elmar Nöth,et al.  Automatic scoring of the intelligibility in patients with cancer of the oral cavity , 2007, INTERSPEECH.

[10]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[11]  Phil D. Green,et al.  Revisiting dysarthria assessment intelligibility metrics , 2004, INTERSPEECH.