Towards an ASR-free objective analysis of pathological speech

Intelligibility is a widely used measure of the severity of a pathological speaker's articulatory deficiencies. It is usually obtained by means of a perceptual test consisting of nonconventional and/or nonconnected words. In previous work, we developed a system incorporating two automatic speech recognizers (ASRs) that could estimate phoneme intelligibility (PI) fairly accurately. In the present paper, we propose a novel method that aims to assess running speech intelligibility (RSI), a more relevant indicator of a speaker's communication efficiency in a natural setting. The proposed method computes a phonological characterization of the speaker by means of a statistical analysis of frame-level phonological features. Importantly, this analysis requires no knowledge of what the speaker was supposed to say. The new characterization is demonstrated to predict PI and to provide valuable information about the nature and severity of the pathology.

Index Terms: objective intelligibility assessment, pathological speech, phonological features, running speech
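To make the idea of a text-independent statistical analysis of frame-level phonological features concrete, the following is a minimal sketch and not the authors' method. It assumes each frame is a list of feature scores in [0, 1]; the function name `characterize_speaker`, the feature names, the chosen statistics, and the 0.5 activity threshold are all hypothetical choices made for illustration. Note that no transcript of the intended utterance is used anywhere.

```python
from statistics import mean, pstdev


def characterize_speaker(frames, feature_names, threshold=0.5):
    """Summarize frame-level phonological feature scores per feature.

    frames: list of frames, each a list of scores in [0, 1] (assumed format).
    feature_names: one name per score position (hypothetical labels).
    threshold: assumed cut-off above which a feature counts as "active".
    """
    if not frames:
        raise ValueError("need at least one frame")
    summary = {}
    for j, name in enumerate(feature_names):
        # Collect the j-th feature score across all frames of the recording.
        values = [frame[j] for frame in frames]
        summary[name] = {
            "mean": mean(values),
            "std": pstdev(values),  # population standard deviation
            "active_fraction": sum(v > threshold for v in values) / len(values),
        }
    return summary


# Toy input: four frames, two hypothetical features.
toy_frames = [[0.9, 0.1], [0.7, 0.3], [0.2, 0.8], [0.6, 0.4]]
toy_summary = characterize_speaker(toy_frames, ["voiced", "nasal"])
```

In a real system the per-feature statistics would presumably serve as inputs to a regression model that predicts an intelligibility score; that step is omitted here because the abstract does not specify it.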
