论文信息 - Adapting acoustic and lexical models to dysarthric speech

Adapting acoustic and lexical models to dysarthric speech

Dysarthria is a motor speech disorder resulting from neurological damage to the part of the brain that controls the physical production of speech. It is, in part, characterized by pronunciation errors that include deletions, substitutions, insertions, and distortions of phonemes. These errors follow consistent intra-speaker patterns that we exploit through acoustic and lexical model adaptation to improve automatic speech recognition (ASR) on dysarthric speech. We show that acoustic model adaptation yields an average relative word error rate (WER) reduction of 36.99% and that pronunciation lexicon adaptation (PLA) further reduces the relative WER by an average of 8.29% on a large vocabulary task of over 1500 words for six speakers with severe to moderate dysarthria. PLA also shows an average relative WER reduction of 7.11% on speaker-dependent models evaluated using 5-fold cross-validation.

Frank Rudzicz | Kinfe Tadesse Mengistu | K. T. Mengistu | Frank Rudzicz

[1] Sheri Hunnicutt,et al. An investigation of different degrees of dysarthric speech as input to speaker-adaptive and speaker-dependent recognition systems , 2001 .

[2] P. Enderby,et al. Frenchay Dysarthria Assessment , 1983 .

[3] Lawrence S. Meyers,et al. Computer recognition of the speech of adults with cerebral palsy and dysarthria , 1991 .

[4] W. Marsden. I and J , 2012 .

[5] Karen A Hux,et al. Accuracy of three speech recognition systems: Case study of dysarthric speech , 2000 .

[6] Victor Zue,et al. Speech database development at MIT: Timit and beyond , 1990, Speech Commun..

[7] P. Green,et al. Automatic speech recognition and training for severely dysarthric users of assistive technology: The STARDUST project , 2006, Clinical linguistics & phonetics.

[8] David R. Beukelman,et al. Clinical Management of Dysarthric Speakers , 1987 .

[9] Frank RudziczAravind. The TORGO database of acoustic and articulatory speech from speakers with dysarthria , 2012 .

[10] K. Hux,et al. Speech recognition training for enhancing written language generation by a traumatic brain injury survivor. , 2000, Brain injury.

[11] Raymond D. Kent,et al. Toward phonetic intelligibility testing in dysarthria. , 1989, The Journal of speech and hearing disorders.