Speech assistive technology to improve the interaction of dysarthric speakers with machines

In this paper, we propose a communication scheme using automatic speech recognition and speech synthesis in order to assist dysarthric speakers. This system aims at improving recognition rate and intelligibility of dysarthric speech. An HMM-based recognizer using variable duration of Hamming window permits to raise the recognition rate of dysarthric speech up to 80 %. In order to improve the intelligibility of synthetic speech while keeping the naturalness close to the voice of dysarthric speaker, we synthesize the recognized text using new basic unit segmenter, a new concatenating algorithm and a grafting technique to correct the bad pronounced phonemes. The NEMOURS dysarthric database is used to evaluate the proposed assistive communication system. Results show that a rate of 65 % to 80% of correct word recognition and a Mean Opinion Score (MOS) of 4 were obtained.

[1]  James H. Martin,et al.  Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition , 2000 .

[2]  Douglas D. O'Shaughnessy,et al.  Speech communications - human and machine, 2nd Edition , 2000 .

[3]  A. Aronson,et al.  Differential diagnostic patterns of dysarthria. , 1969, Journal of speech and hearing research.

[4]  Raymond D. Kent,et al.  Toward phonetic intelligibility testing in dysarthria. , 1989, The Journal of speech and hearing disorders.

[5]  K Bon,et al.  A Statistical Causal Model for the Assessment of Dysarthric Speech and the Utility of Computer-Based Speech Recognition , 1993 .

[6]  Fangxin Chen,et al.  Optimization of dysarthric speech recognition , 1997, Proceedings of the 19th Annual International Conference of the IEEE Engineering in Medicine and Biology Society. 'Magnificent Milestones and Emerging Opportunities in Medical Engineering' (Cat. No.97CH36136).

[7]  P.D. Polur,et al.  Experiments with fast Fourier transform, linear predictive and cepstral coefficients in dysarthric speech recognition algorithms using hidden Markov model , 2005, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[8]  John-Paul Hosom,et al.  Intelligibility of modifications to dysarthric speech , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[9]  Luís C. Oliveira,et al.  Pitch-synchronous time-scaling for prosodic and voice quality transformations , 2005, INTERSPEECH.

[10]  H. Timothy Bunnell,et al.  The Nemours database of dysarthric speech , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.