Paper Information - Effect of Learning on Listening to Ultra-Fast Synthesized Speech

A text-to-speech synthesizer that produces easily understandable speech at very fast speaking rates is expected to help people with visual disabilities acquire information efficiently with screen-reading software. We investigated the intelligibility of Japanese text-to-speech systems at fast speaking rates, using four-digit random numbers as the vocabulary of a recall test. We also studied a fast and intelligible text-to-speech engine, using an HMM-based synthesizer with a corpus recorded at a fast speaking rate. The results showed that statistical models trained on the fast-speech corpus were effective. The learning effect was significant in the early stage of the trials, and the effect persisted for several weeks.
