论文信息 - Modeling of english speech for the design of a distributed speech understanding system

Modeling of english speech for the design of a distributed speech understanding system

This paper describes the derivation and verification of a phoneme model of English speech. The model is used to generate a stream of phonemically labeled speech frames to model speech input for the design of a distributed speech understanding system. New computer architectures to perform speech understanding in real time should incorporate information about the characteristics of English speech. In order to predict the performance of a new architecture, it is necessary to simulate the design using either massive amounts of speech data or, as an alternative, a statistical model of speech. A statistically generated phoneme stream is used to avoid the difficulty of performing computationally intensive acoustic parameterization on the enormous amount of speech input data which would be required to obtain representative phoneme distributions and patterns of speech.

Edward J. Coyle | Leah J. Siegel | Edward C. Bronson

[1] T H Crystal,et al. Segmental durations in connected speech signals: preliminary results. , 1982, The Journal of the Acoustical Society of America.

[2] R M Dalston,et al. Acoustic characteristics of English /w,r,l/ spoken correctly by young children and adults. , 1975, The Journal of the Acoustical Society of America.

[3] Leah H. Jamieson,et al. A Parallel Architecture for Labeling, Segmentation, and Lexical Processing in Speech Understanding , 1983, ICPP.

[4] Lawrence R. Rabiner,et al. A pattern recognition approach to voiced-unvoiced-silence classification with applications to speech recognition , 1976 .

[5] N. Umeda. Vowel duration in American English. , 1975, The Journal of the Acoustical Society of America.

[6] N. Umeda. Consonant duration in American English , 1977 .