Phonemic segmentation of fluent speech

A hierarchical approach to phonemic segmentation of continuous, speaker-independent speech is presented. Each sentence is divided into distinct obstruent and sonorant regions using a Bayesian decision surface. Rules are then used to make context specific corrections with these regions. Finally, finer segmentation is performed using a number of rules specific to obstruent and sonorant boundaries. Around 80% of the boundaries are located with an insertion rate of 12%. The developed system is suitable for use in phoneme recognition and automatic labelling of speech.<<ETX>>

[1]  V.W. Zue,et al.  The use of speech knowledge in automatic speech recognition , 1985, Proceedings of the IEEE.

[2]  Harold T. Edwards,et al.  Applied Phonetics: The Sounds of American English , 1992 .

[3]  Renato De Mori,et al.  Learning and Plan Refinement in a Knowledge-Based System for Automatic Speech Recognition , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  James Glass,et al.  Acoustic segmentation and phonetic classification in the SUMMIT system , 1988, International Conference on Acoustics, Speech, and Signal Processing,.

[5]  David B. Grayden,et al.  Recognition of obstruent phonemes in speaker-independent fluent speech using a hierarchical approach , 1993, EUROSPEECH.

[6]  D. O'Shaughnessy,et al.  Using phoneme duration and energy contour information to improve large vocabulary isolated-word recognition , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[7]  James Glass,et al.  Multi-level acoustic segmentation of continuous speech , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[8]  Kiyohiro Shikano,et al.  Phoneme segmentation using spectrogram reading knowledge , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[9]  C. Espy-Wilson A phonetically based semivowel recognition system , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[10]  Laurent Miclet,et al.  Recognition of speaker-dependent continuous speech with KEAL , 1990 .