Detection of vowel onset point events using excitation information

This paper proposes a method for detecting Vowel Onset Point (VOP) events in speech using excitation information. A VOP event is defined as the instant at which the onset of a vowel takes place. For syllable-like units of the Consonant Vowel (CV) type, the VOP event is the instant at which the consonant ends and the vowel begins. The speech signal is processed by Linear Prediction (LP) analysis to extract the LP residual, which mostly contains the excitation information. The Hilbert envelope of the LP residual is derived using the analytic signal concept. VOP events are then detected using the Hilbert envelope of the LP residual and a modulated Gaussian window function. The performance of the proposed method is evaluated against reference VOP markings and compared with existing methods based on vocal tract system features. The comparison shows that the excitation source also contains significant information about the VOP events.
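The processing chain named in the abstract (LP analysis, LP residual, Hilbert envelope, modulated Gaussian window) can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. The abstract does not give the LP order, frame length, window shape or peak-picking rule, so everything below is an assumption chosen for illustration: an order-10 autocorrelation-method LP on non-overlapping 160-sample frames, a Gaussian multiplied by one period of a sine as the "modulated Gaussian" window, and a simple thresholded peak picker. All function names are hypothetical.

```python
import numpy as np

def lp_coefficients(frame, order):
    """Autocorrelation-method LP: solve the normal equations R a = r."""
    w = frame * np.hamming(len(frame))
    r = np.correlate(w, w, mode="full")[len(w) - 1:len(w) + order]
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    R += 1e-9 * (r[0] + 1.0) * np.eye(order)  # small ridge for numerical stability
    return np.linalg.solve(R, r[1:order + 1])

def lp_residual(signal, order=10, frame_len=160):
    """Inverse-filter each frame: e[n] = s[n] - sum_k a_k s[n-k]."""
    signal = np.asarray(signal, dtype=float)
    residual = np.zeros_like(signal)
    padded = np.concatenate([np.zeros(order), signal])  # history for the first frame
    for start in range(0, len(signal), frame_len):
        stop = min(start + frame_len, len(signal))
        if stop - start <= order:  # frame too short to analyse; pass it through
            residual[start:stop] = signal[start:stop]
            continue
        a = lp_coefficients(signal[start:stop], order)
        inverse_filter = np.concatenate([[1.0], -a])
        residual[start:stop] = np.convolve(
            padded[start:stop + order], inverse_filter, mode="valid")
    return residual

def hilbert_envelope(x):
    """Magnitude of the analytic signal, built by zeroing negative frequencies."""
    n = len(x)
    gain = np.zeros(n)
    gain[0] = 1.0
    if n % 2 == 0:
        gain[n // 2] = 1.0
        gain[1:n // 2] = 2.0
    else:
        gain[1:(n + 1) // 2] = 2.0
    return np.abs(np.fft.ifft(np.fft.fft(x) * gain))

def modulated_gaussian(sigma=100.0):
    """Assumed window: Gaussian times one sine period (odd about its centre)."""
    half = int(3 * sigma)
    m = np.arange(-half, half + 1)
    return np.exp(-m ** 2 / (2 * sigma ** 2)) * np.sin(np.pi * m / half)

def pick_peaks(evidence, threshold_ratio=0.5, min_distance=400):
    """Keep local maxima above a fraction of the global maximum, strongest first."""
    threshold = threshold_ratio * np.max(evidence)
    candidates = [n for n in range(1, len(evidence) - 1)
                  if evidence[n] >= threshold
                  and evidence[n] > evidence[n - 1]
                  and evidence[n] >= evidence[n + 1]]
    peaks = []
    for n in sorted(candidates, key=lambda i: -evidence[i]):
        if all(abs(n - p) >= min_distance for p in peaks):
            peaks.append(n)
    return sorted(peaks)

def detect_vops(signal, order=10, frame_len=160, sigma=100.0):
    """Return sample indices of hypothesised VOP events."""
    envelope = hilbert_envelope(lp_residual(signal, order, frame_len))
    # Correlating with an odd window weights upcoming samples positively and
    # past samples negatively, so the output peaks where the envelope rises.
    evidence = np.correlate(envelope, modulated_gaussian(sigma), mode="same")
    return pick_peaks(evidence)
```

Calling `detect_vops(signal)` on a one-dimensional array of speech samples returns candidate VOP locations as sample indices. The default values assume a sampling rate of roughly 8 kHz (20 ms frames, a minimum spacing of 50 ms between detections) and would need to be rescaled for other rates.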
