Detection of the glottal closure by jumps in the statistical properties of the speech signal

Abstract Two new methods are presented here for the detection of the glottal closure instant from the speech waveform. Both detect abrupt changes in the short-term spectral characteristics of the speech signal within a pitch period caused by glottal events. The same statistical approach to the sequential detection of events by hypothesis testing is used for this purpose. The first method is based on the maximization of the likelihood ratio, while the second uses a divergence convexity test. Experiments on real speech data demonstrate the robust of these methods.

[1]  Lou Boves,et al.  Robust ARMA analysis for accurate determination of system parameters of the voice source and vocal tract , 1989, EUROSPEECH.

[2]  Per Hedelin High quality glottal LPC-vocoding , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  Gunnar Fant,et al.  Acoustic Theory Of Speech Production , 1960 .

[4]  John N. Holmes,et al.  Formant excitation before and after glottal closure , 1976, ICASSP.

[5]  Régine André-Obrecht,et al.  A new statistical approach for the automatic segmentation of continuous speech signals , 1988, IEEE Trans. Acoust. Speech Signal Process..

[6]  Douglas D. O'Shaughnessy,et al.  Automatic and reliable estimation of glottal closure instant and period , 1989, IEEE Trans. Acoust. Speech Signal Process..

[7]  John S. Bridle,et al.  Interactive digital inverse filtering and its relation to linear prediction methods , 1978, ICASSP.

[8]  Steven Kay Recursive maximum likelihood estimation of autoregressive processes , 1983 .

[9]  John Makhoul,et al.  Adaptive lattice analysis of speech , 1981 .

[10]  H. Strube Determination of the instant of glottal closure from the speech wave. , 1974, The Journal of the Acoustical Society of America.

[11]  B. Guérin,et al.  [Influences of acoustic coupling between source and vocal tract of the Fo of oral vowels. Consequence for the study of intrinsic characteristics]. , 1980, Phonetica.

[12]  B. Yegnanarayana,et al.  Epoch extraction of voiced speech , 1975 .

[13]  David G. Messerschmitt,et al.  Adaptive Filters: Structures, Algorithms and Applications , 1984 .

[14]  John R. Deller Evaluation of laryngeal dysfunction based on features of an accurate estimate of the glottal waveform , 1982, ICASSP.

[15]  Kishan G. Mehrotra,et al.  Performance comparison of autoregressive estimation methods , 1984, ICASSP.

[16]  A. Gray,et al.  Least squares glottal inverse filtering from the acoustic speech waveform , 1979 .

[17]  T. V. Ananthapadmanabha,et al.  Calculation of true glottal flow and its components , 1982, Speech Commun..

[18]  D. Veeneman,et al.  Automatic glottal inverse filtering from speech and electroglottographic signals , 1985, IEEE Trans. Acoust. Speech Signal Process..

[19]  John E. Markel,et al.  Linear Prediction of Speech , 1976, Communication and Cybernetics.

[20]  Solomon Kullback,et al.  Information Theory and Statistics , 1960 .