Inverse glottal filtering using a parameterized input model

In this paper computational algorithms for inverse glottal filtering are studied. The objective of inverse glottal filtering is to estimate the driving source. A good model for the glottal pulse is useful for, e.g., speech synthesis, speech recognition and speaker diagnostics. One common approach is to use a parameterized model of the input signal, i.e., the glottal pulses. The algorithm presented enables simultaneous estimation of the parameters of the input signal and the parameters of the system transfer function, the vocal tract model. The presentation here is restricted to transfer functions of all-pole type, i.e., AR-models. The method can be extended to handle zeros in the transfer function. The computational burden would, however, increase significantly. The algorithm uses efficient numerical methods, as, for instance, QR-factorization through Householder transformations.

[1]  Paul H. Milenkovic,et al.  Glottal inverse filtering by joint estimation of an AR system with a linear input model , 1986, IEEE Trans. Acoust. Speech Signal Process..

[2]  V. Klema LINPACK user's guide , 1980 .

[3]  Åke Björck,et al.  Numerical Methods , 1995, Handbook of Marine Craft Hydrodynamics and Motion Control.

[4]  Ronald W. Schafer,et al.  Digital Processing of Speech Signals , 1978 .

[5]  H. Strube Determination of the instant of glottal closure from the speech wave. , 1974, The Journal of the Acoustical Society of America.

[6]  Hiroya Fujisaki,et al.  Estimation of voice source and vocal tract parameters based on ARMA analysis and a model for the Glottal source waveform , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7]  Per Hedelin High quality glottal LPC-vocoding , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[8]  Gene H. Golub,et al.  Matrix computations , 1983 .

[9]  Per Hedelin A glottal LPC-vocoder , 1984, ICASSP.

[10]  Gunnar Fant,et al.  Acoustic Theory Of Speech Production , 1960 .

[11]  R. B. Monsen,et al.  Study of variations in the male and female glottal wave. , 1976, The Journal of the Acoustical Society of America.

[12]  Hiroya Fujisaki,et al.  Proposal and evaluation of models for the glottal source waveform , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[13]  John E. Markel,et al.  Linear Prediction of Speech , 1976, Communication and Cybernetics.

[14]  John E. Dennis,et al.  Numerical methods for unconstrained optimization and nonlinear equations , 1983, Prentice Hall series in computational mathematics.

[15]  J. Holmes,et al.  The influence of glottal waveform on the naturalness of speech from a parallel formant synthesizer , 1973 .

[16]  A. Gray,et al.  Least squares glottal inverse filtering from the acoustic speech waveform , 1979 .