LF model based glottal source parameter estimation by extended Kalman filtering

Abstract: A new algorithm for glottal source parameter estimation of voiced speech based on the Liljencrants-Fant (LF) model is presented in this work. Each pitch period of the inverse-filtered glottal flow derivative is divided into two phases at the glottal closing instant, and an extended Kalman filter is applied iteratively to estimate the shape-controlling parameters of both phases. An optimal set of estimates is then obtained by searching for the minimum mean square error between the reconstructed LF pulse and the original signal. Preliminary experimental results show that the proposed algorithm is effective over a wide range of LF parameters, for different voice qualities and different noise levels. Its accuracy, especially for the return phase parameters, compares favorably with standard time-domain fitting methods, while its computational load is significantly lower.
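
As a rough illustration of the idea summarized in the abstract, the sketch below runs a scalar extended Kalman filter over a simplified return phase and keeps the candidate whose reconstructed pulse has the lowest mean square error. It is not the authors' formulation. The pure exponential return phase (the LF offset term is omitted), the names `return_phase`, `ekf_estimate_eps`, and `best_fit`, the noise variances, the initial guesses, and the positivity clamp are all assumptions made for this example.

```python
import math

def return_phase(eps, n, ee=1.0):
    """Assumed simplified return phase: -ee * exp(-eps * k), k in samples."""
    return [-ee * math.exp(-eps * k) for k in range(n)]

def ekf_estimate_eps(signal, eps0, ee=1.0, p0=1.0, q=0.0, r=1e-4):
    """Scalar EKF that treats the decay constant eps as a constant state."""
    eps, p = eps0, p0
    for k, y in enumerate(signal):
        p += q                                # predict: state model is identity
        pred = -ee * math.exp(-eps * k)       # measurement model h(eps)
        h_jac = ee * k * math.exp(-eps * k)   # Jacobian dh/d(eps)
        gain = p * h_jac / (h_jac * p * h_jac + r)
        # Update; the clamp is an added safeguard that keeps the decay positive.
        eps = max(eps + gain * (y - pred), 1e-6)
        p = (1.0 - gain * h_jac) * p
    return eps

def mse(a, b):
    """Mean square error between two equal-length sequences."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)

def best_fit(signal, initial_guesses):
    """Run the EKF from each initial guess and keep the minimum-MSE result."""
    candidates = [ekf_estimate_eps(signal, g) for g in initial_guesses]
    return min(candidates,
               key=lambda e: mse(return_phase(e, len(signal)), signal))

# Synthetic, noise-free return phase with a known decay constant.
true_eps = 0.15
observed = return_phase(true_eps, 40)
estimate = best_fit(observed, [0.05, 0.1, 0.3])
print(estimate)
```

Running the filter from several starting points and selecting by reconstruction error mirrors the search step described in the abstract. A full LF fit would also need the open phase parameters and a model of the inverse-filtering noise, which this sketch leaves out.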
