Estimating the voice source in noise

Estimation of the glottal source has applications in many areas of speech processing. Therefore, a noise-robust automatic source estimation algorithm is proposed in this paper. The source signal is estimated using a codebook search approach. The glottal area waveforms extracted from high-speed recordings of the glottis is converted to the glottal flow signals in order to evaluate the performance of the proposed source estimation algorithm. Results in clean and noisy conditions, on average, show that the proposed algorithm provides more accurate estimation than the software toolkit Aparat [1] as well as an earlier approach [2].

[1]  T. V. Ananthapadmanabha,et al.  Calculation of true glottal flow and its components , 1982, Speech Commun..

[2]  Abeer Alwan,et al.  A new voice source model based on high-speed imaging and its application to voice source estimation , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[3]  Abeer Alwan,et al.  A novel codebook search technique for estimating the open quotient , 2009, INTERSPEECH.

[4]  Hiroya Fujisaki,et al.  Proposal and evaluation of models for the glottal source waveform , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  John Kane,et al.  A spectral LF model based approach to voice source parameterisation , 2010, INTERSPEECH.

[6]  H. Strube,et al.  SIM--simultaneous inverse filtering and matching of a glottal flow model for acoustic speech signals. , 2001, The Journal of the Acoustical Society of America.

[7]  I. Titze,et al.  Acoustic interactions of the voice source with the lower vocal tract. , 1997, The Journal of the Acoustical Society of America.

[8]  J. Liljencrants,et al.  Dept. for Speech, Music and Hearing Quarterly Progress and Status Report a Four-parameter Model of Glottal Flow , 2022 .

[9]  Bert Cranen,et al.  Modeling a leaky glottis. , 1992 .

[10]  Gunnar Fant,et al.  Acoustic Theory Of Speech Production , 1960 .

[11]  Abeer Alwan,et al.  The voice source in speech production: data, analysis and models , 2010 .

[12]  Coarticulation • Suprasegmentals,et al.  Acoustic Phonetics , 2019, The SAGE Encyclopedia of Human Communication Sciences and Disorders.

[13]  E. Hoffman,et al.  Vocal tract area functions from magnetic resonance imaging. , 1996, The Journal of the Acoustical Society of America.

[14]  I. Titze,et al.  Rules for controlling low-dimensional vocal fold models with muscle activation. , 2002, The Journal of the Acoustical Society of America.

[15]  J W Hawks,et al.  A formant bandwidth estimation procedure for vowel synthesis [43.72.Ja]. , 1995, The Journal of the Acoustical Society of America.

[16]  A. Rosenberg Effect of glottal pulse shape on the quality of natural vowels. , 1969 .

[17]  Paavo Alku,et al.  Glottal wave analysis with Pitch Synchronous Iterative Adaptive Inverse Filtering , 1991, Speech Commun..

[18]  Roy D. Patterson,et al.  An instantaneous-frequency-based pitch extraction method for high-quality speech transformation: revised TEMPO in the STRAIGHT-suite , 1998, ICSLP.

[19]  Matti Airas,et al.  TKK Aparat: An environment for voice inverse filtering and parameterization , 2008, Logopedics, phoniatrics, vocology.

[20]  I. Titze,et al.  Voice simulation with a body-cover model of the vocal folds. , 1995, The Journal of the Acoustical Society of America.

[21]  Abeer Alwan,et al.  Acoustic Correlates of Glottal Gaps , 2011, INTERSPEECH.

[22]  Elliot Moore,et al.  Algorithm for automatic glottal waveform estimation without the reliance on precise glottal closure information , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[23]  Paavo Alku,et al.  Parameterisation Methods of the Glottal Flow Estimated by Inverse Filtering , 1985 .

[24]  A. Rosenberg Effect of glottal pulse shape on the quality of natural vowels. , 1969, The Journal of the Acoustical Society of America.

[25]  Los Angeles,et al.  The Voice Source in Speech Production: Data, Analysis and Models , 2010 .