Investigating the relationship between glottal area waveform shape and harmonic magnitudes through computational modeling and laryngeal high-speed videoendoscopy

The glottal open quotient (OQ) is often associated with the amplitude of the first source harmonic relative to the second (H1*H2*), which is assumed to be one cause of a change in vocal quality along a breathy-to-pressed continuum. The association between OQ and H1*-H2* was investigated in a group of 5 human subjects and also in a computational voice production simulation. The simulation incorporated a parametric voice source model into a nonlinear source-filter framework. H1*-H2* and OQ were measured synchronously from audio recordings and high-speed laryngeal videoendoscopy of “glide” phonations in which quality varied continuously from breathy to pressed. Analyses of individual speakers showed large differences in the relationship between OQ and H1*-H2*. The variability in laryngeal high-speed data was consistent with simulation results, which showed that the relationship between OQ and H1*-H2* depended on mean glottal area, a parameter associated with the degree of source-filter interaction and not directly measurable from high-speed video of the vocal folds. In addition, H1*-H2* may change with increasing glottal gap size; this change contributes to the observed variability in the relationship between H1*-H2* and OQ. Index Terms: harmonic magnitudes, laryngeal high-speed videoendoscopy, glottal area waveform, open quotient

[1]  I. Titze Theoretical analysis of maximum flow declination rate versus maximum area declination rate in phonation. , 2006, Journal of speech, language, and hearing research : JSLHR.

[2]  Abeer Alwan,et al.  A new voice source model based on high-speed imaging and its application to voice source estimation , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[3]  A. Rosenberg Effect of glottal pulse shape on the quality of natural vowels. , 1969 .

[4]  A. Alwan,et al.  Variability in the relationships among voice quality, harmonic amplitudes, open quotient, and glottal area waveform shape in sustained phonation. , 2012, The Journal of the Acoustical Society of America.

[5]  Raymond N. J. Veldhuis,et al.  The effect of speech melody on voice quality , 2001, Speech Commun..

[6]  Patricia A. Keating,et al.  Voicesauce: A Program for Voice Analysis , 2009, ICPhS.

[7]  I. Titze,et al.  Acoustic interactions of the voice source with the lower vocal tract. , 1997, The Journal of the Acoustical Society of America.

[8]  Abeer Alwan,et al.  Age, sex, and vowel dependencies of acoustic measures related to the voice source. , 2007, The Journal of the Acoustical Society of America.

[9]  I. Titze The myoelastic aerodynamic theory of phonation , 2006 .

[10]  J. Liljencrants,et al.  Dept. for Speech, Music and Hearing Quarterly Progress and Status Report a Four-parameter Model of Glottal Flow , 2022 .

[11]  Abeer Alwan,et al.  On the interdependencies between voice quality, glottal gaps, and voice-source related acoustic measures , 2010, INTERSPEECH.

[12]  Brad H Story,et al.  Relation of structural and vibratory kinematics of the vocal folds to two acoustic measures of breathy voice based on computational modeling. , 2011, Journal of speech, language, and hearing research : JSLHR.

[13]  B. Story A parametric model of the vocal tract area function for vowel and consonant simulation. , 2005, The Journal of the Acoustical Society of America.

[14]  A. Rosenberg Effect of glottal pulse shape on the quality of natural vowels. , 1969, The Journal of the Acoustical Society of America.

[15]  I. Titze Nonlinear source-filter coupling in phonation: theory. , 2008, The Journal of the Acoustical Society of America.

[16]  Luc Mongeau,et al.  Influence of acoustic loading on an effective single mass model of the vocal folds. , 2007, The Journal of the Acoustical Society of America.

[17]  G. Fant,et al.  Preliminaries to analysis of the human voice source , 2007 .

[18]  M. Döllinger,et al.  High‐Precision Measurement of the Vocal Fold Length and Vibratory Amplitudes , 2002, The Laryngoscope.

[19]  Daryush D. Mehta,et al.  Investigating acoustic correlates of human vocal fold vibratory phase asymmetry through modeling and laryngeal high-speed videoendoscopy. , 2011, The Journal of the Acoustical Society of America.

[20]  I. Titze,et al.  Voice simulation with a body-cover model of the vocal folds. , 1995, The Journal of the Acoustical Society of America.

[21]  D. Klatt,et al.  Analysis, synthesis, and perception of voice quality variations among female and male talkers. , 1990, The Journal of the Acoustical Society of America.

[22]  Neil Genzlinger A. and Q , 2006 .

[23]  Hideki Kawahara,et al.  Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds , 1999, Speech Commun..

[24]  Marion Kee,et al.  Analysis , 2004, Machine Translation.

[25]  Gunnar Fant,et al.  Acoustic Theory Of Speech Production , 1960 .

[26]  A. Alwan,et al.  Development of a glottal area index that integrates glottal gap size and open quotient. , 2013, The Journal of the Acoustical Society of America.

[27]  Abeer Alwan,et al.  Acoustic Correlates of Glottal Gaps , 2011, INTERSPEECH.

[28]  J. Perkell,et al.  Comparisons among aerodynamic, electroglottographic, and acoustic spectral measures of female voice. , 1995, Journal of speech and hearing research.

[29]  Abeer Alwan,et al.  Estimating the voice source in noise , 2012, INTERSPEECH.

[30]  Ken-Ichi Sakakibara,et al.  Estimation of glottal area function using stereo-endoscopic high-speed digital imaging , 2010, INTERSPEECH.

[31]  Hiroya Fujisaki,et al.  Proposal and evaluation of models for the glottal source waveform , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[32]  B. Blagnys,et al.  To "EE" or not to "EE". , 2007, The Journal of otolaryngology.

[33]  D. Bless,et al.  Videostroboscopic examination of the larynx , 1993 .

[34]  Christophe d'Alessandro,et al.  Spectral correlates of voice open quotient and glottal flow asymmetry : theory, limits and experimental data , 2001, INTERSPEECH.

[35]  H M Hanson,et al.  Glottal characteristics of female speakers: acoustic correlates. , 1997, The Journal of the Acoustical Society of America.

[36]  I. Titze,et al.  Nonlinear source-filter coupling in phonation: vocal exercises. , 2008, The Journal of the Acoustical Society of America.