Variability in the relationships among voice quality, harmonic amplitudes, open quotient, and glottal area waveform shape in sustained phonation.

Increases in open quotient are widely assumed to cause changes in the amplitude of the first harmonic relative to the second (H1*-H2*), which in turn correspond to increases in perceived vocal breathiness. Empirical support for these assumptions is rather limited, and reported relationships among these three descriptive levels have been variable. This study examined the empirical relationship among H1*-H2*, the glottal open quotient (OQ), and glottal area waveform skewness, measured synchronously from audio recordings and high-speed video images of the larynges of six phonetically knowledgeable, vocally healthy speakers who varied fundamental frequency and voice qualities quasi-orthogonally. Across speakers and voice qualities, OQ, the asymmetry coefficient, and fundamental frequency accounted for an average of 74% of the variance in H1*-H2*. However, analyses of individual speakers showed large differences in the strategies used to produce the same intended voice qualities. Thus, H1*-H2* can be predicted with good overall accuracy, but its relationship to phonatory characteristics appears to be speaker dependent.

[1]  A. Rosenberg Effect of glottal pulse shape on the quality of natural vowels. , 1969 .

[2]  A. Rosenberg Effect of glottal pulse shape on the quality of natural vowels. , 1969, The Journal of the Acoustical Society of America.

[3]  M.G. Bellanger,et al.  Digital processing of speech signals , 1980, Proceedings of the IEEE.

[4]  Hiroya Fujisaki,et al.  Proposal and evaluation of models for the glottal source waveform , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  M. Huffman Measures of phonation type in Hmong. , 1987, The Journal of the Acoustical Society of America.

[6]  D. Klatt,et al.  Analysis, synthesis, and perception of voice quality variations among female and male talkers. , 1990, The Journal of the Acoustical Society of America.

[7]  M. Södersten,et al.  Glottal closure and perceived breathiness during phonation in normally speaking subjects. , 1990, Journal of speech and hearing research.

[8]  B. Hammarberg,et al.  Vocal Fold Physiology: Acoustic, Perceptual, and Physiological Aspects of Voice Mechanisms , 1991 .

[9]  J W Hawks,et al.  A formant bandwidth estimation procedure for vowel synthesis [43.72.Ja]. , 1995, The Journal of the Acoustical Society of America.

[10]  J. Perkell,et al.  Comparisons among aerodynamic, electroglottographic, and acoustic spectral measures of female voice. , 1995, Journal of speech and hearing research.

[11]  J. Hillenbrand,et al.  Acoustic correlates of breathy vocal quality: dysphonic voices and continuous speech. , 1996, Journal of speech and hearing research.

[12]  Gunnar Fant,et al.  The voice source in connected speech , 1997, Speech Commun..

[13]  Boris Doval,et al.  Spectral correlates of glottal waveform models: an analytic study , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[14]  H M Hanson,et al.  Glottal characteristics of female speakers: acoustic correlates. , 1997, The Journal of the Acoustical Society of America.

[15]  R Veldhuis,et al.  A computationally efficient alternative for the Liljencrants-Fant model and its perceptual evaluation. , 1998, The Journal of the Acoustical Society of America.

[16]  Hideki Kawahara,et al.  Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds , 1999, Speech Commun..

[17]  J Sundberg,et al.  Effects of subglottal pressure variation on professional baritone singers' voice sources. , 1999, The Journal of the Acoustical Society of America.

[18]  Christophe d'Alessandro,et al.  Spectral correlates of voice open quotient and glottal flow asymmetry : theory, limits and experimental data , 2001, INTERSPEECH.

[19]  Raymond N. J. Veldhuis,et al.  The effect of speech melody on voice quality , 2001, Speech Commun..

[20]  Coarticulation • Suprasegmentals,et al.  Acoustic Phonetics , 2019, The SAGE Encyclopedia of Human Communication Sciences and Disorders.

[21]  Frank H. Guenther,et al.  A neural network model of speech acquisition and motor equivalent speech production , 2004, Biological Cybernetics.

[22]  Nathalie Henrich Bernardoni,et al.  The spectrum of glottal flow models , 2006 .

[23]  B. Blagnys,et al.  To "EE" or not to "EE". , 2007, The Journal of otolaryngology.

[24]  M. S. Howe,et al.  Sound generated by aerodynamic sources near a deformable body, with application to voiced speech , 2007, Journal of Fluid Mechanics.

[25]  Abeer Alwan,et al.  Age, sex, and vowel dependencies of acoustic measures related to the voice source. , 2007, The Journal of the Acoustical Society of America.

[26]  J. Liljencrants,et al.  Dept. for Speech, Music and Hearing Quarterly Progress and Status Report a Four-parameter Model of Glottal Flow , 2022 .

[27]  Christian T. DiCanio The phonetics of register in Takhian Thong Chong , 2009, Journal of the International Phonetic Association.

[28]  Jody Kreiman,et al.  Effects of native language on perception of voice quality , 2010, J. Phonetics.

[29]  Jody Kreiman,et al.  Integrated software for analysis and synthesis of voice quality , 2010, Behavior research methods.

[30]  Abeer Alwan,et al.  A new voice source model based on high-speed imaging and its application to voice source estimation , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[31]  Patricia A. Keating,et al.  Voicesauce: A Program for Voice Analysis , 2009, ICPhS.

[32]  Recognizing Speaker Identity From Voice: Theoretical and Ethological Perspectives and a Psychological Model , 2011 .

[33]  Jody Kreiman,et al.  Foundations of Voice Studies: An Interdisciplinary Approach to Voice Production and Perception , 2011 .