Development of a glottal area index that integrates glottal gap size and open quotient.

Because voice signals result from vocal fold vibration, perceptually meaningful vibratory measures should quantify those aspects of vibration that correspond to differences in voice quality. In this study, glottal area waveforms were extracted from high-speed videoendoscopy of the vocal folds. Principal component analysis was applied to these waveforms to investigate the factors that vary with voice quality. Results showed that the first principal component derived from tokens without glottal gaps was significantly (p < 0.01) associated with the open quotient (OQ). The alternating-current (AC) measure had a significant effect (p < 0.01) on the first principal component among tokens exhibiting glottal gaps. A measure AC/OQ, defined as the ratio of AC to OQ, was proposed to combine both amplitude and temporal characteristics of the glottal area waveform for both complete and incomplete glottal closures. Analyses of "glide" phonations in which quality varied continuously from breathy to pressed showed that the AC/OQ measure was able to characterize the corresponding continuum of glottal area waveform variation, regardless of the presence or absence of glottal gaps.

[1]  R TIMCKE,et al.  Laryngeal vibrations: measurements of the glottic wave. I. The normal vibratory cycle. , 1958, A.M.A. archives of otolaryngology.

[2]  H. Hollien,et al.  On the nature of vocal fry. , 1966, Journal of speech and hearing research.

[3]  A. Rosenberg Effect of glottal pulse shape on the quality of natural vowels. , 1969 .

[4]  A. Rosenberg Effect of glottal pulse shape on the quality of natural vowels. , 1969, The Journal of the Acoustical Society of America.

[5]  M. Rothenberg A new inverse-filtering technique for deriving the glottal air flow waveform during voicing. , 1970, The Journal of the Acoustical Society of America.

[6]  M. Hirano,et al.  Glottal-area time function and subglottal-pressure variation. , 1973, The Journal of the Acoustical Society of America.

[7]  M. Morrison,et al.  Muscular tension dysphonia. , 1983, The Journal of otolaryngology.

[8]  B. Gerratt,et al.  Cinegraphic observations of laryngeal function in parkinson's disease , 1984, The Laryngoscope.

[9]  Hiroya Fujisaki,et al.  Proposal and evaluation of models for the glottal source waveform , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[10]  M. Huffman Measures of phonation type in Hmong. , 1987, The Journal of the Acoustical Society of America.

[11]  I Maddieson,et al.  Digital inverse filtering for linguistic research. , 1987, Journal of speech and hearing research.

[12]  J. Perkell,et al.  Glottal airflow and transglottal air pressure measurements for male and female speakers in soft, normal, and loud voice. , 1988, The Journal of the Acoustical Society of America.

[13]  Joseph S. Perkell,et al.  Glottal airflow and transglottal air pressure measurements for male and female speakers in low, normal, and high pitch , 1989 .

[14]  D. Klatt,et al.  Analysis, synthesis, and perception of voice quality variations among female and male talkers. , 1990, The Journal of the Acoustical Society of America.

[15]  M. Södersten,et al.  Glottal closure and perceived breathiness during phonation in normally speaking subjects. , 1990, Journal of speech and hearing research.

[16]  Paavo Alku,et al.  Glottal wave analysis with Pitch Synchronous Iterative Adaptive Inverse Filtering , 1991, Speech Commun..

[17]  Bert Cranen,et al.  Modeling a leaky glottis. , 1992 .

[18]  G. de Krom A cepstrum-based technique for determining a harmonics-to-noise ratio in speech signals. , 1993, Journal of speech and hearing research.

[19]  Guus de Krom,et al.  A Cepstrum-Based Technique for Determining a Harmonics-to-Noise Ratio in Speech Signals , 1993 .

[20]  Gunnar Fant,et al.  Some problems in voice source analysis , 1993, Speech Commun..

[21]  J. Hillenbrand,et al.  Acoustic correlates of breathy vocal quality. , 1994, Journal of speech and hearing research.

[22]  Bert Cranen,et al.  Modeling a leaky glottis , 1995 .

[23]  Helen M. Hanson,et al.  Glottal characteristics of male speakers: acoustic correlates and comparison with female data. , 1996 .

[24]  I. Titze,et al.  Acoustic interactions of the voice source with the lower vocal tract. , 1997, The Journal of the Acoustical Society of America.

[25]  H M Hanson,et al.  Glottal characteristics of female speakers: acoustic correlates. , 1997, The Journal of the Acoustical Society of America.

[26]  K. Omori,et al.  Influence of Size and Etiology of Glottal Gap in Glottic Incompetence Dysphonia , 1998, The Laryngoscope.

[27]  Hideki Kawahara,et al.  Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds , 1999, Speech Commun..

[28]  E. Chuang,et al.  Glottal characteristics of male speakers: acoustic correlates and comparison with female data. , 1996, The Journal of the Acoustical Society of America.

[29]  Christophe d'Alessandro,et al.  Spectral correlates of voice open quotient and glottal flow asymmetry : theory, limits and experimental data , 2001, INTERSPEECH.

[30]  P. Alku,et al.  Normalized amplitude quotient for parametrization of the glottal flow. , 2002, The Journal of the Acoustical Society of America.

[31]  Ingo R Titze A theoretical study of F0-F1 interaction with application to resonant speaking and singing voice. , 2004, Journal of voice : official journal of the Voice Foundation.

[32]  I. Titze Theoretical analysis of maximum flow declination rate versus maximum area declination rate in phonation. , 2006, Journal of speech, language, and hearing research : JSLHR.

[33]  Paavo Alku,et al.  Estimation of the Voice Source from Speech Pressure Signals: Evaluation of an Inverse Filtering Technique Using Physical Modelling of Voice Production , 2006, Folia Phoniatrica et Logopaedica.

[34]  B. Blagnys,et al.  To "EE" or not to "EE". , 2007, The Journal of otolaryngology.

[35]  G. Fant,et al.  Preliminaries to analysis of the human voice source , 2007 .

[36]  J. Liljencrants,et al.  Dept. for Speech, Music and Hearing Quarterly Progress and Status Report a Four-parameter Model of Glottal Flow , 2022 .

[37]  I. Titze Nonlinear source-filter coupling in phonation: theory. , 2008, The Journal of the Acoustical Society of America.

[38]  P. Alku,et al.  Closed phase covariance analysis based on constrained linear prediction for glottal inverse filtering. , 2009, The Journal of the Acoustical Society of America.

[39]  Abeer Alwan,et al.  A novel codebook search technique for estimating the open quotient , 2009, INTERSPEECH.

[40]  Jody Kreiman,et al.  Effects of native language on perception of voice quality , 2010, J. Phonetics.

[41]  Abeer Alwan,et al.  On the interdependencies between voice quality, glottal gaps, and voice-source related acoustic measures , 2010, INTERSPEECH.

[42]  Abeer Alwan,et al.  The voice source in speech production: data, analysis and models , 2010 .

[43]  Abeer Alwan,et al.  Acoustic Correlates of Glottal Gaps , 2011, INTERSPEECH.

[44]  Brad H Story,et al.  Relation of structural and vibratory kinematics of the vocal folds to two acoustic measures of breathy voice based on computational modeling. , 2011, Journal of speech, language, and hearing research : JSLHR.

[45]  Patricia A. Keating,et al.  Voicesauce: A Program for Voice Analysis , 2009, ICPhS.

[46]  Daryush D. Mehta,et al.  Investigating acoustic correlates of human vocal fold vibratory phase asymmetry through modeling and laryngeal high-speed videoendoscopy. , 2011, The Journal of the Acoustical Society of America.

[47]  Jody Kreiman,et al.  Foundations of Voice Studies: An Interdisciplinary Approach to Voice Production and Perception , 2011 .

[48]  A. Alwan,et al.  Variability in the relationships among voice quality, harmonic amplitudes, open quotient, and glottal area waveform shape in sustained phonation. , 2012, The Journal of the Acoustical Society of America.