Voice Source Change During Fundamental Frequency Variation

Prosody refers to certain properties of the speech signal including audible changes in pitch, loudness, and syllable length. The acoustic manifestation of prosody is typically measured in terms of fundamental frequency (f0), amplitude and duration. These three cues have formed the basis for extensive studies of prosody in natural speech. The present work seeks to go beyond this level of representation and to examine additional factors that arise as a result of the underlying production mechanism. For example, intonation is studied with reference to the f0 contour. However, to change f0 requires changes in the laryngeal configuration that results in glottal flow parameter changes. These glottal changes may serve as important psychoacoustic markers in addition to (or in conjunction with) the f0 targets. The present work examines changes in open quotient with f0 in connected speech using electroglottogram and volume velocity at the lips signals. This preliminary study suggests that individual differences may exist in terms of glottal changes for a particular f0 variation.

[1]  H M Hanson,et al.  Glottal characteristics of female speakers: acoustic correlates. , 1997, The Journal of the Acoustical Society of America.

[2]  Peter J. Murphy,et al.  Estimation of the vocal tract transfer function with application to glottal wave analysis , 2005, Speech Commun..

[3]  Quarterly Progress and Status Report Glottal wave forms for normal female speakers , 2007 .

[4]  平野 実,et al.  Vocal fold physiology : voice quality control , 1995 .

[5]  K. Stevens,et al.  Glottal characteristics of female speakers , 1995 .

[6]  Joseph S. Perkell,et al.  Glottal airflow and transglottal air pressure measurements for male and female speakers in low, normal, and high pitch , 1989 .

[7]  Thomas F. Quatieri,et al.  Shape invariant time-scale and pitch modification of speech , 1992, IEEE Trans. Signal Process..

[8]  Lou Boves,et al.  On the relation between voice source parameters and prosodic features in connected speech , 1992, Speech Commun..

[9]  D. Klatt,et al.  Analysis, synthesis, and perception of voice quality variations among female and male talkers. , 1990, The Journal of the Acoustical Society of America.

[10]  Qiang Fu,et al.  Robust Glottal Source Estimation Based on Joint Source-Filter Model Optimization , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[11]  J. Liljencrants,et al.  Dept. for Speech, Music and Hearing Quarterly Progress and Status Report a Four-parameter Model of Glottal Flow , 2022 .

[12]  Peter Murphy,et al.  Production based pitch modification of voiced speech , 2002, INTERSPEECH.

[13]  Yannis Stylianou,et al.  Applying the harmonic plus noise model in concatenative speech synthesis , 2001, IEEE Trans. Speech Audio Process..

[14]  Raymond N. J. Veldhuis,et al.  The effect of speech melody on voice quality , 2001, Speech Commun..