Amplitude-Based Source Parameters for Measur ing Voice Quality

This paper explores amplitude-based measures of the glottal flow signal, as it has been suggested that they may provide a more robust method for analysing voice quality than the more typical time-based ones. Centred around an analysis of some samples from the VOQUAL’03 Speech database 2 of Japanese spontaneous speech, this paper (i) presents a discussion of whether a single global source parameter, such as the effective pulse declination time (which can be calculated from two amplitudes) or the T0-normalised equivalent, is likely to capture the tense–lax dimension of voice quality differentiation, across tokens exhibiting large ƒ0 differences; (ii) presents formulations for an extended set of amplitude-based source parameters, to provide a richer coverage of voice source variation, and (iii) compares results for the amplitude vs. time-based measures. Results for (i) indicate that the non-normalised measure AQ is more effective in capturing tense–lax differences across a large ƒ0 range than the normalised NAQ. Results for (iii) show a promising correspondence between the proposed amplitude-based measures and the time-based ones. A brief description is given of our planned research on Irish prosody, for which these measures will be important.

[1]  J. Laver The phonetic description of voice quality , 1980 .

[2]  K. Scherer Vocal affect expression: a review and a model for future research. , 1986, Psychological bulletin.

[3]  D. Klatt,et al.  Analysis, synthesis, and perception of voice quality variations among female and male talkers. , 1990, The Journal of the Acoustical Society of America.

[4]  Christer Gobl,et al.  Acoustic characteristics of voice quality , 1992, Speech Commun..

[5]  C. Gobl,et al.  Contextual Variation of the Vowel Voice Source as a Function of Adjacent Consonants , 1993, Language and speech.

[6]  Johan Liljencrants,et al.  Voice source parameters in continuous speech, transformation of LF-parameters , 1994, ICSLP.

[7]  G. Fant Dept. for Speech, Music and Hearing Quarterly Progress and Status Report the Lf-model Revisited. Transformations and Frequency Domain Analysis the Lf-model Revisited. Transformations and Frequency Domain Analysis* , 2022 .

[8]  P. Alku,et al.  A comparison of glottal voice source quantification parameters in breathy, normal and pressed phonation of female and male speakers. , 1996, Folia phoniatrica et logopaedica : official organ of the International Association of Logopedics and Phoniatrics.

[9]  Paavo Alku,et al.  Amplitude domain quotient for characterization of the glottal volume velocity waveform estimated by inverse filtering , 1996, Speech Commun..

[10]  Gunnar Fant,et al.  Voice source properties of the speech code , 1996 .

[11]  Gunnar Fant,et al.  The voice source in connected speech , 1997, Speech Commun..

[12]  Nick Campbell Recording techniques for capturing natural every-day speech , 2002, LREC.

[13]  P. Alku,et al.  Normalized amplitude quotient for parametrization of the glottal flow. , 2002, The Journal of the Acoustical Society of America.

[14]  C. Gobl,et al.  Expressive synthesis: how crucial is voice quality? , 2002, Proceedings of 2002 IEEE Workshop on Speech Synthesis, 2002..

[15]  Nick Campbell,et al.  Automatic Detection of Acoustic Centres of Reliability for Tagging Paralinguistic Information in Expressive Speech , 2002, LREC.

[16]  Ailbhe Ní Chasaide,et al.  The role of voice quality in communicating emotion, mood and attitude , 2003, Speech Commun..