On the perception of emotions in speech: the role of voice quality

This experiment studied the role of voice quality in the communication of emotions in speech. The material was derived from an earlier study. There three subjects produced a three-word utterance expressing different emotional states. The glottal airflow waveform was estimated from the acoustic speech pressure signal using an inverse filtering technique.In the present study the differences in F0 level and the intrasyllabic F0 changes were artificially eliminated and only the first 200 ms of the primarily stressed syllable replayed at equal sound volume to the listeners (10 in total). The listeners tended to categorize the samples to represent emotions implying either high or low psychophysiological activity level. This seemed to emanate from signs of vocal effort level. Perception of vocal effort was significantly related to the glottal source type and F1.Decision of valence of the perceived emotion was in this material significantly related to F1 and F4. The type of glottal source, however, may also contr...

[1]  P Kitzing,et al.  A photoglottographical study of the female vocal folds during phonation. , 1974, Folia phoniatrica.

[2]  Paavo Alku,et al.  Preliminary experiences in using automatic inverse filtering of acoustical signals for the voice source analysis , 1992 .

[3]  R TIMCKE,et al.  Laryngeal vibrations: measurements of the glottic wave. I. The normal vibratory cycle. , 1958, A.M.A. archives of otolaryngology.

[4]  Paavo Alku,et al.  Glottal wave analysis with Pitch Synchronous Iterative Adaptive Inverse Filtering , 1991, Speech Commun..

[5]  J. Ohala Cross-Language Use of Pitch: An Ethological View , 1983, Phonetica.

[6]  T. Hacki Klassifizierung von Glottisdysfunktionen mit Hilfe der Elektroglottographie , 1989 .

[7]  J. Ohala,et al.  An Ethological Perspective on Common Cross-Language Utilization of F₀ of Voice , 1984, Phonetica.

[8]  J. Sundberg,et al.  Spectral correlates of glottal voice source waveform characteristics. , 1989, Journal of speech and hearing research.

[9]  A. Rosenberg Effect of glottal pulse shape on the quality of natural vowels. , 1969 .

[10]  D G Childers,et al.  Vocal quality factors: analysis, synthesis, and perception. , 1991, The Journal of the Acoustical Society of America.

[11]  Experimentelle Untersuchungen über den Zusammenhang zwischen dem Ausdruck der Sprechstimme und dem vegetativen Nervensystem , 1952 .

[12]  G. Blood,et al.  Judging personality and appearance from voice disorders. , 1979, Journal of communication disorders.

[13]  I. Fónagy Mimik auf glottaler Ebene , 1962 .

[14]  P. Alku,et al.  Physical variations related to stress and emotional state: A preliminary study. , 1996 .

[15]  D G Hanson,et al.  Frequency, intensity, and target matching effects on photoglottographic measures of open quotient and speed quotient. , 1990, Journal of speech and hearing research.

[16]  Paavo Alku,et al.  On the Perception of Emotional Content in Speech , 1995 .

[17]  E Vilkman,et al.  Effects of bandwidth on glottal airflow waveforms estimated by inverse filtering. , 1995, The Journal of the Acoustical Society of America.

[18]  Kim E. A. Silverman,et al.  Evidence for the independent function of intonation contour type, voice quality, and F0 range in signaling speaker affect , 1985 .

[19]  Sheldon B. Michaels,et al.  Some Aspects of Fundamental Frequency and Envelope Amplitude as Related to the Emotional Content of Speech , 1962 .

[20]  Iain R. Murray,et al.  Toward the simulation of emotion in synthetic speech: a review of the literature on human vocal emotion. , 1993, The Journal of the Acoustical Society of America.

[21]  Gunnar Fant,et al.  Acoustic Theory Of Speech Production , 1960 .

[22]  A M Engebretson,et al.  Indirect assessment of the contribution of subglottal air pressure and vocal-fold tension to changes of fundamental frequency in English. , 1978, The Journal of the Acoustical Society of America.

[23]  Mark A. Clements,et al.  Analysis of glottal waveforms across stress styles , 1990, International Conference on Acoustics, Speech, and Signal Processing.