Testing the reliability of Grade, Roughness and Breathiness scores by means of synthetic speech stimuli

Abstract This article describes a synthesizer of disordered voices and reports a test of the reliability of Grade, Roughness, and Breathiness scores assigned to synthetic stimuli by eight expert listeners in two sessions. Speech stimuli [a], [i], [u], [ai], and [ia] were synthesized with three values of vocal frequency and four levels of vocal jitter and pulsatile additive noise each. The agreement and correlation of scores assigned by the same rater in different sessions, or by different raters in the same session, accord with published data. Only a small part of the variance of the arithmetic differences between the scores that are assigned to the same stimulus is explained by the stimuli properties. The conclusion is that differences between scores that are assigned to the same stimulus are not attributable to biases of individual raters; such biases would shift all the scores assigned on a scale, and the shift would be interpretable in terms of the properties of the stimuli.

[1]  Coleman Rf,et al.  Effect of Waveform Changes Upon Roughness Perception , 1971 .

[2]  平野 実 Clinical examination of voice , 1981 .

[3]  J. Hillenbrand,et al.  Perception of aperiodicities in synthetically generated voices. , 1988, The Journal of the Acoustical Society of America.

[4]  Kathryn Hird,et al.  Perception of synthesized voice quality in connected speech by Cantonese speakers. , 2002, The Journal of the Acoustical Society of America.

[5]  Jean Schoentgen,et al.  Development and perceptual assessment of a synthesizer of disordered voices. , 2012, The Journal of the Acoustical Society of America.

[6]  R. W. Wendahl,et al.  Laryngeal analog synthesis of jitter and shimmer auditory parameters of harshness. , 1966, Folia phoniatrica.

[7]  J. Schoentgen Vocal cues of disordered voices: an overview , 2006 .

[8]  Anton J. Rozsypal,et al.  Perception of jitter and shimmer in synthetic vowels , 1979 .

[9]  J Kreiman,et al.  Validity of rating scale measures of voice quality. , 1998, The Journal of the Acoustical Society of America.

[10]  Vicki L. Heiberger,et al.  Jitter and Shimmer in Sustained Phonation , 1982 .

[11]  Jean Schoentgen,et al.  Shaping function models of the phonatory excitation signal. , 2003, The Journal of the Acoustical Society of America.

[12]  D G Childers,et al.  Vocal quality factors: analysis, synthesis, and perception. , 1991, The Journal of the Acoustical Society of America.

[13]  J Kreiman,et al.  Comparing internal and external standards in voice quality judgments. , 1993, Journal of speech and hearing research.

[14]  D G Childers,et al.  Modeling the glottal volume-velocity waveform for three voice types. , 1995, The Journal of the Acoustical Society of America.

[15]  J. Kreiman,et al.  Perception of vocal tremor. , 2003, Journal of speech, language, and hearing research : JSLHR.

[16]  I R Titze,et al.  Perception of pitch and roughness in vocal signals with subharmonics. , 2001, Journal of voice : official journal of the Voice Foundation.

[17]  Rahul Shrivastav,et al.  Some difference limens for the perception of breathiness. , 2006, The Journal of the Acoustical Society of America.

[18]  Satoshi Imaizumi,et al.  Acoustic and perceptual modelling of the voice quality caused by fundamental frequency perturbation , 1992, ICSLP.

[19]  Yi Xu,et al.  Perceived pitch of synthesized voice with alternate cycles. , 2002, Journal of voice : official journal of the Voice Foundation.

[20]  I. Titze,et al.  The perception of two vocal qualities in a synthesized vocal utterance: ring and pressed voice. , 2004, Journal of voice : official journal of the Voice Foundation.

[21]  J. Kreiman,et al.  When and why listeners disagree in voice quality assessment tasks. , 2007, The Journal of the Acoustical Society of America.

[22]  Jody Kreiman,et al.  Measuring vocal quality with speech synthesis , 2000 .

[23]  Leonardo Bocchi,et al.  Validity of jitter measures in non-quasi-periodic voices. Part I: Perceptual and computer performances in cycle pattern recognition , 2011, Logopedics, phoniatrics, vocology.

[24]  D. G. Childers,et al.  Articulatory synthesis: nasal sounds and male and female voices , 1991 .

[25]  J. Kreiman,et al.  Sources of listener disagreement in voice quality assessment. , 2000, The Journal of the Acoustical Society of America.

[26]  Norman J. Lass,et al.  Speech and Language: Advances in Basic Research and Practice , 1979 .

[27]  Rolf Carlson,et al.  Experiments with voice modelling in speech synthesis , 1991, Speech Commun..

[28]  I. Titze The myoelastic aerodynamic theory of phonation , 2006 .

[29]  Jody Kreiman,et al.  Perception of aperiodicity in pathological voice. , 2005, The Journal of the Acoustical Society of America.

[30]  Jody Kreiman,et al.  Integrated software for analysis and synthesis of voice quality , 2010, Behavior research methods.

[31]  D. Wollschläger,et al.  Grundlagen der Datenanalyse mit R , 2010 .

[32]  Peter Ladefoged,et al.  Vowels and Consonants , 2000, Manchu Grammar.