Perception of aperiodicity in pathological voice.

Although jitter, shimmer, and noise acoustically characterize all voice signals, their perceptual importance in naturally produced pathological voices has not been established psychoacoustically. To determine the role of these attributes in the perception of vocal quality, listeners were asked to adjust levels of jitter, shimmer, and the noise-to-signal ratio in a speech synthesizer, so that synthetic voices matched naturally produced tokens. Results showed that, although listeners agreed well in their judgments of the noise-to-signal ratio, they did not agree with one another in their chosen settings for jitter and shimmer. Noise-dependent differences in listeners' ability to detect changes in amounts of jitter and shimmer implicate both listener insensitivity and inability to isolate jitter and shimmer as separate dimensions in the overall pattern of aperiodicity in a voice as causes of this poor agreement. These results suggest that jitter and shimmer are not useful as independent indices of perceived vocal quality, apart from their acoustic contributions to the overall pattern of spectrally shaped noise in a voice.

[1]  J. Hillenbrand,et al.  Perception of aperiodicities in synthetically generated voices. , 1988, The Journal of the Acoustical Society of America.

[2]  Jacqueline Vaissière,et al.  Objective acoustic and aerodynamic measures of breathiness in paralytic dysphonia , 2003, European Archives of Oto-Rhino-Laryngology.

[3]  Neil A. Macmillan,et al.  Detection Theory: A User's Guide , 1991 .

[4]  M P Karnell,et al.  Comparison of fundamental frequency and perturbation measurements among three analysis systems. , 1995, Journal of voice : official journal of the Voice Foundation.

[5]  R. W. Wendahl,et al.  Some parameters of auditory roughness. , 1966, Folia phoniatrica.

[6]  I R Titze,et al.  Some technical considerations in voice perturbation measurements. , 1987, Journal of speech and hearing research.

[7]  J Kreiman,et al.  The perceptual structure of pathologic voice quality. , 1996, The Journal of the Acoustical Society of America.

[8]  G. de Krom A cepstrum-based technique for determining a harmonics-to-noise ratio in speech signals. , 1993, Journal of speech and hearing research.

[9]  D. M. Green,et al.  Signal detection theory and psychophysics , 1966 .

[10]  Jody Kreiman,et al.  Measuring vocal quality with speech synthesis , 2000 .

[11]  B. Walden,et al.  An evaluation of residue features as correlates of voice disorders. , 1987, Journal of communication disorders.

[12]  M. D. Fresneda,et al.  Acoustic and Perceptual Indicators of Normal and Pathological Voice , 2003, Folia Phoniatrica et Logopaedica.

[13]  Robert F. Coleman,et al.  Vocal roughness and stimulus duration , 1967 .

[14]  C. Su,et al.  A New Paramedian Approach to Arytenoid Adduction and Strap Muscle Transposition for Vocal Fold Medialization , 2002, The Laryngoscope.

[15]  A. Andrianopoulos,et al.  Multimodal standardization of voice among four multicultural populations: fundamental frequency and spectral characteristics. , 2001, Journal of voice : official journal of the Voice Foundation.

[16]  M. Abrahão,et al.  Noise-to-harmonics ratio as an acoustic measure of voice disorders in boys. , 2002, Journal of voice : official journal of the Voice Foundation.

[17]  P. Mueller The Aging Voice , 1997, Seminars in speech and language.

[18]  I Honjo,et al.  A new index for evaluation of the turbulent noise in pathological voice. , 1988, The Journal of the Acoustical Society of America.

[19]  J. Kreiman,et al.  Listener experience and perception of voice quality. , 1988, Journal of speech and hearing research.

[20]  J Kreiman,et al.  Comparison of voice analysis systems for perturbation measurement. , 1993, Journal of speech and hearing research.

[21]  Hans Werner Strube,et al.  Glottal-to-Noise Excitation Ratio - a New Measure for Describing Pathological Voices , 1997 .

[22]  Brian Charles Gabelman,et al.  Analysis and synthesis of pathological vowels , 2003 .

[23]  P. Murphy,et al.  Perturbation-free measurement of the harmonics-to-noise ratio in voice signals using pitch synchronous harmonic analysis. , 1999, The Journal of the Acoustical Society of America.

[24]  G. Niedzielska Acoustic analysis in the diagnosis of voice disorders in children. , 2001, International journal of pediatric otorhinolaryngology.

[25]  P. Jensen,et al.  Adequacy of terminology for clinical judgment of voice quality deviation. , 1965, Eye, ear, nose & throat monthly.

[26]  F. Emanuel,et al.  Some waveform and spectral features of vowel roughness. , 1978, Journal of speech and hearing research.

[27]  Jody Kreiman,et al.  Comparison of Voice Analysis Systems for Perturbation Measurement , 1996 .

[28]  B M Cheetham,et al.  Objective assessment of hoarseness by measuring jitter. , 2001, Clinical otolaryngology and allied sciences.

[29]  G. de Krom,et al.  Some spectral correlates of pathological breathy and rough voice quality for different types of vowel fragments. , 1995, Journal of speech and hearing research.

[30]  Guus de Krom,et al.  A Cepstrum-Based Technique for Determining a Harmonics-to-Noise Ratio in Speech Signals , 1993 .

[31]  J. Kreiman,et al.  Perception of vocal tremor. , 2003, Journal of speech, language, and hearing research : JSLHR.

[32]  Jack J. Jiang,et al.  Acoustic Measurement of Change in Voice Quality with Treatment for Chronic Posterior Laryngitis , 1997, The Annals of otology, rhinology, and laryngology.

[33]  J Hillenbrand,et al.  A methodological study of perturbation and additive noise in synthetically generated voice signals. , 1987, Journal of speech and hearing research.

[34]  I Maddieson,et al.  Digital inverse filtering for linguistic research. , 1987, Journal of speech and hearing research.

[35]  J. Liljencrants,et al.  Dept. for Speech, Music and Hearing Quarterly Progress and Status Report a Four-parameter Model of Glottal Flow , 2022 .

[36]  G. Krom Some spectral correlates of pathological breathy and rough voice quality for different types of vowel fragments. , 1995, Journal of speech and hearing research.

[37]  R. W. Wendahl,et al.  LARYNGEAL ANALOG SYNTHESIS OF HARSH VOICE QUALITY. , 1963, Folia phoniatrica.

[38]  J. Kreiman,et al.  The multidimensional nature of pathologic vocal quality. , 1994, The Journal of the Acoustical Society of America.

[39]  V. Wolfe,et al.  Pathologic voice type and the acoustic prediction of severity. , 1995, Journal of speech and hearing research.

[40]  Vicki L. Heiberger,et al.  Jitter and Shimmer in Sustained Phonation , 1982 .

[41]  R. W. Wendahl,et al.  Laryngeal analog synthesis of jitter and shimmer auditory parameters of harshness. , 1966, Folia phoniatrica.

[42]  Jensen Pj,et al.  Adequacy of terminology for clinical judgment of voice quality deviation. , 1965 .