Classification of dysphonic voice: acoustic and auditory-perceptual measures.

The purpose of this study was (1) to determine the relationship between acoustic measures and auditory-perceptual dimensions of overall voice severity and pleasantness and (2) to evaluate the ability of acoustic and auditory-perceptual measures to discriminate normal from dysphonic voices. Thirty adult dysphonic speakers and six, age-matched normal control speakers were asked to provide oral reading samples of the Rainbow Passage. Acoustic analysis of the speech samples was used to identify abnormal phonatory events associated with dysphonia. The acoustic program calculated long-term average spectral measures, glottal noise measures, and those measures based on linear prediction (LP) modeling. Twelve adult listeners judged overall voice severity and pleasantness from the connected speech samples using direct magnitude estimation (DME) procedures. The acoustic measures accounted for 48% of overall voice severity and 40% of voice pleasantness for dysphonic speakers. The classification performance of the acoustic measures and auditory-perceptual measures was quantified using logistic regression analysis. When acoustic measures or auditory-perceptual measures were considered in isolation, classification was generally accurate and similar across measures. Classification accuracy improved to 100% when acoustic and auditory-perceptual measures were combined. These data provide further support for use of both auditory-perceptual evaluation and acoustic analyses for classifying and evaluating dysphonia.

[1]  D. Childers,et al.  Acoustic correlates of vocal quality. , 1990, Journal of speech and hearing research.

[2]  W. S. Brown,et al.  Comparison of various automatic means for measuring mean fundamental frequency. , 1996, Journal of voice : official journal of the Voice Foundation.

[3]  V. Wolfe,et al.  Pathologic voice type and the acoustic prediction of severity. , 1995, Journal of speech and hearing research.

[4]  Y. Qi,et al.  The estimation of signal-to-noise ratio in continuous speech for disordered voices. , 1999, The Journal of the Acoustical Society of America.

[5]  J. Kreiman,et al.  Sources of listener disagreement in voice quality assessment. , 2000, The Journal of the Acoustical Society of America.

[6]  Norman J. Lass,et al.  Speech and Language: Advances in Basic Research and Practice , 1979 .

[7]  G. Fairbanks Voice and articulation drillbook , 1960 .

[8]  D. Jamieson,et al.  Acoustic discrimination of pathological voice: sustained vowels versus continuous speech. , 2001, Journal of speech, language, and hearing research : JSLHR.

[9]  D G Jamieson,et al.  A comparison of high precision F0 extraction algorithms for sustained vowels. , 1999, Journal of speech, language, and hearing research : JSLHR.

[10]  M. P. Gelfer Perceptual attributes of voice: Development and use of rating scales , 1988 .

[11]  J. Kreiman,et al.  Individual differences in voice quality perception. , 1992 .

[12]  V. Wolfe,et al.  Acoustic correlates of pathologic voice types. , 1991, Journal of speech and hearing research.

[13]  C. Sapienza,et al.  Three Treatments for Teachers With Voice Disorders , 2003 .

[14]  Raymond D. Kent,et al.  Self-organizing map for the classification of normal and disordered female voices. , 1999, Journal of speech, language, and hearing research : JSLHR.

[15]  Jody Kreiman,et al.  Comparison of Voice Analysis Systems for Perturbation Measurement , 1996 .

[16]  L. E. Travis Handbook of speech pathology and audiology , 1931 .

[17]  P. van de Heyning,et al.  Test-retest study of the GRBAS scale: influence of experience and professional background on perceptual rating of voice quality. , 1997, Journal of voice : official journal of the Voice Foundation.

[18]  C R Rabinov,et al.  Comparing reliability of perceptual ratings of roughness and acoustic measure of jitter. , 1995, Journal of speech and hearing research.

[19]  M. Hirano,et al.  Clinical Examination of Voice , 1981 .

[20]  W S Winholtz,et al.  Effect of microphone type and placement on voice perturbation measurements. , 1993, Journal of speech and hearing research.

[21]  J. Kreiman,et al.  Perceptual evaluation of voice quality: review, tutorial, and a framework for future research. , 1993, Journal of speech and hearing research.

[22]  Motor Speech Disorders: Advances in Assessment and Treatment , 1994 .

[23]  E Truy,et al.  Evaluation of cochlear implanted children's voices. , 1999, International journal of pediatric otorhinolaryngology.

[24]  T Murry,et al.  Multidimensional classification of normal voice qualities. , 1977, The Journal of the Acoustical Society of America.

[25]  Gary Weismer,et al.  Direct magnitude estimates of speech intelligibility in dysarthria: effects of a chosen standard. , 2002, Journal of speech, language, and hearing research : JSLHR.

[26]  Raymond D. Kent Hearing and Believing , 1996 .

[27]  Jody Kreiman,et al.  Measuring vocal quality with speech synthesis , 2000 .

[28]  V. Wolfe,et al.  Acoustic correlates of dysphonia: type and severity. , 1997, Journal of communication disorders.

[29]  G. Krom Some spectral correlates of pathological breathy and rough voice quality for different types of vowel fragments. , 1995, Journal of speech and hearing research.

[30]  J Kreiman,et al.  Validity of rating scale measures of voice quality. , 1998, The Journal of the Acoustical Society of America.

[31]  M P Karnell,et al.  Laryngeal perturbation analysis: minimum length of analysis window. , 1991, Journal of speech and hearing research.

[32]  T. Whitehill,et al.  Direct magnitude estimation and interval scaling of hypernasality. , 2002, Journal of speech, language, and hearing research : JSLHR.

[33]  J. Hillenbrand,et al.  Acoustic correlates of breathy vocal quality: dysphonic voices and continuous speech. , 1996, Journal of speech and hearing research.

[34]  J. Hillenbrand,et al.  Multidimensional scaling analysis of dysphonia in two speaker groups. , 1991, Journal of speech and hearing research.

[35]  S. B. Davis Acoustic Characteristics of Normal and Pathological Voices , 1979 .

[36]  J. Fleiss,et al.  Intraclass correlations: uses in assessing rater reliability. , 1979, Psychological bulletin.

[37]  B. Hammarberg,et al.  Vocal Fold Physiology: Acoustic, Perceptual, and Physiological Aspects of Voice Mechanisms , 1991 .

[38]  Philip C Doyle,et al.  Direct magnitude estimation and interval scaling of pleasantness and severity in dysphonic and normal speakers. , 2002, The Journal of the Acoustical Society of America.

[39]  F. Klingholtz Acoustic recognition of voice disorders: a comparative study of running speech versus sustained vowels. , 1990, The Journal of the Acoustical Society of America.

[40]  Jean Schoentgen,et al.  Jitter in sustained vowels and isolated sentences produced by dysphonic speakers , 1989, Speech Commun..

[41]  M. P. Gelfer A multidimensional scaling study of voice quality in females. , 1993, Phonetica.

[42]  E T Doherty,et al.  Tape recorder effects on jitter and shimmer extraction. , 1988, Journal of speech and hearing research.

[43]  J. Kreiman,et al.  The multidimensional nature of pathologic vocal quality. , 1994, The Journal of the Acoustical Society of America.

[44]  A. Reich,et al.  Teflon laryngoplasty: an acoustical and perceptual study. , 1978, The Journal of speech and hearing disorders.

[45]  J. Kreiman,et al.  Listener experience and perception of voice quality. , 1990 .