Comparison of voice analysis systems for perturbation measurement.

Dysphonic voices are often analyzed using automated voice analysis software. However, the reliability of acoustic measures obtained from these programs remains unknown, particularly when they are applied to pathological voices. This study compared perturbation measures from CSpeech, Computerized Speech Laboratory, SoundScope, and a hand marking voice analysis system. Sustained vowels from 29 male and 21 female speakers with mild to severe dysphonia were digitized, and fundamental frequency (F0), jitter, shimmer, and harmonics- or signal-to-noise ratios were computed. Commercially available acoustical analysis programs agreed well, but not perfectly, in their measures of F0. Measures of perturbation in the various analysis packages use different algorithms, provide results in different units, and often yield values for voices that violate the assumption of quasi-periodicity. As a result, poor rank order correlations between programs using similar measures of perturbation were noted. Because measures of aperiodicity apparently cannot be reliably applied to voices that are even mildly aperiodic, we question their utility in quantifying vocal quality, especially in pathological voices.

[1]  T. Baer,et al.  Harmonics-to-noise ratio as an index of the degree of hoarseness. , 1982, The Journal of the Acoustical Society of America.

[2]  M P Karnell,et al.  Laryngeal perturbation analysis: minimum length of analysis window. , 1991, Journal of speech and hearing research.

[3]  P. Milenkovic,et al.  Least mean square measures of voice perturbation. , 1987, Journal of speech and hearing research.

[4]  J F Deem,et al.  The automatic extraction of pitch perturbation using microcomputers: some methodological considerations. , 1989, Journal of speech and hearing research.

[5]  S. B. Davis Acoustic Characteristics of Normal and Pathological Voices , 1979 .

[6]  J. Till,et al.  Time-shift, trial, and gender effects on vocal perturbation measures. , 1993, Journal of voice : official journal of the Voice Foundation.

[7]  M P Karnell,et al.  Comparison of acoustic voice perturbation measures among three independent voice laboratories. , 1991, Journal of speech and hearing research.

[8]  D. Childers,et al.  Acoustic correlates of vocal quality. , 1990, Journal of speech and hearing research.

[9]  Y. Koike Application of Some Acoustic Measures for the Evaluation of Laryngeal Dysfunction , 1967 .

[10]  J. Hillenbrand Perception of aperiodicities in synthetically generated voices. , 1988, The Journal of the Acoustical Society of America.

[11]  I R Titze,et al.  Some technical considerations in voice perturbation measurements. , 1987, Journal of speech and hearing research.

[12]  V. Wolfe,et al.  Acoustic prediction of severity in commonly occurring voice problems. , 1995, Journal of speech and hearing research.

[13]  Y Horii,et al.  Vocal shimmer in sustained phonation. , 1980, Journal of speech and hearing research.

[14]  I. Titze,et al.  Comparison of Fo extraction methods for high-precision voice perturbation measurements. , 1993, Journal of speech and hearing research.

[15]  E T Doherty,et al.  Tape recorder effects on jitter and shimmer extraction. , 1988, Journal of speech and hearing research.

[16]  S. Linville,et al.  Fundamental frequency stability characteristics of elderly women's voices. , 1987, The Journal of the Acoustical Society of America.

[17]  Y. Koike,et al.  Some perceptual dimensions and acoustical correlates of pathologic voices. , 1976, Acta oto-laryngologica. Supplementum.

[18]  D G Childers,et al.  Vocal quality factors: analysis, synthesis, and perception. , 1991, The Journal of the Acoustical Society of America.

[19]  W S Winholtz,et al.  Effect of microphone type and placement on voice perturbation measurements. , 1993, Journal of speech and hearing research.

[20]  M. Maves,et al.  Acoustic Characteristics of Post-Thyroplasty Patients , 1992, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[21]  S. Feijóo,et al.  Short-term stability measures for the evaluation of vocal quality. , 1990, Journal of speech and hearing research.

[22]  M. Ito,et al.  Technical considerations in computation of spectral harmonics-to-noise ratios for sustained vowels. , 1989, Journal of speech and hearing research.

[23]  D J Povel,et al.  Predicting voice quality of deaf speakers on the basis of glottal characteristics. , 1990, Journal of speech and hearing research.

[24]  Y Qi Time normalization in voice analysis. , 1992, The Journal of the Acoustical Society of America.

[25]  R S McGowan,et al.  Acoustic measurements of men's and women's voices: a study of context effects and covariation. , 1990, Journal of speech and hearing research.