Short-term stability measures for the evaluation of vocal quality.

The vocal quality of 64 normal subjects and 57 subjects suffering various degrees of glottal cancer was investigated using acoustic measures of six different aspects of the voice signal: tone period perturbation, amplitude perturbation, waveform perturbation, vocal noise, spectral periodicity and spectral distortion. The measures were estimated taking the glottal cycle as temporal reference unit to make the influence of the differences in tone period from one person to another as low as possible. The measures were evaluated with regard to (a) their ability to discriminate between healthy and sick subjects, and (b) their correlation with the perceptual evaluation of four trained listeners. The results suggest that signal processing techniques are unsatisfactory for clinical diagnoses but useful for monitoring voice quality.

[1]  P. Lieberman Some Acoustic Measures of the Fundamental Periodicity of Normal and Pathologic Larynges , 1963 .

[2]  Paul H. Ptacek,et al.  Phonatory and Related Changes with Advanced Age , 1966 .

[3]  Y. Koike Application of Some Acoustic Measures for the Evaluation of Laryngeal Dysfunction , 1967 .

[4]  Y. Koike Vowel amplitude modulations in patients with laryngeal diseases. , 1969, The Journal of the Acoustical Society of America.

[5]  E. C. Hammond,et al.  Histologic changes in the larynx in relation to smoking habits , 1970, Cancer.

[6]  Keinosuke Fukunaga,et al.  Estimation of Classification Error , 1970, IEEE Transactions on Computers.

[7]  H. Hollien,et al.  Speaking fundamental frequency and chronologic age in males. , 1972, Journal of speech and hearing research.

[8]  Keinosuke Fukunaga,et al.  Introduction to Statistical Pattern Recognition , 1972 .

[9]  Harvey R. Gilbert,et al.  The effects of smoking on the speaking fundamental frequency of adult women , 1974 .

[10]  W. Gould,et al.  Vocal Shimmer in Sustained Phonation of Normal and Pathologic Voice , 1976, The Annals of otology, rhinology, and laryngology.

[11]  Ronald W. Schafer,et al.  Digital Processing of Speech Signals , 1978 .

[12]  Alan V. Oppenheim,et al.  Applications of digital signal processing , 1978 .

[13]  J. Sundberg,et al.  Perceptual and acoustic correlates of abnormal voice qualities. , 1980, Acta oto-laryngologica.

[14]  T. Murry,et al.  Selected Acoustic Characteristics of Pathologic and Normal Speakers , 1980 .

[15]  R. Gray,et al.  Distortion measures for speech processing , 1980 .

[16]  Y Horii,et al.  Age and changes in vocal jitter. , 1980, Journal of gerontology.

[17]  M. Stoicheff Speaking fundamental frequency characteristics of nonsmoking female adults. , 1981, Journal of speech and hearing research.

[18]  K. Kitajima,et al.  Quantitative evaluation of the noise level in the pathologic voice. , 1981, Folia phoniatrica.

[19]  T. Baer,et al.  Harmonics-to-noise ratio as an index of the degree of hoarseness. , 1982, The Journal of the Acoustical Society of America.

[20]  Eiji Yumoto,et al.  The Quantitative Evaluation of Hoarseness: A New Harmonics to Noise Ratio Method , 1983 .

[21]  L. Ramig,et al.  Effects of physiological aging on selected acoustic characteristics of voice. , 1983, Journal of speech and hearing research.

[22]  E. Yumoto,et al.  Harmonics-to-noise ratio and psychophysical measurement of the degree of hoarseness. , 1984, Journal of speech and hearing research.

[23]  B Hammarberg,et al.  Teflon injection in 16 patients with paralytic dysphonia: perceptual and acoustic evaluations. , 1984, The Journal of speech and hearing disorders.

[24]  H. Kasuya,et al.  Characteristics of pitch period and amplitude perturbation quotients for the detection of glottic cancer , 1984 .

[25]  Y Kitazoe,et al.  Harmonic-intensity analysis of normal and hoarse voices. , 1984, The Journal of the Acoustical Society of America.

[26]  F. Klingholz,et al.  Quantitative spectral evaluation of shimmer and jitter. , 1985, Journal of speech and hearing research.

[27]  Comparative study of several distortion measures for speech recognition , 1985, Speech Commun..

[28]  Jean Schoentgen,et al.  An acoustic feature related to vocal efficiency in normal and pathological speakers , 1985, Speech Commun..

[29]  Hideki Kasuya,et al.  An adaptive comb filtering method as applied to acoustic analyses of pathological voice , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[30]  H. Kasuya,et al.  Normalized noise energy as an acoustic measure to evaluate pathologic voice. , 1986, The Journal of the Acoustical Society of America.

[31]  J. Laver,et al.  An acoustic screening system for the detection of laryngeal pathology , 1986 .

[32]  Hideki Kasuya,et al.  An acoustic analysis of pathological voice and its application to the evaluation of laryngeal pathology , 1986, Speech Commun..

[33]  Masaki Kato,et al.  NON SPECIFIC GRANULOMA OF THE LARYNX , 1986 .

[34]  Hideki Kasuya,et al.  Preliminary experiments on voice screening , 1986 .

[35]  Yasuo Koike Cepstrum analysis of pathologic voices , 1986 .

[36]  A G Askenfelt,et al.  Speech waveform perturbation analysis: a perceptual-acoustical comparison of seven measures. , 1986, Journal of speech and hearing research.

[37]  A. Rauhut,et al.  Classification of voice qualities , 1986 .

[38]  S. Linville,et al.  Fundamental frequency stability characteristics of elderly women's voices. , 1987, The Journal of the Acoustical Society of America.

[39]  S. Linville Intraspeaker variability in fundamental frequency stability: an age-related phenomenon? , 1988, The Journal of the Acoustical Society of America.