Glottal-to-Noise Excitation Ratio - a New Measure for Describing Pathological Voices

Summary In this article a new acoustic parameter for the objective description of voice quality is introduced. It is based on the correlation coefficientfor Hilbert envelopes of different frequency bands. The parameter indicates whether a given voice signal originates from vibrations of the vocal folds or from turbulent noise generated in the vocal tract and is thus related to (but not a direct measure of) breathiness. Therefore it is named Glottal-to-Noise Excitation Ratio (GNE Ratio). GNE is compared to HNR (Harmonics-to-Noise Ratio) and NNE (Normalized Noise Energy), existing measures also sensitive to additive noise (turbulence). Experiments with artificialsignals show that only the GNE is almost independent of frequency modulation noise (jitter) and amplitude modulation noise (shimmer).

[1]  T.H. Crystal,et al.  Linear prediction of speech , 1977, Proceedings of the IEEE.

[2]  G. de Krom A cepstrum-based technique for determining a harmonics-to-noise ratio in speech signals. , 1993, Journal of speech and hearing research.

[3]  F. Klingholz The measurement of the signal-to-noise ratio (SNR) in continuous speech , 1987, Speech Commun..

[4]  Hideki Kasuya,et al.  Novel acoustic measurements of jitter and shimmer characteristics from pathological voice , 1993, EUROSPEECH.

[5]  T. Baer,et al.  Harmonics-to-noise ratio as an index of the degree of hoarseness. , 1982, The Journal of the Acoustical Society of America.

[6]  W. Aures,et al.  Ein Berechnungsverfahren der Rauhigkeit , 1985 .

[7]  Hideki Kasuya,et al.  An adaptive comb filtering method as applied to acoustic analyses of pathological voice , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[8]  G. Krom Some spectral correlates of pathological breathy and rough voice quality for different types of vowel fragments. , 1995, Journal of speech and hearing research.

[9]  G. de Krom,et al.  Some spectral correlates of pathological breathy and rough voice quality for different types of vowel fragments. , 1995, Journal of speech and hearing research.

[10]  Guus de Krom,et al.  A Cepstrum-Based Technique for Determining a Harmonics-to-Noise Ratio in Speech Signals , 1993 .

[11]  T. Baer,et al.  A pitch-synchronous analysis of hoarseness in running speech. , 1988, The Journal of the Acoustical Society of America.