Estimation of vocal noise in running speech by means of bi-directional double linear prediction

The presentation concerns forward and backward double linear prediction of speech with a view to the characterization of vocal noise due to voice disorders. Bi-directional double linear prediction consists in a conventional short-term prediction followed by a distal inter-cycle prediction that enables removing inter-cycle correlations owing to voicing. The long-term prediction is performed forward and backward. The minimum of the forward and backward prediction error is a cue of vocal noise. The minimum backward and forward prediction error has been calculated for corpora involving connected speech and sustained vowels. Comparisons have been performed between the estimated vocal noise and the perceived hoarseness in steady vowel fragments, as well as between the estimated vocal noise in connected speech and sustained vowels produced by the same speakers.

[1]  Peter Kabal,et al.  Pitch prediction filters in speech coding , 1989, IEEE Trans. Acoust. Speech Signal Process..

[2]  F. Klingholtz Acoustic recognition of voice disorders: a comparative study of running speech versus sustained vowels. , 1990, The Journal of the Acoustical Society of America.

[3]  Martin J. Ball,et al.  Voice Quality Measurement , 1999 .

[4]  N Yanagihara,et al.  Significance of harmonic changes and noise components in hoarseness. , 1967, Journal of speech and hearing research.

[5]  Jean Schoentgen,et al.  Spectral models of additive and modulation noise in speech and phonatory excitation signals. , 2003, The Journal of the Acoustical Society of America.

[6]  J. Schoentgen,et al.  Multivariate statistical analysis of flat vowel spectra with a view to characterizing dysphonic voices. , 2000, Journal of speech, language, and hearing research : JSLHR.

[7]  G. de Krom,et al.  Some spectral correlates of pathological breathy and rough voice quality for different types of vowel fragments. , 1995, Journal of speech and hearing research.

[8]  D. Jamieson,et al.  Acoustic discrimination of pathological voice: sustained vowels versus continuous speech. , 2001, Journal of speech, language, and hearing research : JSLHR.

[9]  Y. Qi,et al.  The estimation of signal-to-noise ratio in continuous speech for disordered voices. , 1999, The Journal of the Acoustical Society of America.