Estimation of vocal dysperiodicities in disordered connected speech by means of distant-sample bidirectional linear predictive analysis.

The article presents an analysis of vocal dysperiodicities in connected speech produced by dysphonic speakers. The processing is based on a comparison of the present speech fragment with future and past fragments. The size of the dysperiodicity estimate is zero for periodic speech signals. A feeble increase of the vocal dysperiodicity is guaranteed to produce a feeble increase of the estimate. No spurious noise boosting occurs owing to cycle insertion and omission errors, or phonetic segment boundary artifacts. Additional objectives of the study have been investigating whether deviations from periodicity are larger or more commonplace in connected speech than in sustained vowels, and whether sentences that comprise frequent voice onsets and offsets are noisier than sentences that comprise few. The corpora contain sustained vowels as well as grammatically- and phonetically matched sentences. An acoustic marker that correlates with the perceived degree of hoarseness summarizes the size of the dysperiodicities. The marker values for sustained vowels have been highly correlated with those for connected speech, and the marker values for sentences that comprise few voiced/unvoiced transients have been highly correlated with the marker values for sentences that comprise many.

[1]  Elvira Mendoza Lara,et al.  High-frequency components of normal and dysphonic voices. , 1994 .

[2]  N Yanagihara,et al.  Significance of harmonic changes and noise components in hoarseness. , 1967, Journal of speech and hearing research.

[3]  Jean Schoentgen,et al.  Quantitative evaluation of the discrimination performance of acoustic features in detecting laryngeal pathology , 1982, Speech Commun..

[4]  F. Klingholtz Acoustic recognition of voice disorders: a comparative study of running speech versus sustained vowels. , 1990, The Journal of the Acoustical Society of America.

[5]  J. Laver,et al.  An acoustic screening system for the detection of laryngeal pathology , 1986 .

[6]  D. Jamieson,et al.  Acoustic discrimination of pathological voice: sustained vowels versus continuous speech. , 2001, Journal of speech, language, and hearing research : JSLHR.

[7]  Peter Kabal,et al.  Pitch prediction filters in speech coding , 1989, IEEE Trans. Acoust. Speech Signal Process..

[8]  J. Hillenbrand,et al.  Acoustic correlates of breathy vocal quality: dysphonic voices and continuous speech. , 1996, Journal of speech and hearing research.

[9]  Y. Qi,et al.  The estimation of signal-to-noise ratio in continuous speech for disordered voices. , 1999, The Journal of the Acoustical Society of America.

[10]  T. Baer,et al.  A pitch-synchronous analysis of hoarseness in running speech. , 1988, The Journal of the Acoustical Society of America.

[11]  John E. Markel,et al.  Linear Prediction of Speech , 1976, Communication and Cybernetics.

[12]  P. Lieberman Some Acoustic Measures of the Fundamental Periodicity of Normal and Pathologic Larynges , 1963 .

[13]  L. Dolansky,et al.  On certain irregularities of voiced-speech waveforms , 1968 .

[14]  T Shipp,et al.  Some acoustic and perceptual factors in acute-laryngitic hoarseness. , 1965, The Journal of speech and hearing disorders.

[15]  J. Makhoul Stable and efficient lattice methods for linear prediction , 1977 .

[16]  Margaret Kahn,et al.  The effects of five voice characteristics on LPC quality , 1983, ICASSP.

[17]  Y. Lebrun,et al.  Variations in vocal wave duration. , 1971, The Journal of laryngology and otology.

[18]  Guus de Krom,et al.  An experiment involving the consistency and reliability of voice quality ratings for different types of speech fragments , 1993, EUROSPEECH.

[19]  J. Schoentgen,et al.  Multivariate statistical analysis of flat vowel spectra with a view to characterizing dysphonic voices. , 2000, Journal of speech, language, and hearing research : JSLHR.