Preliminary glottal source modeling for pathologic voices

A first attempt at implementing a flexible model for the glottal source waveform of pathologic voices is described. The LF (Liljencrants & Fant) model is the source model used. We also add various noise types, shimmer and jitter to the excitation source in order to replicate more closely the pathologic glottal waveform. Various vocal characteristics are then modeled in order to evaluate the performance of the glottal source model.

[1]  Donald G. Childers,et al.  Speech processing and synthesis toolboxes , 1999 .

[2]  Lou Boves,et al.  Fitting a LF-model to inverse filter signals , 1993, EUROSPEECH.

[3]  Donald G. Childers,et al.  Modeling vocal disorders via formant synthesis , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[4]  J Kreiman,et al.  The perceptual structure of pathologic voice quality. , 1996, The Journal of the Acoustical Society of America.

[5]  Abeer Alwan,et al.  Analysis by synthesis of pathological voices using the Klatt synthesizer , 1997, Speech Commun..

[6]  D G Childers,et al.  Vocal quality factors: analysis, synthesis, and perception. , 1991, The Journal of the Acoustical Society of America.

[7]  SOURCE MODEL ADEQUACY FOR PATHOLOGICAL VOICE SYNTHESIS , 1999 .