Estimation of a physical model of the vocal folds via dynamic programming techniques

This work presents a procedure for the estimation of a two-mass vocal fold model starting from a time-varying target flow signal. The model is specified by a large number of physical parameters, computed as functions of four articulatory parameters (three laryngeal muscle activations and subglottal pressure). Flow waveforms synthesized by the model are characterized by means of a set of typical voice source quantification acoustic parameters. Given a sequences of target acoustic parameters, dynamic programming techniques and interpolation based on Radial Basis Function Networks are used to derive sequences of articulatory parameters that lead to resynthesis of the target signal.

[1]  Rnj Raymond Veldhuis,et al.  A symmetrical two-mass vocal-fold model coupled to vocal tract and trachea, with application to prosthesis design , 1998 .

[2]  P. Alku,et al.  A comparison of glottal voice source quantification parameters in breathy, normal and pressed phonation of female and male speakers. , 1996, Folia phoniatrica et logopaedica : official organ of the International Association of Logopedics and Phoniatrics.

[3]  Allen Gersho,et al.  Vector quantization and signal compression , 1991, The Kluwer international series in engineering and computer science.

[4]  Paavo Alku,et al.  A toolkit for voice inverse filtering and parametrisation , 2005, INTERSPEECH.

[5]  I. Titze,et al.  Rules for controlling low-dimensional vocal fold models with muscle activation. , 2002, The Journal of the Acoustical Society of America.

[6]  D. Sciamarella On the Acoustic Sensitivity of a Symmetrical Two-Mass Model of the Vocal Folds to the Variation o fC ontrol Parameters , 2004 .

[7]  F. Girosi,et al.  Networks for approximation and learning , 1990, Proc. IEEE.

[8]  Carlo Drioli,et al.  Physiological control of low-dimensional glottal models with applications to voice source parameter matching , 2005, MAVEBA.

[9]  P. Alku,et al.  Normalized amplitude quotient for parametrization of the glottal flow. , 2002, The Journal of the Acoustical Society of America.

[10]  I. Titze,et al.  Acoustic interactions of the voice source with the lower vocal tract. , 1997, The Journal of the Acoustical Society of America.

[11]  Juergen Schroeter,et al.  Speech coding based on physiological models of speech production , 1992 .