论文信息 - Multipopulation genetic learning of midsagittal articulatory models for speech synthesis

Multipopulation genetic learning of midsagittal articulatory models for speech synthesis

This paper discusses an application of multipopulation continuous genetic algorithms to learning of vocal tract configurations on a midsagittal plane. Speaker dependent and independent target signal corpora are formed and processed by the genetic approach, which evolves populations of articulatory vectors in order to approximate the acoustic traits of artificial utterances to those of natural signals in the corpora. Analyzed signals correspond to venezuelan spanish speakers, increasing novelty of the study. Subjective evaluations have confirmed effectiveness of the method, reaching a 19% recognition error for speaker independent trials, and no error for the speaker dependent case.

José Brito | Wladimir Rodriguez | W. Rodriguez | José Brito

[1] Donald G. Childers,et al. Speech processing and synthesis toolboxes , 1999 .

[2] Richard S. McGowan,et al. Recovering articulatory movement from formant frequency trajectories using task dynamics and a genetic algorithm: Preliminary model tests , 1994, Speech Commun..

[3] Shinji Maeda,et al. A digital simulation method of the vocal-tract system , 1982, Speech Commun..

[4] P. Mermelstein. Articulatory model for the study of speech production. , 1973, The Journal of the Acoustical Society of America.

[5] Hani Yehia,et al. A method to combine acoustic and morphological constraints in the speech production inverse problem , 1996, Speech Commun..

[6] Man Mohan Sondhi,et al. Techniques for estimating vocal-tract shapes from the speech signal , 1994, IEEE Trans. Speech Audio Process..

[7] Miguel Á. Carreira-Perpiñán,et al. Continuous latent variable models for dimensionality reduction and sequential data reconstruction , 2001 .

[8] J. Flanagan. Speech Analysis, Synthesis and Perception , 1971 .

[9] Abraham Kandel,et al. A fuzzy information space approach to speech signal non-linear analysis , 2000, Int. J. Intell. Syst..

[10] P. Denes,et al. The speech chain : the physics and biology of spoken language , 1963 .

[11] Qiguang Lin. Speech production theory and articulatory speech synthesis , 1991 .

[12] Elliot Saltzman,et al. Task Dynamic Coordination of the Speech Articulators: A Preliminary Model , 1986 .

[13] Abraham Kandel,et al. Similarity of dynamical systems , 1998 .

[14] Elliot Saltzman,et al. The dynamical perspectives on speech production: Data and theory , 1986 .