Analisis y síntesis de expresión emocional en cuentos leídos en voz alta

An important challenge for text-to-speech is to get a synthesized voice that sounds as like as possible to the human voice. The voice synthesized by these systems sounds artificial and this is the most principal cause of rejection by the public at the moment. In order to obtain a lively synthesized voice it is necessary to generate a voice with emotions. The main goal of the generation of emotional voice is try to generate an emotion so clear that there will be no confusion in the listener. There are a lot of theories in order to define an emotional scale. The choice of a specific scale determines the emotions that we try to distinguish. Another important challenge is analyse the acoustic characteristics at different emotional states in order to try to regenerate the same characteristics by the synthesizer (Montero, 2003). This project raises to explorer the possibility of model the lack of the tales through control parameters in the synthesizer. In order to obtain these parameters we have to carry out an analysis of emotional audio and then, once we have obtained a model, we have carried out a test.