论文信息 - The dramatic piece reader for the blind and visually impaired

The dramatic piece reader for the blind and visually impaired

The paper presents the concept and realization of t he intelligent audio-book reader for the visually impa ired. The system is capable of presenting personalities of di fferent characters. The synthesizer mimics the way how a puppeteer portrays different characters. A traditional puppet eer generally uses up to a dozen different marionettes in one pie ce. Each of them impersonates a character with its own typical voice manifestation. We studied the techniques the puppet eer uses to change his voice and the acoustical correlates of t hese changes. The results are used to predict appropriat e settings of the parameters of the voice for every character of the piece. The information on the personality features of ever y particular character is inserted manually by a human operator. Similarly to the puppeteer’s show only one speaker’s voice is used in this concept and all the modifications are made usi ng speech synthesis methods.

Marián Trnka | Sakhia Darjaa | Milan Rusko | Juraj Hamar

[1] John H. L. Hansen,et al. Analysis and classification of speech mode: whispered through shouted , 2007, INTERSPEECH.

[2] Milan Rusko,et al. Multilinguality , singing synthesis , acoustic emoticons , and other extensions of the Slovak speech synthesizer for SMS reading , 2004 .

[3] Heiga Zen,et al. Statistical Parametric Speech Synthesis , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[4] Nadia Magnenat-Thalmann,et al. Imparting Individuality to Virtual Humans , 2002 .

[5] Alan W. Black,et al. Unit selection in a concatenative speech synthesis system using a large speech database , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[6] Ana Paiva,et al. Telling Stories with a Synthetic Character: Understanding Inter-modalities Relations , 2007, COST 2102 Workshop.

[7] Darjaa Sakhia,et al. Three Generations of Speech Synthesis Systems in Slovakia , 2006 .

[8] Milan Rusko,et al. Character Identity Expression in Vocal Performance of Traditional Puppeteers , 2006, TSD.

[9] Taniya Mishra,et al. Predicting Character-Appropriate Voices for a TTS-based Storyteller System , 2012, INTERSPEECH.

[10] Christophe d'Alessandro,et al. Designing French Tale Corpora for Entertaining Text To Speech Synthesis , 2012, LREC.

[11] H. A. Buurman,et al. Virtual Storytelling: Emotions for the narrator , 2007 .

[12] Milos Cernak,et al. Slovak Speech Database for Experiments and Application Building in Unit-Selection Speech Synthesis , 2004, TSD.

[13] John Laver,et al. The gift of speech , 1991 .

[14] Leo Breiman,et al. Classification and Regression Trees , 1984 .

[15] John Kane,et al. Detecting a targeted voice style in an audiobook using voice quality features , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[16] J. M. Digman. PERSONALITY STRUCTURE: EMERGENCE OF THE FIVE-FACTOR MODEL , 1990 .

[17] M. Rusko,et al. Acoustic, semantic and personality dimensions in the speech of traditional puppeteers , 2012, 2012 IEEE 3rd International Conference on Cognitive Infocommunications (CogInfoCom).

[18] R. McCrae,et al. An introduction to the five-factor model and its applications. , 1992, Journal of personality.