Simple4All proposals for the Albayzin Evaluations in Speech Synthesis

Simple4All is a European funded project that aims to stream- line the production of multilanguage expressive synthetic voices by means of unsupervised data extraction techniques, allowing the automatic pro- cess of freely available data into exible task specic rst technique introduces the principle of speaker adaptation from average models consisting of multiple voices, with the second being an extension of this adaptation concept into allowing the control of the expressive strength of the syn- thetic voice. Finally, an unsupervised approach to synthesis capable of learning from unlabelled text data is introduced in detail.

[1]  Takao Kobayashi,et al.  Average-Voice-Based Speech Synthesis Using HSMM-Based Speaker Adaptation and Adaptive Training , 2007, IEICE Trans. Inf. Syst..

[2]  Simon King,et al.  Detecting Acronyms from Capital Letter Sequences in Spanish , 2012, INTERSPEECH.

[3]  Heiga Zen,et al.  Details of the Nitech HMM-Based Speech Synthesis System for the Blizzard Challenge 2005 , 2007, IEICE Trans. Inf. Syst..

[4]  Oliver Watts,et al.  The CSTR/EMIME HTS system for Blizzard Challenge 2010 , 2010 .

[5]  Oliver Watts,et al.  Unsupervised learning for text-to-speech synthesis , 2013 .

[6]  Petr Sojka,et al.  Software Framework for Topic Modelling with Large Corpora , 2010 .

[7]  Heiga Zen,et al.  Speaker-Independent HMM-based Speech Synthesis System: HTS-2007 System for the Blizzard Challenge 2007 , 2007 .

[8]  Simon King,et al.  Multisyn: Open-domain unit selection for the Festival speech synthesis system , 2007, Speech Commun..

[9]  Kishore Prahallad,et al.  Significance of early tagged contextual graphemes in grapheme based speech synthesis and recognition systems , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[10]  Simon King,et al.  Speech synthesis without a phone inventory , 2009, INTERSPEECH.

[11]  Takao Kobayashi,et al.  Model Adaptation Approach to Speech Synthesis with Diverse Voices and Styles , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[12]  A.W. Black,et al.  Unit selection without a phoneme set , 2002, Proceedings of 2002 IEEE Workshop on Speech Synthesis, 2002..

[13]  Takao Kobayashi,et al.  Analysis of Speaker Adaptation Algorithms for HMM-Based Speech Synthesis and a Constrained SMAPLR Adaptation Algorithm , 2009, IEEE Transactions on Audio, Speech, and Language Processing.