The AHOLAB Blizzard Challenge 2008 Entry

This paper describes how we built unit selection voices for our participation in the Blizzard Challenge 2008. Of the three voices required (15 hours of UK English, a 1-hour UK English subset, and 6.5 hours of Mandarin Chinese), we built only the two English ones.
