SPICE: web-based tools for rapid language adaptation in speech processing systems

In this paper we describe the design and implementation of a user interface for SPICE, a web-based toolkit for rapid prototyping of speech and language processing components. We report on the challenges and experiences gathered from testing these tools in an advanced graduate hands-on course, in which we created speech recognition, speech synthesis, and smalldomain translation components for 10 different languages within only 6 weeks.

[1]  Tanja Schultz,et al.  Acoustic-Phonetic Unit Similarities For Context Dependent Acoustic Model Portability , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[2]  Alan W. Black,et al.  CLUSTERGEN: a statistical parametric synthesizer using trajectory modeling , 2006, INTERSPEECH.

[3]  Tanja Schultz,et al.  Globalphone: a multilingual speech and text database developed at karlsruhe university , 2002, INTERSPEECH.

[4]  Aneliya Mircheva Bulgarian Speech Recognition and Multilingual Language Modeling , 2006 .

[5]  Tanja Schultz,et al.  TOWARDS RAPID LANGUAGE PORTABILITY OF SPEECH PROCESSING SYSTEMS , 2004 .

[6]  Alan W. Black,et al.  Learning Pronunciation Dictionaries: Language Complexity and Word Selection Strategies , 2006, NAACL.

[7]  Etienne Barnard,et al.  The efficient generation of pronunciation dictionaries: machine learning factors during bootstrapping , 2004, INTERSPEECH.

[8]  G. B. Varile Multilingual Speech Processing , 2005 .

[9]  Tanja Schultz,et al.  Rapid Development of an Afrikaans English Speech-to-Speech Translator , 2005, IWSLT.

[10]  Tanja Schultz,et al.  Speaker Clustering for Multilingual Synthesis , 2006 .

[11]  Tanja Schultz,et al.  Challenges with Rapid Adaptation of Speech Translation Systems to New Language Pairs , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[12]  A. Waibel,et al.  A one-pass decoder based on polymorphic linguistic context assignment , 2001, IEEE Workshop on Automatic Speech Recognition and Understanding, 2001. ASRU '01..