论文信息 - SPICE: web-based tools for rapid language adaptation in speech processing systems

SPICE: web-based tools for rapid language adaptation in speech processing systems

In this paper we describe the design and implementation of a user interface for SPICE, a web-based toolkit for rapid prototyping of speech and language processing components. We report on the challenges and experiences gathered from testing these tools in an advanced graduate hands-on course, in which we created speech recognition, speech synthesis, and smalldomain translation components for 10 different languages within only 6 weeks.

[1] Tanja Schultz,et al. Acoustic-Phonetic Unit Similarities For Context Dependent Acoustic Model Portability , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[2] Alan W. Black,et al. CLUSTERGEN: a statistical parametric synthesizer using trajectory modeling , 2006, INTERSPEECH.

[3] Tanja Schultz,et al. Globalphone: a multilingual speech and text database developed at karlsruhe university , 2002, INTERSPEECH.

[4] Aneliya Mircheva. Bulgarian Speech Recognition and Multilingual Language Modeling , 2006 .

[5] Tanja Schultz,et al. TOWARDS RAPID LANGUAGE PORTABILITY OF SPEECH PROCESSING SYSTEMS , 2004 .

[6] Alan W. Black,et al. Learning Pronunciation Dictionaries: Language Complexity and Word Selection Strategies , 2006, NAACL.

[7] Etienne Barnard,et al. The efficient generation of pronunciation dictionaries: machine learning factors during bootstrapping , 2004, INTERSPEECH.

[8] G. B. Varile. Multilingual Speech Processing , 2005 .

[9] Tanja Schultz,et al. Rapid Development of an Afrikaans English Speech-to-Speech Translator , 2005, IWSLT.

[10] Tanja Schultz,et al. Speaker Clustering for Multilingual Synthesis , 2006 .

[11] Tanja Schultz,et al. Challenges with Rapid Adaptation of Speech Translation Systems to New Language Pairs , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[12] A. Waibel,et al. A one-pass decoder based on polymorphic linguistic context assignment , 2001, IEEE Workshop on Automatic Speech Recognition and Understanding, 2001. ASRU '01..