A Platform for Multilingual Research in Spoken Dialogue Systems

Abstract: Multilingual speech technology research would be greatly facilitated by an integrated and comprehensive set of software tools that enable research and development of core language technologies and interactive language systems in any language. Such a multilingual platform has been one of our goals in developing the CSLU Toolkit. The Toolkit is composed of components that are essentially language-independent and that support research and development of recognition, understanding, text-to-speech synthesis, facial animation, and spoken dialogue systems. Portions of the Toolkit have already been ported to Italian, German, and Vietnamese. In addition, a complete Mexican-Spanish version of the Toolkit has been created and is in daily use at the Universidad de las Americas in Puebla (UDLA). In this paper we outline some of the issues involved in porting the Toolkit to a new language, and describe why the Toolkit is well suited to multilingual adaptation.
