Paper information - Bilingual speech corpus in two phonetically similar languages

As speech recognition systems improve, they become suitable for tackling new problems. Multilingual speech recognition is one such problem. In the present work, the case of the Comunitat Valenciana multilingual environment is studied. The official languages in the Comunitat Valenciana (Spanish and Valencian) share most of their acoustic units, and their vocabularies and syntax are quite similar. They have influenced each other for many years. A small corpus on an Information System task was developed for experimentation purposes. This choice will make it possible to develop a working prototype in the future, and the task is simple enough to build semi-automatic language models. The design of the acoustic corpus is discussed, showing that all combinations of accents have been studied (native and non-native speakers, male, female, etc.).

Vicente Alabau | Carlos D. Martínez-Hinarejos
