Arabic and English speech recognition using cross-language acoustic models

In recent years there has been an increasing interest in speech and language processing systems dedicated to Arabic language. In order to perform adequate design and evaluation of those systems, speech databases are needed. The aim of this paper is to evaluate the design of Arabic and English speech recognition systems by using common acoustic models. Cross-language experiments between Arabic and English are conducted and discussed with respect to the main class of phonemes in each language. The LDC WestPoint Arabic database and TIMIT are used in these experiments. The results show that lack of enough speech resource that faces Arabic language can be solved by considering models' features of common phonemes given by English.

[1]  A. Jongman,et al.  Acoustic characteristics of English fricatives. , 2000, The Journal of the Acoustical Society of America.

[2]  Elizabeth C. Botha,et al.  Cross-language use of acoustic information for automatic speech recognition , 2002, Speech Commun..

[3]  Roger K. Moore,et al.  Cross-Language Phone Recognition when the Target Language Phoneme Inventory is not Known , 2011, INTERSPEECH.

[4]  R. Port,et al.  Cross-Language Phonetic Interference: Arabic to English , 1981 .

[5]  Kai Feng,et al.  The subspace Gaussian mixture model - A structured model for speech recognition , 2011, Comput. Speech Lang..

[6]  J Hillenbrand,et al.  Differential use of temporal cues to the /s/-/z/ contrast by native and non-native speakers of English. , 1986, The Journal of the Acoustical Society of America.

[7]  M A Mines,et al.  Frequency of Occurrence of Phonemes in Conversational English , 1978, Language and speech.

[8]  Sid-Ahmed Selouani,et al.  Evaluating the MSA West Point Speech Corpus , 2009, Int. J. Comput. Process. Orient. Lang..

[9]  R. Bayeh,et al.  Broadcast News Transcription Baseline System using the NEMLAR database , 2006 .

[10]  Nizar Habash,et al.  Parsing Arabic Dialects , 2006, EACL.

[11]  Chafic Mokbel,et al.  Towards multilingual speech recognition using data driven source/target acoustical units association , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[12]  Tanja Schultz,et al.  Language-independent and language-adaptive acoustic modeling for speech recognition , 2001, Speech Commun..

[13]  Nizar Habash,et al.  Spoken Arabic Dialect Identification Using Phonotactic Modeling , 2009, SEMITIC@EACL.