论文信息 - Automatic Identification of Arabic Dialects USING Hidden Markov Models

Automatic Identification of Arabic Dialects USING Hidden Markov Models

The Arabic language has many different dialects, they must beidentified before Automatic Speech Recognition can take place.This thesis examines the difficult task of properly identifyingvarious Arabic dialects. We also present a novel design of anArabic dialect identification system using Hidden Markov Models(HMM). Due to the similarities and the differences between Arabicdialects, we build a ergodic HMM that has two types of states; oneof them represents the common sounds across Arabic dialects, whilethe other represents the unique sounds of the specific dialect. Wetie the common states across all models since they share the samesounds. We focus only on two major dialects: Egyptian and theGulf. An improved initialization process is used to achieve betterArabic dialect identification. Moreover, we utilize many differentcombinations of speech features related to MFCC such as timederivatives, energy, and the Shifted Delta Cepstra in training andtesting the system. We present a detailed comparison of theperformance of our Arabic dialect identification system using thedifferent combinations. The best result of the Arabic dialectidentification system is 96.67\% correct identification.

Fawzi Suliman Alorifi | F. Alorifi

[1] Jeff A. Bilmes,et al. A gentle tutorial of the em algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models , 1998 .

[2] Marc A. Zissman,et al. Automatic language identification using Gaussian mixture and hidden Markov models , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3] Douglas A. Reynolds,et al. A Gaussian mixture modeling approach to text-independent speaker identification , 1992 .

[4] B. Atal. Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification. , 1974, The Journal of the Acoustical Society of America.

[5] Biing-Hwang Juang,et al. Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[6] Albino Nogueiras,et al. Orientel: speech-based interactive communication applications for the mediterranean and the middle east , 2002, INTERSPEECH.

[7] D J Van Tasell,et al. Electrode ranking of "place pitch" and speech recognition in electrical hearing. , 1995, The Journal of the Acoustical Society of America.

[8] Jonathan Lareau. Application of shifted delta cepstral features for GMM language identification , 2006 .

[9] John H. L. Hansen,et al. Discrete-Time Processing of Speech Signals , 1993 .

[10] ITAHASHI Shuichi,et al. Discrimination of Spoken Languages and Dialects , 1994 .

[11] M. A. Kohler,et al. Language identification using shifted delta cepstra , 2002, The 2002 45th Midwest Symposium on Circuits and Systems, 2002. MWSCAS-2002..

[12] Richard M. Schwartz,et al. Probabilistic models for topic detection and tracking , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[13] Bruce Ingham,et al. Najdi Arabic: Central Arabian , 1994 .

[14] S. Al-Ani. Arabic Phonology: An Acoustical and Physiological Investigation , 1970 .

[15] Yousef Ajami Alotaibi,et al. Pharyngeal and emphatic sounds in arabic speech recognition , 1997 .

[16] Alex Acero,et al. Spoken Language Processing: A Guide to Theory, Algorithm and System Development , 2001 .

[17] Douglas A. Reynolds,et al. Approaches to language identification using Gaussian mixture models and shifted delta cepstral features , 2002, INTERSPEECH.

[18] Abdulhadi S. Al-Otaibi. Arabic speech processing: syllabic segmentation and speech recognition , 1988 .

[19] S. Young. Large Vocabulary Continuous Speech Recognition : a ReviewSteve , 1996 .

[20] M.G. Bellanger,et al. Digital processing of speech signals , 1980, Proceedings of the IEEE.

[21] John J. Ohala,et al. Prosody as a distinctive feature for the discrimination of arabic dialects , 1999, EUROSPEECH.

[22] Jr. J.P. Campbell,et al. Speaker recognition: a tutorial , 1997, Proc. IEEE.

[23] Pavel Matejka,et al. Automatic Language Identification Using Phoneme and Automatically Derived Unit Strings , 2004, TSD.

[24] Rupert G. Miller. The jackknife-a review , 1974 .

[25] Alex Bateman,et al. An introduction to hidden Markov models. , 2007, Current protocols in bioinformatics.

[26] Douglas A. Reynolds,et al. Robust text-independent speaker identification using Gaussian mixture speaker models , 1995, IEEE Trans. Speech Audio Process..

[27] Steve Young,et al. The HTK book version 3.4 , 2006 .

[28] Marwan Al-Zabibi. An acoustic-phonetic approach in automatic arabic speech recognition , 1990 .

[29] Biing-Hwang Juang,et al. Hidden Markov Models for Speech Recognition , 1991 .

[30] Treebank Penn,et al. Linguistic Data Consortium , 1999 .

[31] Khalid Choukri,et al. OrienTel – Arabic speech resources for the IT market , 2002 .

[32] J. Picone,et al. Continuous speech recognition using hidden Markov models , 1990, IEEE ASSP Magazine.

[33] V. Ramasubramanian,et al. Automatic language identification using ergodic-HMM , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[34] Hermann Ney,et al. A word graph algorithm for large vocabulary continuous speech recognition , 1994, Comput. Speech Lang..

[35] M. Savic,et al. An automatic language identification system , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[36] Melissa Barkat,et al. IDENTIFICATION OF ARABIC DIALECTS AND EXPERIMENTAL DETERMINATION OF DISTINCTIVE CUES , 1999 .

[37] Frederick Jelinek,et al. Statistical methods for speech recognition , 1997 .

[38] Marc A. Zissman,et al. Automatic dialect identification of extemporaneous conversational, Latin American Spanish speech , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[39] David R. Miller,et al. Statistical dialect classification based on mean phonetic features , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[40] M. H. Bakalla. Arabic Culture: Through Its Language and Literature , 1984 .

[41] Procházka,et al. Saudi Arabian Dialects , 1988 .

[42] Lawrence R. Rabiner,et al. A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[43] K. Versteegh. The Arabic Language , 1997 .