Speech Recognition System and Formant Based Analysis of Spoken Arabic Vowels

Arabic is one of the world's oldest languages and is currently the second most spoken language in terms of number of speakers. However, it has not received much attention from the traditional speech processing research community. This study is specifically concerned with the analysis of vowels in modern standard Arabic dialect. The first and second formant values in these vowels are investigated and the differences and similarities between the vowels are explored using consonant-vowels-consonant (CVC) utterances. For this purpose, an HMM based recognizer was built to classify the vowels and the performance of the recognizer analyzed to help understand the similarities and dissimilarities between the phonetic features of vowels. The vowels are also analyzed in both time and frequency domains, and the consistent findings of the analysis are expected to facilitate future Arabic speech processing tasks such as vowel and speech recognition and classification.

[1]  Daniel Newman,et al.  Frequency analysis of Arabic vowels in connected speech , 2002 .

[2]  Anna Esposito,et al.  Biometric ID Management and Multimodal Communication, Joint COST 2101 and 2102 International Conference, BioID_MultiComm 2009, Madrid, Spain, September 16-18, 2009. Proceedings , 2009, COST 2101/2102 Conference.

[3]  T. M. Nazmy,et al.  A Novel Method for Arabic Consonant/Vowel Segmentation Using Wavelet Transform , 2005, Egypt. Comput. Sci. J..

[4]  Yousif A. El-Imam An unrestricted vocabulary Arabic speech synthesis system , 1989, IEEE Trans. Acoust. Speech Signal Process..

[5]  Mohd Yamani Idna Idris,et al.  Quranic Verse Recitation Feature Extraction using Mel-Frequency Cepstral Coefficient (MFCC) , 2008 .

[6]  M. Alkanhal,et al.  A Manual System to Segment and Transcribe Arabic Speech , 2007, 2007 IEEE International Conference on Signal Processing and Communications.

[7]  John H. L. Hansen,et al.  Discrete-Time Processing of Speech Signals , 1993 .

[8]  Shahid Masud,et al.  On Vowels Segmentation and Identification Using Formant Transitions in Continuous Recitation of Quranic Arabic , 2008, New Challenges in Applied Intelligence Technologies.

[9]  Andreas Spanias,et al.  High-performance alphabet recognition , 1996, IEEE Trans. Speech Audio Process..

[10]  Yousef Ajami Alotaibi,et al.  Formant Based Analysis of Spoken Arabic Vowels , 2009, COST 2101/2102 Conference.

[11]  Ngoc Thanh Nguyen,et al.  New Challenges in Applied Intelligence Technologies , 2008, New Challenges in Applied Intelligence Technologies.

[12]  Donald G. Childers,et al.  Speech Processing , 1999 .

[13]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[14]  Biing-Hwang Juang,et al.  Hidden Markov Models for Speech Recognition , 1991 .