Hybrid SVM/HMM model for the arab phonemes recognition

Hidden Markov Models (HMM) are currently widely used in Automatic Speech Recognition (ASR) as being the most effective models. Yet, they sometimes pose some problems of discrimination. The hybridization of Artificial Neural Networks (ANN) in particular Multi Layer Perceptrons (MLP) with HMM is a promising technique to overcome these limitations. In order to ameliorate results of recognition system, we use Support Vector Machines (SVM) witch characterized by a high predictive power and discrimination. The incorporation of SVM with HMM brings into existence of the new system of ASR. So, by using 2800 occurrences of Arabic phonemes, this work arises a comparative study of our acknowledgment system of it as the following: The use of especially the HMM standards lead to a recognition rate of 66.98%. Also, with the hybrid system MLP/HMM we succeed in achieving the value of 73.78%. Moreover, our proposed system SVM/HMM realizes the best performances, whereby, we achieve 75.8% as a recognition frequency.

[1]  Abduladhem A. Ali Arabic Phoneme Recognition using Hierarchical Neural Fuzzy Petri Net and LPC Feature Extraction , 2009 .

[2]  Ahmad Emami,et al.  Empirical study of neural network language models for Arabic speech recognition , 2007, 2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU).

[3]  Hassan Satori,et al.  Investigation arabic speech recognition using CMU sphinx system , 2009, Int. Arab J. Inf. Technol..

[4]  Hervé Bourlard,et al.  Connectionist Speech Recognition: A Hybrid Approach , 1993 .

[5]  Yousef Ajami Alotaibi Investigating spoken Arabic digits in speech recognition setting , 2005, Inf. Sci..

[6]  Carmen Peláez-Moreno,et al.  Hybrid Models for Automatic Speech Recognition: A Comparison of Classical ANN and Kernel Based Methods , 2007, NOLISP.

[7]  Marco Gori,et al.  A survey of hybrid ANN/HMM models for automatic speech recognition , 2001, Neurocomputing.

[8]  Gérard Chollet,et al.  Confidence measures for keyword spotting using support vector machines , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[9]  Bernhard Schölkopf,et al.  A Compression Approach to Support Vector Model Selection , 2004, J. Mach. Learn. Res..

[10]  Rafik Djemili,et al.  A combination approach of Gaussian mixture models and support vector machines for speaker identification , 2009, Int. Arab J. Inf. Technol..

[11]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[12]  S. Masud,et al.  A novel approach to increase the robustness of speaker independent Arabic speech recognition , 2003, 7th International Multi Topic Conference, 2003. INMIC 2003..

[13]  Kurt Hornik,et al.  Some new results on neural network approximation , 1993, Neural Networks.

[14]  Paolo Fiorini,et al.  Hybrid HMM/SVM model for the analysis and segmentation of teleoperation tasks , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[15]  Sofiane Baloul Développement d'un système automatique de synthèse de la parole à partir du texte arabe standard voyellé , 2003 .

[16]  Marwan Al-Zabibi An acoustic-phonetic approach in automatic arabic speech recognition , 1990 .

[17]  M. Harti,et al.  Arabic Speech Recognition System Based on CMUSphinx , 2007, 2007 International Symposium on Computational Intelligence and Intelligent Informatics.

[18]  S. Osowski,et al.  MLP and SVM networks - a comparative study , 2004, Proceedings of the 6th Nordic Signal Processing Symposium, 2004. NORSIG 2004..

[19]  Climent Nadeu,et al.  Comparison and combination of features in a hybrid HMM/MLP and a HMM/GMM speech recognition system , 2005, IEEE Transactions on Speech and Audio Processing.

[20]  Hervé Bourlard,et al.  Using KL-based acoustic models in a large vocabulary recognition task , 2008, INTERSPEECH.

[21]  M.M. Awais,et al.  Recognition of Arabic phonemes using fuzzy rule base system , 2003, 7th International Multi Topic Conference, 2003. INMIC 2003..

[22]  Arlo Faria,et al.  I N T E R N a T I O N a L C O M P U T E R S C I E N C E an Investigation of Tandem Mlp Features for Asr , 2022 .

[23]  S. Elmougy,et al.  A comparison of combined classifier architectures for Arabic Speech Recognition , 2008, 2008 International Conference on Computer Engineering & Systems.

[24]  S.H. El-Ramly,et al.  Neural networks used for speech recognition , 2002, Proceedings of the Nineteenth National Radio Science Conference.

[25]  Pietro Laface,et al.  Adaptation of Hybrid ANN/HMM Models Using Linear Hidden Transformations and Conservative Training , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[26]  Daniel P. W. Ellis,et al.  Speech and Audio Signal Processing - Processing and Perception of Speech and Music, Second Edition , 1999 .

[27]  K. R. Al-Balushi,et al.  Artificial neural networks and support vector machines with genetic algorithm for bearing fault detection , 2003 .

[28]  Amer Al-Nassiri,et al.  Arabic phoneme recognition using neural networks , 2006 .

[29]  Valeri Mladenov,et al.  Neural networks used for speech recognition , 2010 .

[30]  Hervé Bourlard,et al.  Confidence Measures in Hybrid HMM/ANN Speech Recognition , 1998 .

[31]  Mokhtar Sellami,et al.  A hybrid approach for Arabic speech recognition , 2003 .

[32]  Y.A. Alotaibi,et al.  Spoken Arabic digits recognizer using recurrent neural networks , 2004, Proceedings of the Fourth IEEE International Symposium on Signal Processing and Information Technology, 2004..

[33]  A. Alimi,et al.  Beta fuzzy neural network application in recognition of spoken isolated arabic words , 2002 .

[34]  Hervé Bourlard,et al.  Continuous speech recognition , 1995, IEEE Signal Process. Mag..

[35]  Hervé Bourlard,et al.  Task independent and dependent training: performance comparison of HMM and hybrid HMM/MLP approaches , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[36]  Jeff A. Bilmes,et al.  Natural statistical models for automatic speech recognition , 1999 .

[37]  V. Vapnik Estimation of Dependences Based on Empirical Data , 2006 .

[38]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.