Paper Information - Speaker Independent Urdu speech recognition using HMM

Speaker Independent Urdu speech recognition using HMM

Automatic Speech Recognition (ASR) is one of the advanced fields of Natural Language Processing (NLP). The recent past has witnessed valuable research in ASR for English, European, and East Asian languages. Unfortunately, South Asian languages in general, and Urdu in particular, have received very little attention. In this paper we present an approach to developing an ASR system for the Urdu language. The proposed system is based on Sphinx4, an open source speech recognition framework that uses a statistical approach (Hidden Markov Models) to build ASR systems. We present a speaker-independent ASR system for a small vocabulary of fifty-two isolated, most frequently spoken Urdu words, and suggest that this work will form the basis for developing medium- and large-vocabulary Urdu speech recognition systems.
