Large vocabulary continuous speech recognition for Urdu

This paper presents the development of acoustic and language models for robust Urdu speech recognition using the open-source CMU Sphinx toolkit. Three models were developed incrementally, adding speech data from up to two speakers per pass: one trained on data from 40 female speakers only, one on data from 41 male speakers only, and one on data from all 81 speakers, male and female. The paper reports the current recognition results and discusses approaches for improving these recognition rates.
