Incorporating finer acoustic phonetic features in lexicon for Hindi language speech recognition

Abstract Lexicon, used in speech recognition, is the list of words and corresponding pronunciation using a standard acoustic notation. Currently ILSL12 phone-set is widely used for Indian Language speech recognition. However, it has limitations to represent features of the speech like, voicing, fricatives, etc. that are found in Indian Languages. This paper addresses this limitation by considering the voiced and unvoiced features for Hindi speech recognition by incorporating finer representations at the time of lexicon expansion. The approach is tested for Hindi word recognition and has shown significant improvement in WER.

[1]  P. V. S. Rao,et al.  Hindi speech database , 2000, INTERSPEECH.

[2]  Daniel Povey,et al.  The Kaldi Speech Recognition Toolkit , 2011 .

[3]  Anil Kumar Singh A Computational Phonetic Model for Indian Language Scripts , 2006 .

[4]  Mahua Bhattacharya,et al.  Speech based dialog query system over asterisk PBX server , 2010, 2010 2nd International Conference on Signal Processing Systems.