论文信息 - Development of Speech corpora for different Speech Recognition tasks in Malayalam language

Development of Speech corpora for different Speech Recognition tasks in Malayalam language

Speech corpus is the backbone of an Automatic speech Recognition system. This paper presents the development of speech corpora for different speech recognition tasks in Malayalam language. Pronunciation dictionary and Transcription file which are the other two essential resources for building a speech recognizer are also being created. Speech recognition performance of different speech recognition tasks are being presented. Speech corpus of about 18 hours have been collected for different speech recognition tasks. Keywords— Speech Recognition, corpus development, Malayalam

Cini Kurian | Cini Kurian

[1] T. Mohanan,et al. Lexical phonology of the consonant system in Malayalam , 1984 .

[2] Raj Reddy,et al. Large-vocabulary speaker-independent continuous speech recognition: the sphinx system , 1988 .

[3] Sreedivya Radhakrishnan. Perception of synthetic vowels by monolingual and bilingual Malayalam speakers , 2009 .

[4] Biing-Hwang Juang,et al. Minimum classification error rate methods for speech recognition , 1997, IEEE Trans. Speech Audio Process..

[5] Michael Picheny,et al. A method for the construction of acoustic Markov models for words , 1993, IEEE Trans. Speech Audio Process..

[6] C. K. Yuen,et al. Theory and Application of Digital Signal Processing , 1978, IEEE Transactions on Systems, Man, and Cybernetics.

[7] Steve Young,et al. Acoustic Modelling for Large Vocabulary Continuous Speech Recognition , 1999 .

[8] Biing-Hwang Juang,et al. Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[9] Stephen E. Levinson,et al. Speaker independent isolated digit recognition using hidden Markov models , 1982 .