Phoneme Set and Pronouncing Dictionary Creation for Large Vocabulary Continuous Speech Recognition of Vietnamese

This paper describes our study of two basic problems in large vocabulary continuous speech recognition (LVCSR) of Vietnamese. The results are intended as a standard reference for Vietnamese researchers and for other researchers interested in the Vietnamese language. First, we propose a standard phoneme set together with its corresponding grapheme-to-phoneme map. This phoneme set is the basis for solving other problems in Vietnamese LVCSR. We then describe how a standard pronouncing dictionary is created from the grapheme-to-phoneme map and an analysis of Vietnamese syllable structure. Finally, we present LVCSR results obtained with different types of pronouncing dictionaries. These results reveal some interesting aspects of the Vietnamese language, such as the structure of the Vietnamese syllable and the effect of tone in relation to the syllable.
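As a rough illustration of how a dictionary entry could be derived from syllable analysis, the sketch below splits a written Vietnamese syllable into an onset, a rime, and a tone. It is not the phoneme set or grapheme-to-phoneme map proposed in the paper: the onset list, the tone labels, and the function names `split_tone`, `split_onset`, and `dict_entry` are all assumptions made for this example, and the output units are graphemes rather than phonemes. Special onsets such as "gi" and "qu" are deliberately left unhandled.

```python
import unicodedata

# Combining marks that encode the five marked tones in Vietnamese orthography.
# The tone labels are illustrative names, not the paper's symbols.
TONE_MARKS = {
    "\u0301": "sac",    # acute
    "\u0300": "huyen",  # grave
    "\u0309": "hoi",    # hook above
    "\u0303": "nga",    # tilde
    "\u0323": "nang",   # dot below
}

# Simplified onset grapheme list (assumption); longest entries are tried first.
ONSETS = sorted(
    ["ngh", "ng", "nh", "gh", "kh", "ph", "th", "tr", "ch",
     "b", "c", "d", "đ", "g", "h", "k", "l", "m", "n",
     "p", "r", "s", "t", "v", "x"],
    key=len,
    reverse=True,
)


def split_tone(syllable):
    """Return (toneless syllable, tone label); unmarked syllables are 'ngang'."""
    decomposed = unicodedata.normalize("NFD", syllable.lower())
    tone = "ngang"
    kept = []
    for ch in decomposed:
        if ch in TONE_MARKS:
            tone = TONE_MARKS[ch]
        else:
            kept.append(ch)  # keeps vowel-quality marks (circumflex, breve, horn)
    return unicodedata.normalize("NFC", "".join(kept)), tone


def split_onset(base):
    """Split a toneless syllable into (onset, rime) by longest onset match."""
    for onset in ONSETS:
        if base.startswith(onset):
            return onset, base[len(onset):]
    return "", base


def dict_entry(syllable, with_tone=True):
    """Build one tab-separated dictionary line: headword, then its units."""
    base, tone = split_tone(syllable)
    onset, rime = split_onset(base)
    units = [u for u in (onset, rime) if u]
    if with_tone and units:
        units[-1] += "_" + tone  # attach the tone label to the last unit
    headword = unicodedata.normalize("NFC", syllable)
    return headword + "\t" + " ".join(units)


if __name__ == "__main__":
    for word in ["Việt", "tiếng", "nghiêng", "ăn"]:
        print(dict_entry(word))
        print(dict_entry(word, with_tone=False))
```

Toggling `with_tone` gives two dictionary variants, one with tone-dependent units and one without, which is one simple way to set up the kind of comparison between dictionary types that the abstract mentions.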
