Comparison of Grapheme and Phoneme Based Acoustic Modeling in LVCSR Task in Slovak

Phonemes and allophones are the basic speech units for acoustic modeling in the majority of contemporary HMM based speech recognizers. Grapheme-based acoustic sub-word units were applied to multi-lingual and cross-lingual acoustic modeling in many tasks. Grapheme and phoneme based mono-, cross- and bilingual speech recognition of Czech and Slovak in the small and medium vocabulary task has been studied in our previous work. In this article we compare grapheme and phoneme based approach to acoustic modeling and model unit selection in large vocabulary continuous speech recognition (LVCSR) task in Slovak. The main goal of our experimental work is to investigate a possibility to select an optimal set of sub-word units for Slovak LVCSR system.

[1]  Tanja Schultz,et al.  A Grapheme Based Speech Recognition System for Russian , 2004 .

[2]  Hermann Ney,et al.  Context-dependent acoustic modeling using graphemes for large vocabulary speech recognition , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  Samy Bengio,et al.  Joint decoding for phoneme-grapheme continuous speech recognition , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4]  Heinrich Niemann,et al.  Automatic speech recognition without phonemes , 1993, EUROSPEECH.

[5]  Tanja Schultz,et al.  TOWARDS RAPID LANGUAGE PORTABILITY OF SPEECH PROCESSING SYSTEMS , 2004 .

[6]  S. Bengio,et al.  Phoneme-grapheme based speech recognition system , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).

[7]  Tanja Schultz,et al.  Grapheme based speech recognition , 2003, INTERSPEECH.

[8]  Laurent Besacier,et al.  Comparison of acoustic modeling techniques for Vietnamese and Khmer ASR , 2006, INTERSPEECH.

[9]  Franz Kummert,et al.  Grapheme based speech recognition for large vocabularies , 2000, INTERSPEECH.

[10]  Hermann Ney,et al.  Multilingual acoustic modeling using graphemes , 2003, INTERSPEECH.

[11]  Tanja Schultz,et al.  Thai Grapheme-Based Speech Recognition , 2006, NAACL.

[12]  Jozef Juhár,et al.  Comparison of Slovak and Czech speech recognition based on grapheme and phoneme acoustic models , 2006, INTERSPEECH.

[13]  Narada D. Warakagoda,et al.  A Noise Robust Multilingual Reference Recogniser Based on Speechdat(II) , 2000, INTERSPEECH.