Syllable-based and hybrid acoustic models for Amharic speech recognition

This paper presents the results of our experiments on the use of hybrid acoustic units in speech recognition and the use of syllable and hybrid acoustic models (AM) in morphemebased speech recognition. Although hybrid AMs did not bring improvement in speech recognition performance when words are used as dictionary entries and units in a language model (LM), we observed a significant word error rate (WER) reduction (compared to triphone-based systems) in morpheme-based speech recognition. Syllable AMs also led to a significant WER reduction over the triphone-based systems. It was possible to obtain a 3% absolute WER reduction as a result of using syllable acoustic units. Generally, our result shows that syllable and hybrid AMs are best fitted in morpheme-based speech recognition.

[1]  Kishore Prahallad,et al.  Unit selection voice for Amharic using Festvox , 2004, SSW.

[2]  Wolf Leslau,et al.  Introductory grammar of Amharic , 2002 .

[3]  Solomon Teferra Abate,et al.  An Amharic speech corpus for large vocabulary continuous speech recognition , 2005, INTERSPEECH.

[4]  H Alemayehu Is syllable weight distinction relevant for Amharic stress assignment , 1995 .

[5]  Solomon Teferra Abate,et al.  Syllable-Based Speech Recognition for Amharic , 2007, SEMITIC@ACL.

[6]  Rainer Voigt,et al.  THE CLASSIFICATION OF CENTRAL SEMITIC , 1987 .

[7]  Solange Rossato,et al.  Comparison of Syllable and Triphone Based Speech Recognition For Amharic , 2011, LTC 2011.

[8]  Louis Boves,et al.  Syllable-Length Acoustic Units in Large-Vocabulary Continuous Speech Recognition , 2005 .

[9]  Mervat Fashal,et al.  Syllable-based automatic arabic speech recognition in noisy-telephone channel , 2008 .

[10]  Mehryar Mohri,et al.  A Rational Design for a Weighted Finite-State Transducer Library , 1997, Workshop on Implementing Automata.

[11]  Andreas Stolcke,et al.  SRILM - an extensible language modeling toolkit , 2002, INTERSPEECH.

[12]  Mathias Creutz,et al.  Unsupervised Morpheme Segmentation and Morphology Induction from Text Corpora Using Morfessor 1.0 , 2005 .

[13]  Martha Yifiru Tachbelie,et al.  Morphology-based language modeling for amharic , 2010 .

[14]  Joseph Picone,et al.  Syllable-based large vocabulary continuous speech recognition , 2001, IEEE Trans. Speech Audio Process..

[15]  Solomon Teferra Abate,et al.  Part-of-Speech Tagging for Under-Resourced and Morphologically Rich Languages - The Case of Amharic , 2011 .