Syllable-Based Speech Recognition for Amharic

Amharic is the Semitic language that has the second large number of speakers after Arabic (Hayward and Richard 1999). Its writing system is syllabic with Consonant-Vowel (CV) syllable structure. Amharic orthography has more or less a one to one correspondence with syllabic sounds. We have used this feature of Amharic to develop a CV syllable-based speech recognizer, using Hidden Markov Modeling (HMM), and achieved 90.43% word recognition accuracy.

[1]  Solomon Teferra Abate,et al.  Automatic speech recognition for Amharic , 2006 .

[2]  Björn Gambäck,et al.  A speaker independent continuous speech recognizer for Amharic , 2005, INTERSPEECH.

[3]  Martha Yifiru,et al.  Application of Amharic Speech Recognition System to Command and Control Computer: an Experiment with Microsoft Word , 2003 .

[4]  Zegaye Seifu,et al.  Hidden Markov Model Based Large Vocabulary, Speaker Independent, Continuous Amharic Speech Recognition , 2003 .

[5]  Rachod Thongprasirt,et al.  Pronunciation variation speech recognition without dictionary modification on sparse database , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[6]  Kinfe Tadesse,et al.  Sub-Word Based Amharic Word Wecognition : an Experiment using Hidden Markov Model (HMM) , 2002 .

[7]  S. Berhanu Isolated Amharic Consonant-Vowel (CV) Syllable Recognition An Experiment Using Hidden Markov Model (HMM) , 2001 .

[8]  Joseph Picone,et al.  Syllable-based large vocabulary continuous speech recognition , 2001, IEEE Trans. Speech Audio Process..

[9]  E. Vajda,et al.  Handbook of the International Phonetic Association: A Guide to the Use of the International Phonetic Alphabet , 2000 .

[10]  Yoshinori Sagisaka,et al.  Automatic generation of multiple pronunciations based on neural networks , 1999, Speech Commun..

[11]  Ronald A. Cole,et al.  Speech recognition using syllable-like units , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[12]  Steve Young,et al.  The HTK book , 1995 .

[13]  Chin-Hui Lee,et al.  Large vocabulary speech recognition using subword units , 1993, Speech Commun..

[14]  John H. L. Hansen,et al.  Discrete-Time Processing of Speech Signals , 1993 .

[15]  R. Hayward Amharic , 1992, Journal of the International Phonetic Association.

[16]  Solomon Teferra Abate,et al.  An Amharic speech corpus for large vocabulary continuous speech recognition , 2005, INTERSPEECH.

[17]  Sherif Abdou,et al.  Recent progress in Arabic broadcast news transcription at BBN , 2005, INTERSPEECH.

[18]  Wolf Leslau,et al.  Introductory grammar of Amharic , 2002 .

[19]  Steven Greenberg,et al.  Incorporating information from syllable-length time scales into automatic speech recognition , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[20]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.