Jawi Character Speech-to-Text Engine Using Linear Predictive and Neural Network for Effective Reading

Jawi is an old version of Malay Language Writing that need to be preserved. Therefore, it is important to develop tools for teaching kids about Jawi characters and Speech-To-Text (STT) application can serve this purpose well. Unlike English, Jawi uses special characters similar to Arabic Characters. However, its pronunciations are in Malay Language. This uniqueness makes STT development a challenging task. In this paper, we investigate the applicability of Linear Predictive Coding to extract important features from voice signal and Neural Network with Backpropagation to classify and recognize spoken words into Jawi Characters. A total of 225 samples of words in Jawi Characters are recorded from speakers with over 95% accuracy. Jawi Characters Speech-To-Text Engine aims to help students to read Jawi document accurately and independently without the need for close monitoring from parents or teachers.

[1]  Dewan Bahasa dan Pustaka Daftar ejaan rumi bahasa Malaysia , 1984 .

[2]  C.M.D. Acevedo,et al.  Integrated System Approach for the Automatic Speech Recognition using Linear predict Coding and Neural Networks , 2007, Electronics, Robotics and Automotive Mechanics Conference (CERMA 2007).

[3]  M. Fikri,et al.  Speaker independent isolated Arabic word recognition system , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4]  Tarek Helmy,et al.  Multi-agent Based Arabic Speech Recognition , 2007, 2007 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Workshops.

[5]  Tarek Helmy,et al.  Agent Based Arabic Language Understanding , 2007, 2007 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Workshops.

[6]  Salina Abdul Samad,et al.  Pitch detection of speech signals using the cross-correlation technique , 2000, 2000 TENCON Proceedings. Intelligent Systems and Technologies for the New Millennium (Cat. No.00CH37119).

[7]  Hwang Soo Lee,et al.  LPC analysis incorporating spectral interpolation for speech coding , 1999 .