Speaker-independent Malay vowel recognition of children using multi-layer perceptron

Most of the speech recognitions are based on adult speech sounds. Less research is done in the recognition of children speech sounds. The speech of children is more dynamic and inconsistent if compared to adult's speech. This paper investigates the use of neural networks in recognizing 6 Malay vowels of Malay children in a speaker-independent manner. Multi-layer perceptron with one hidden layer was used to recognize these vowels. The multi-layer perceptron was trained and tested with speech samples of Malay children with their ages between seven and ten years old. A single frame of cepstral coefficients was extracted around the vowel onset point using linear predictive coding. The vowel length was examined from 5 ms to 70 ms. Experiments were conducted to determine the optimal vowel length as well as the number of cepstral coefficients.

[1]  Shrikanth S. Narayanan,et al.  Robust recognition of children's speech , 2003, IEEE Trans. Speech Audio Process..

[2]  Abdullah Hassan Linguistik am untuk guru bahasa Malaysia , 1980 .

[3]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[4]  R.P. Ramachandran,et al.  Isolated vowel recognition using linear predictive features and neural network classifier fusion , 2002, Proceedings of the Fifth International Conference on Information Fusion. FUSION 2002. (IEEE Cat.No.02EX5997).

[5]  M. Giurgiu On the use of neural networks for automatic vowel recognition , 1995, Proceedings IEEE Conference on Industrial Automation and Control Emerging Technology Applications.