Malay speech recognition in normal and noise condition

Automatic speech recognition (ASR) is an area of research which deals with the recognition of speech by machine in several conditions. ASR performs well under restricted conditions (quiet environment), but performance degrades in noisy environments. This paper presents a simple experiment by using famous feature extraction method (LPC, LPCC and WLPCC) and simple kNN classifier to investigate the sensitivity of Malay speech digits to noise by adding 5dB white Gaussian noise. There are four steps to design and develop the Malay speech digits recognition system. They are Digit syllable structure and Malay speech corpus, end-point detection processing, feature extraction and classification method. The highest average recognition rates for Malays digits recognition is 96.22% that the feature vectors were derived from LPCC. The objective of this paper is to shown the occurrence of noise during Malay speech recognition.

[1]  Hua Nong Ting,et al.  Speaker-independent Malay isolated sounds recognition , 2002, Proceedings of the 9th International Conference on Neural Information Processing, 2002. ICONIP '02..

[2]  Juergen Luettin,et al.  Audio-Visual Automatic Speech Recognition: An Overview , 2004 .

[3]  Sazali Yaacob,et al.  Classification of Speech Dysfluencies Using LPC Based Parameterization Techniques , 2012, Journal of Medical Systems.

[4]  Syed Abdul Rahman Al-Haddad,et al.  Robust Speech Recognition Using Fusion Techniques and Adaptive Filtering , 2009 .

[5]  A. Hussain,et al.  Decision Fusion for Isolated Malay Digit Recognition Using Dynamic Time Warping (DTW) and Hidden Markov Model (HMM) , 2007, 2007 5th Student Conference on Research and Development.

[6]  Ting Hua Nong,et al.  Classification of Malay speech sounds based on place of articulation and voicing using neural networks , 2001, Proceedings of IEEE Region 10 International Conference on Electrical and Electronic Technology. TENCON 2001 (Cat. No.01CH37239).

[7]  A.M. Ahmad,et al.  Malay language text-independent speaker verification using NN-MLP classifier with MFCC , 2008, 2008 International Conference on Electronic Design.

[8]  Haesun Park,et al.  A comparison of generalized linear discriminant analysis algorithms , 2008, Pattern Recognit..

[9]  M. Hariharan,et al.  A review: Malay speech recognition and audio visual speech recognition , 2012, 2012 International Conference on Biomedical Engineering (ICoBE).

[10]  Sh-Hussain Salleh,et al.  Malay speaker recognition system based on discrete HMM , 2005, 2005 1st International Conference on Computers, Communications, & Signal Processing with Special Track on Biomedical Engineering.

[11]  Hua Nong Ting,et al.  Speaker-independent Malay vowel recognition of children using multi-layer perceptron , 2004, 2004 IEEE Region 10 Conference TENCON 2004..

[12]  Dzulkifli Mohamad,et al.  Segmentation of Malay Syllables in Connected Digit Speech Using Statistical Approach , 2007 .

[13]  F. Rosdi,et al.  Isolated malay speech recognition using Hidden Markov Models , 2008, 2008 International Conference on Computer and Communication Engineering.

[14]  Goh Kia Eng,et al.  Applying Hybrid Neural Network For Malay Syllables Speech Recognition , 2005, TENCON 2005 - 2005 IEEE Region 10 Conference.

[15]  C. Benoît,et al.  28. The Intrinsic Bimodality of Speech Communication and the Synthesis of Talking Faces , 2000 .

[16]  Nordin Abu Bakar,et al.  An evaluation of endpoint detection measures for malay speech recognition of an isolated words , 2010, 2010 International Symposium on Information Technology.