Bangla Numeral Recognition from Speech Signal Using Convolutional Neural Network

Speech recognition is a process where an acoustic signal is converted to text or words or commands and recognizing the speech. In this paper, a Bangla numeral recognition system from the speech signal is developed utilizing Convolutional Neural Network (CNN). In the proposed system, a speech dataset of ten isolated Bangla digits has been developed consists of 6000 utterances (5 utterances for every 120 speakers) and a feature extraction procedure is performed to elicit significant features from the speech signals using Mel Frequency Cepstrum Coefficient (MFCC) analysis. Then, CNN is trained with the features of the speech signal as input. The efficiency of the proposed system is tested on the dataset developed for this purpose, and acquire 93.65% recognition accuracy. The proposed system is also compared with other existing methods of Bangla numeral speech recognition and outperforms most of the existing systems and proves the superiority of itself.

[1]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[2]  Anup Kumar Paul,et al.  Bangla Speech Recognition System Using LPC and ANN , 2009, 2009 Seventh International Conference on Advances in Pattern Recognition.

[3]  Rashedur M. Rahman,et al.  Bangla Isolated Word Speech Recognition , 2018, ICEIS.

[4]  Mumit Khan,et al.  Isolated and continuous bangla speech recognition: implementation, performance and application perspective , 2007 .

[5]  Ratnadeep R. Deshmukh,et al.  CONTINUOUS SPEECH RECOGNITION SYSTEM: A REVIEW , 2014 .

[6]  Ghulam Muhammad,et al.  Bangla phoneme recognition using hybrid features , 2010, International Conference on Electrical & Computer Engineering (ICECE 2010).

[7]  Mohammad Nuruzzaman Bhuiyan,et al.  Automatic Speech Recognition Technique for Bangla Words , 2013 .

[8]  Ghulam Muhammad,et al.  Automatic speech recognition for Bangla digits , 2009, 2009 12th International Conference on Computers and Information Technology.

[9]  Md. Mijanur Rahman,et al.  Implementation Of Back-Propagation Neural Network For Isolated Bangla Speech Recognition , 2013, ArXiv.

[10]  Md Saiful Islam,et al.  A noble approach for recognizing Bangla real number automatically using CMU Sphinx4 , 2016, 2016 5th International Conference on Informatics, Electronics and Vision (ICIEV).

[11]  Nidhi Desai Review on Speech Recognition with Deep Learning Methods , 2015 .

[12]  Valeri Mladenov,et al.  Neural networks used for speech recognition , 2010 .

[13]  Khalil Ahammad,et al.  Connected Bangla Speech Recognition using Artificial Neural Network , 2016 .

[14]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[15]  Tufail Muhammad,et al.  ARTIFICIAL NEURAL NETWORK-BASED SPEECH RECOGNITION USING DWT ANALYSIS APPLIED ON ISOLATED WORDS FROM ORIENTAL LANGUAGES , 2015 .