An Ensemble Learning-Based Bangla Phoneme Recognition System Using LPCC-2 Features

An array of devices has emerged lately to ease daily life, but designing a simple user interface (UI) for such devices remains a concern. A speech-based UI can be a solution, since speech is one of the most spontaneous and natural modes of interaction for most people. The process of identifying words and phrases from voice signals is known as speech recognition. Every language encompasses a unique set of atomic sounds termed phonemes, and it is these sounds that constitute the vocabulary of that language. Speech recognition in Bangla is a somewhat complicated task, mostly due to the presence of compound characters. In this paper, a Bangla phoneme recognition system is proposed to aid the development of a Bangla speech recognizer, using a new Linear Predictive Cepstral Coefficient (LPCC)-based feature named LPCC-2. The system has been tested on a data set of 3710 Bangla Swarabarna (vowel) phonemes, and an accuracy of 99.06% has been obtained using ensemble learning.
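For readers unfamiliar with LPCC features, the following is a minimal sketch of how conventional LPCCs can be computed from one speech frame. It is not the LPCC-2 feature proposed here, whose definition is not given in this abstract. The function names, the autocorrelation method, and the parameters `order` and `n_ceps` are illustrative assumptions.

```python
# Sketch of standard LPCC extraction for one frame (NOT the paper's LPCC-2).
# All names and parameter choices here are hypothetical.

def autocorrelation(frame, max_lag):
    """Return r[0..max_lag], where r[k] = sum over n of x[n] * x[n-k]."""
    n = len(frame)
    return [sum(frame[i] * frame[i - k] for i in range(k, n))
            for k in range(max_lag + 1)]

def levinson_durbin(r, order):
    """Solve the LPC normal equations for the predictor
    x[n] ~ sum_k a[k] * x[n-k]. Returns (a[1..order], residual energy)."""
    a = [0.0] * (order + 1)
    err = r[0]
    if err <= 0.0:  # silent frame: no meaningful predictor
        return a[1:], 0.0
    for i in range(1, order + 1):
        acc = r[i] - sum(a[j] * r[i - j] for j in range(1, i))
        k = acc / err  # reflection coefficient
        new_a = a[:]
        new_a[i] = k
        for j in range(1, i):
            new_a[j] = a[j] - k * a[i - j]
        a = new_a
        err *= (1.0 - k * k)
        if err <= 0.0:  # perfectly predictable frame: stop early
            break
    return a[1:], err

def lpc_to_cepstrum(a, n_ceps):
    """Convert LPC coefficients to cepstral coefficients c[1..n_ceps]
    using the standard recursion for the all-pole model."""
    p = len(a)
    c = [0.0] * (n_ceps + 1)  # index 0 unused (gain term omitted)
    for n in range(1, n_ceps + 1):
        acc = a[n - 1] if n <= p else 0.0
        for k in range(max(1, n - p), n):
            acc += (k / n) * c[k] * a[n - k - 1]
        c[n] = acc
    return c[1:]

def lpcc(frame, order=12, n_ceps=12):
    """LPCC vector for one (already windowed) frame of samples."""
    r = autocorrelation(frame, order)
    a, _ = levinson_durbin(r, order)
    return lpc_to_cepstrum(a, n_ceps)
```

In a full recognizer, one such vector per frame would typically be aggregated per phoneme and passed to a classifier; those stages are outside the scope of this sketch.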
