Feature Extraction and Classification of Speech Signal Using Hidden Markov-Gaussian Mixture Model (HM-GMM) for Driving the Rehabilitative Aids

For the last few decade, an increase in experimental research has been carried-out to help the people with different-level of disabilities, who seek help towards the mobility or movement or transport aids. Hence, in our study, we have developed a system which has the ability to classify the human speech signal into corresponding commands for triggering a rehabilitative aid such as wheelchair for helping the people seek for a movement from one place to other. For this purpose, we have introduced a classification model which uses hidden Markov-Gaussian mixture, for classifying a speech signal especially the words for driving a wheelchair. To improve the efficiency of the proposed classification model, the classification was performed based on the features extracted from the speech signal. Ten volunteers were used in this study to validate the result and it is found to have an accuracy of about 98% to classify the word into their corresponding command. Hence, we propose this model as a good tool for helping the people with differently-disabled people.

[1]  Gerhard Rigoll,et al.  Maximum mutual information neural networks for hybrid connectionist-HMM speech recognition systems , 1994, IEEE Trans. Speech Audio Process..

[2]  Climent Nadeu,et al.  Time and frequency filtering of filter-bank energies for robust HMM speech recognition , 2000, Speech Commun..

[3]  Mark J. F. Gales,et al.  Maximum likelihood linear transformations for HMM-based speech recognition , 1998, Comput. Speech Lang..

[4]  Shefali Singh,et al.  MACHINE PERCEPTION IN BIOMEDICAL APPLICATIONS: AN INTRODUCTION AND REVIEW , 2014 .

[5]  M Congedo,et al.  A review of classification algorithms for EEG-based brain–computer interfaces , 2007, Journal of neural engineering.

[6]  R. H. Bernacki,et al.  Effects of noise on speech production: acoustic and perceptual analyses. , 1988, The Journal of the Acoustical Society of America.

[7]  Yi Hu,et al.  Speech enhancement based on wavelet thresholding the multitaper spectrum , 2004, IEEE Transactions on Speech and Audio Processing.

[8]  H Hermansky,et al.  Perceptual linear predictive (PLP) analysis of speech. , 1990, The Journal of the Acoustical Society of America.

[9]  D. N. Tibarewala,et al.  Development of a Surface EMG-Based Control System for Controlling Assistive Devices: A Study on Robotic Vehicle , 2016 .

[10]  F. Itakura Line spectrum representation of linear predictor coefficients of speech signals , 1975 .

[11]  Li Deng,et al.  HMM adaptation using vector taylor series for noisy speech recognition , 2000, INTERSPEECH.

[12]  Joseph Picone,et al.  Signal modeling techniques in speech recognition , 1993, Proc. IEEE.

[13]  P. Mermelstein Articulatory model for the study of speech production. , 1973, The Journal of the Acoustical Society of America.

[14]  Alfred Mertins,et al.  Automatic speech recognition and speech variability: A review , 2007, Speech Commun..

[15]  Farzin Deravi,et al.  A review of speech-based bimodal recognition , 2002, IEEE Trans. Multim..

[16]  James H. Martin,et al.  Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition, 2nd Edition , 2000, Prentice Hall series in artificial intelligence.

[17]  P. Lieberman Some Effects of Semantic and Grammatical Context on the Production and Perception of Speech , 1963 .

[18]  Oliver Chiu-sing Choy,et al.  An efficient MFCC extraction method in speech recognition , 2006, 2006 IEEE International Symposium on Circuits and Systems.

[19]  Mark A Gregory,et al.  A novel approach for MFCC feature extraction , 2010, 2010 4th International Conference on Signal Processing and Communication Systems.

[20]  D. N. Tibarewala,et al.  Classification of Surface Electromyogram Signals Acquired from the Forearm of a Healthy Volunteer , 2016 .

[21]  Chun-Liang Hsu,et al.  EOG-based Human-Computer Interface system development , 2010, Expert Syst. Appl..