Fuzzy clustering optimized with genetic algorithms: Application for hybrid speech recognition system

In this paper, we report experimental results of hybrid system using Hidden Markov Models/Multi-Layer Perceptron (HMM/MLP) model as acoustic model and based on the Fuzzy C-Means (FCM) clustering with optimization with Genetic Algorithm (GA). In this context, we use the result of FCM clustering as initial population of GA, this allows training the GA with a population of empirically generated chromosomes and not randomly initialized. Our results on speech recognition tasks show an increase in the estimates of the posterior probabilities of the correct words after training. We demonstrate the effectiveness of the proposed clustering approach in large-vocabulary speaker-independent continuous speech recognition with regard to the three baseline systems : Discrete HMM, hybrid HMM/MLP with K-Means and FCM clustering.

[1]  Hervé Bourlard,et al.  Task independent and dependent training: performance comparison of HMM and hybrid HMM/MLP approaches , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[2]  Hervé Bourlard,et al.  Subband-based speech recognition , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  Dilip Kumar Pratihar,et al.  A Comparative Study of Fuzzy C-Means Algorithm and Entropy-Based Fuzzy Clustering Algorithms , 2011, Comput. Informatics.

[4]  Hervé Bourlard,et al.  MAP combination of multi-stream HMM or HMM/ANN experts , 2001, INTERSPEECH.

[5]  Hynek Hermansky,et al.  RASTA processing of speech , 1994, IEEE Trans. Speech Audio Process..

[6]  Krishna Kumar Sharma,et al.  Performance Comparison of Hard and Soft Approaches for Document Clustering , 2012 .

[7]  Georg Heigold,et al.  Discriminative HMMS, log-linear models, and CRFS: What is the difference? , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[8]  Marco Gori,et al.  A survey of hybrid ANN/HMM models for automatic speech recognition , 2001, Neurocomputing.

[9]  H. Timm,et al.  Fuzzy cluster analysis of classified data , 2001, Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569).

[10]  Mokhtar Sellami,et al.  Hybrid HMM-MLP System based on Fuzzy Logic for Arabic Speech Recognition , 2003, PRIS.