Speech recognition using ANN and predator-influenced civilized swarm optimization algorithm

This paper proposes a hybrid optimization technique, predator-influenced civilized swarm optimization, by integrating civilized swarm optimization (CSO) and predator–prey optimization (PPO) techniques. CSO is the integration of the attributes of particle swarm optimization and a society civilization algorithm (SCA). In the SCA, the swarm is divided into a few societies, and each society has its own society leader (SL); other individuals of the society are termed society members. The combination of all such societies forms a civilization, and the best-performing SL becomes the civilization leader (CL). In CSO, SLs and members update their positions through the guidance of their own CL and SLs, respectively, along with their best positions. In the proposed technique, the PPO technique is integrated with CSO, in which a predator particle is included in the swarm. The predator always tries to chase the CL in a controlled manner, which maintains diversity in the population and avoids local optimum solutions. The proposed optimization technique is applied to optimize the weights and biases of an artificial neural network (ANN) trained for speech recognition. Two databases have been used; one is a TI-46 isolated word database in clean and noisy conditions, and the other is a self-recorded Hindi numeral database. To evaluate the performance of the proposed technique, 2 performance criteria, correlation coefficient and mean square error, are applied. The results obtained by an ANN with the proposed technique outperform the results obtained by an ANN trained with particle swarm optimization, PPO, CSO, and backpropagation techniques in terms of correlation coefficient and mean square error.

[1]  Marley M. B. R. Vellasco,et al.  A comparison of different spectral analysis models for speech recognition using neural networks , 1996, Proceedings of the 39th Midwest Symposium on Circuits and Systems.

[2]  Prasant Kumar Pattnaik,et al.  Artificial Neural Network trained by Particle Swarm Optimization for non-linear channel equalization , 2014, Expert Syst. Appl..

[3]  Chin-Hui Lee,et al.  Automatic recognition of keywords in unconstrained speech using hidden Markov models , 1990, IEEE Trans. Acoust. Speech Signal Process..

[4]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[5]  Chorkin Chan,et al.  Isolated Word Recognition by Neural Network Models with Cross-Correlation Coefficients for Speech Dynamics , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  János D. Pintér,et al.  Calibrating artificial neural networks by global optimization , 2012, Expert Syst. Appl..

[7]  Tapabrata Ray,et al.  Society and civilization: An optimization algorithm based on the simulation of social behavior , 2003, IEEE Trans. Evol. Comput..

[8]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[9]  Zhihong Man,et al.  Robust Single-Hidden Layer Feedforward Network-Based Pattern Classifier , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[10]  Dianhui Wang,et al.  Global Convergence of Online BP Training With Dynamic Learning Rate , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[11]  Leszek Rutkowski,et al.  A fast training algorithm for neural networks , 1998 .

[12]  James Kennedy,et al.  Particle swarm optimization , 2002, Proceedings of ICNN'95 - International Conference on Neural Networks.

[13]  A. Immanuel Selvakumar,et al.  Optimization using civilized swarm: Solution to economic dispatch with multiple minima , 2009 .

[14]  Omar Farooq,et al.  Mel filter-like admissible wavelet packet structure for speech recognition , 2001, IEEE Signal Processing Letters.

[15]  Nor Ashidi Mat Isa,et al.  Modified Recursive Least Squares algorithm to train the Hybrid Multilayered Perceptron (HMLP) network , 2010, Appl. Soft Comput..

[16]  Ernesto Costa,et al.  An Empirical Comparison of Particle Swarm and Predator Prey Optimisation , 2002, AICS.

[17]  Yoshua Bengio,et al.  Global optimization of a neural network-hidden Markov model hybrid , 1992, IEEE Trans. Neural Networks.

[18]  A. P. Kabilan,et al.  Speech Recognition System Based On Phonemes Using Neural Networks , 2009 .

[19]  L. Lebensztajn,et al.  Multiobjective Biogeography-Based Optimization Based on Predator-Prey Approach , 2012, IEEE Transactions on Magnetics.

[20]  P. Babu Anto,et al.  Speech Recognition of Isolated Malayalam Words Using Wavelet Features and Artificial Neural Network , 2008, 4th IEEE International Symposium on Electronic Design, Test and Applications (delta 2008).

[21]  Murat Hüsnü Sazli,et al.  Speech recognition with artificial neural networks , 2010, Digit. Signal Process..

[22]  Ziwu Ren,et al.  An Improved Teaching-Learning-Based Optimization , 2018, 2018 37th Chinese Control Conference (CCC).