Spoken Language Identification Based on Particle Swarm Optimisation–Extreme Learning Machine Approach

Spoken language identification (LID) is the process of determining and classifying the natural language spoken in a given content and data set. The first step is to extract useful features from the data; this step is mature, with standard LID features previously developed using MFCC, SDC, GMM and the i-vector-based framework. However, the learning process still needs optimisation in order to capture fully the knowledge embedded in the extracted features. The extreme learning machine (ELM) is an effective learning model for classification and regression analysis that trains a single-hidden-layer neural network. Its learning process is not fully optimised, however, because the weights between the input and hidden layers are selected randomly. This study employs ELM as the LID learning model, built on the standard extracted features. The enhanced self-adjusting extreme learning machine (ESA–ELM), one of the ELM optimisation techniques, is chosen as the benchmark and is improved by adopting particle swarm optimisation (PSO) as an alternative optimisation approach in place of EATLBO, with the aim of achieving higher performance. The improved ESA–ELM is named particle swarm optimisation–extreme learning machine (PSO–ELM). Results on the same benchmark LID data set of eight languages show that PSO–ELM LID outperforms ESA–ELM LID, with an accuracy of 98.75% compared with 96.25%.
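The general idea of tuning an ELM's input-to-hidden weights with PSO can be illustrated with a minimal sketch. This is not the authors' implementation: the sigmoid activation, the use of validation accuracy as the fitness function, the PSO settings (inertia 0.7, acceleration coefficients 1.5), the weight range [-1, 1], and all function names are assumptions made for illustration, and the feature extraction (MFCC, SDC, i-vectors) is not reproduced.

```python
import numpy as np

def hidden(X, W, b):
    """Sigmoid hidden-layer activations of a single-hidden-layer network."""
    return 1.0 / (1.0 + np.exp(-(X @ W + b)))

def elm_fit(X, T, W, b):
    """Standard ELM step: solve the output weights by least squares
    (Moore-Penrose pseudo-inverse) for fixed input weights W and biases b."""
    return np.linalg.pinv(hidden(X, W, b)) @ T

def accuracy(X, y, W, b, beta):
    """Fraction of samples whose arg-max output matches the label."""
    return float(np.mean(np.argmax(hidden(X, W, b) @ beta, axis=1) == y))

def pso_elm(X_tr, y_tr, X_val, y_val, n_hidden=20,
            n_particles=15, n_iter=30, seed=0):
    """Search the ELM input weights and biases with a basic PSO.
    Fitness (an assumption): accuracy on a held-out validation split."""
    rng = np.random.default_rng(seed)
    n_feat = X_tr.shape[1]
    T = np.eye(int(y_tr.max()) + 1)[y_tr]        # one-hot targets
    dim = n_feat * n_hidden + n_hidden           # weights plus biases

    def decode(p):
        # Split a flat particle into the weight matrix and bias vector.
        return p[:n_feat * n_hidden].reshape(n_feat, n_hidden), p[n_feat * n_hidden:]

    def fitness(p):
        W, b = decode(p)
        return accuracy(X_val, y_val, W, b, elm_fit(X_tr, T, W, b))

    pos = rng.uniform(-1.0, 1.0, (n_particles, dim))
    vel = np.zeros_like(pos)
    pbest, pbest_fit = pos.copy(), np.array([fitness(p) for p in pos])
    g = int(np.argmax(pbest_fit))
    gbest, gbest_fit = pbest[g].copy(), pbest_fit[g]

    w, c1, c2 = 0.7, 1.5, 1.5                    # assumed PSO coefficients
    for _ in range(n_iter):
        r1, r2 = rng.random(pos.shape), rng.random(pos.shape)
        vel = w * vel + c1 * r1 * (pbest - pos) + c2 * r2 * (gbest - pos)
        pos = np.clip(pos + vel, -1.0, 1.0)
        fit = np.array([fitness(p) for p in pos])
        improved = fit > pbest_fit               # update personal bests
        pbest[improved], pbest_fit[improved] = pos[improved], fit[improved]
        g = int(np.argmax(pbest_fit))
        if pbest_fit[g] > gbest_fit:             # update global best
            gbest, gbest_fit = pbest[g].copy(), pbest_fit[g]

    W, b = decode(gbest)
    return W, b, elm_fit(X_tr, T, W, b), float(gbest_fit)

def make_blobs(n_per_class, seed):
    """Synthetic 3-class data standing in for real LID feature vectors."""
    rng = np.random.default_rng(seed)
    centres = np.array([[0.0, 0.0], [4.0, 4.0], [-4.0, 4.0]])
    X = np.vstack([c + 0.5 * rng.standard_normal((n_per_class, 2)) for c in centres])
    y = np.repeat(np.arange(3), n_per_class)
    return X, y
```

A plain ELM corresponds to drawing `W` and `b` once at random and calling `elm_fit`; the sketch differs only in that PSO proposes many candidate `(W, b)` pairs and keeps the one with the best validation accuracy. For example, `pso_elm(*make_blobs(40, 1), *make_blobs(20, 2))` returns the selected weights, biases, output weights and the best validation accuracy on the synthetic data.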
