Refinement of HMM Model Parameters for Punjabi Automatic Speech Recognition (PASR) System

ABSTRACT An automatic speech recognition system follows an approach of pattern matching, which consists of a training phase and testing phase. Despite advancement in training phase, the performance of the acoustic model is adverse while adopting the statistical technique like hidden Markov model (HMM). However, HMM-based speech system faces high computational complexity and becomes challenging to provide accuracy during isolated Punjabi lexicon. As the corpus of the system increases, the complexity of training phase will also increase drastically. The redundancy and confusion occurred between feature distributions in training phase of the system. This paper proposes an approach for the generation of HMM parameters using two hybrid classifiers such as GA+HMM and DE+HMM. The proposed technique focuses on refinement of processed feature vectors after calculating its mean and variance. The refined parameters are further employed in the generation of HMM parameters that help in reduction of training complexity of the system. The proposed techniques are compared with an existing technique such as HMM on benchmark database and self-developed corpus in clean, noisy, and real-time environments. The results show the performance improvement in pattern matching of spoken utterance when demonstrated on large vocabulary isolated Punjabi lexicons.

[1]  Virender Kadyan,et al.  Punjabi Automatic Speech Recognition Using HTK , 2012 .

[2]  Sam Kwong,et al.  Optimization of HMM by a genetic algorithm , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  Włodzimierz Kasprzak,et al.  An Evolutionary Programming Based Algorithm for HMM training , 2008 .

[4]  Rakesh Sharma,et al.  Development of Phonetic Engine for Punjabi Language , 2014 .

[5]  Virender Kadyan,et al.  Punjabi speech to text system for connected words , 2012 .

[6]  Ge Bai,et al.  A Novel Genetic Algorithm Based on Tabu Search for HMM Optimization , 2008, 2008 Fourth International Conference on Natural Computation.

[7]  Pavel Paramonov,et al.  Simplified scoring methods for HMM-based speech recognition , 2014, 2014 International Conference on Soft Computing and Machine Intelligence.

[8]  Surinder Dhanjal,et al.  Development of a standard text and speech corpus for the Punjabi language , 2013, 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE).

[9]  Martin Heckmann,et al.  Incremental word learning: Efficient HMM initialization and large margin discriminative adaptation , 2012, Speech Commun..

[10]  Shing-Tai Pan,et al.  Performances comparison between Improved DHMM and Gaussian Mixture HMM for speech recognition , 2011, 2011 4th International Congress on Image and Signal Processing.

[11]  AggarwalRajesh Kumar,et al.  Acoustic modeling problem for automatic speech recognition system , 2011 .

[12]  Mohammad-R Akbarzadeh-T,et al.  Hybrid of Chaos Optimization and Baum-Welch algorithms for HMM training in Continuous speech recognition , 2010, 2010 International Conference on Intelligent Control and Information Processing.

[13]  Tieli Sun,et al.  Comparison of Particle Swarm Optimization and Genetic Algorithm for HMM training , 2008, 2008 19th International Conference on Pattern Recognition.

[14]  Xueying Zhang,et al.  A Hybrid Speech Recognition Training Method for HMM Based on Genetic Algorithm and Baum Welch Algorithm , 2007, Second International Conference on Innovative Computing, Informatio and Control (ICICIC 2007).

[15]  Kim-Fung Man,et al.  Optimisation of HMM topology and its model parameters by genetic algorithms , 2001, Pattern Recognit..

[16]  Andrew J. Viterbi,et al.  Error bounds for convolutional codes and an asymptotically optimum decoding algorithm , 1967, IEEE Trans. Inf. Theory.

[17]  Hossein Sameti,et al.  A novel approach to HMM-based speech recognition system using particle swarm optimization , 2009, 2009 Fourth International on Conference on Bio-Inspired Computing.

[18]  Analysis of Punjabi tonemes , 2015, 2015 2nd International Conference on Computing for Sustainable Global Development (INDIACom).

[19]  Mitsuo Gen,et al.  Genetic algorithms and engineering optimization , 1999 .

[20]  JinFu Feng,et al.  Covariance and crossover matrix guided differential evolution for global numerical optimization , 2016, SpringerPlus.

[21]  Carlos Martín-Vide,et al.  Special Issue on Second International Conference on the Theory and Practice of Natural Computing, TPNC 2013 , 2016, Soft Comput..

[22]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[23]  Santiago Omar Caballero Morales,et al.  GA Approaches to HMM Optimization for Automatic Speech Recognition , 2012, MCPR.

[24]  Gurpreet Singh Josan,et al.  A Punjabi To Hindi Machine Translation System , 2008, COLING.

[25]  Chandan Singh,et al.  A Gurmukhi script recognition system , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[26]  D. Karaboga,et al.  A Simple and Global Optimization Algorithm for Engineering Problems: Differential Evolution Algorithm , 2004 .

[27]  Swaran Lata,et al.  Laryngeal tonal characteristics of Punjabi — An experimental study , 2013, 2013 International Conference on Human Computer Interactions (ICHCI).

[28]  Mayank Dave,et al.  Filterbank optimization for robust ASR using GA and PSO , 2012, Int. J. Speech Technol..

[29]  N. Thatphithakkul,et al.  HMM parameter optimization using tabu search [speech recognition] , 2004, IEEE International Symposium on Communications and Information Technology, 2004. ISCIT 2004..

[30]  Sruti Sruba Bharali,et al.  A comparative study of different features for isolated spoken word recognition using HMM with reference to Assamese language , 2015, Int. J. Speech Technol..

[31]  Louis Boves,et al.  Unsupervised detection of words questioning the relevance of segmentation , 2008 .

[32]  Kumar Ravinder,et al.  Comparison of HMM and DTW for Isolated Word Recognition System of Punjabi Language , 2010, CIARP.

[33]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[34]  Mayank Dave,et al.  Acoustic modeling problem for automatic speech recognition system: advances and refinements (Part II) , 2011, Int. J. Speech Technol..