A soft computing approach to improve the robustness of on-line ASR in previously unseen highly non-stationary acoustic environments

This paper presents a soft noise compensation algorithm in the feature space to improve the noise robustness of HMM-based on-line automatic speech recognition (ASR) in unknown highly non-stationary acoustic environments. Current hard computing techniques fail to track and compensate the non-stationary noises properly in previously unseen acoustic environments. The proposed soft noise compensation algorithm is based on a joint additive background noises and channel distortions compensation (JAC) technique in feature space. In this novel soft JAC (SJAC), we use an evolutionary dynamic multi-swarm particle swarm optimization (DMS-PSO)-based soft computing (SC) technique in the front-end, and a frame synchronous bias compensation technique in the back-end of the ASR, respectively, for frame adaptive modeling and compensation of the background additive noises and channel distortions in feature space that are highly non-linear and non-Gaussian. From the experimental results, we find that the proposed evolutionary DMS-PSO-based SJAC technique achieves significant improvement in recognition performance of on-line ASR compared to our previously developed baseline Bayesian on-line spectral change point detection (BOSCPD)-based SJAC technique when evaluated over the Aurora 2 speech database.

[1]  Douglas D. O'Shaughnessy,et al.  A Rapid Adaptation Algorithm for Tracking Highly Non-Stationary Noises based on Bayesian Inference for On-Line Spectral Change Point Detection , 2011, INTERSPEECH.

[2]  Lotfi A. Zadeh,et al.  Fuzzy logic, neural networks, and soft computing , 1993, CACM.

[3]  Gianluigi Mongillo,et al.  Online Learning with Hidden Markov Models , 2008, Neural Computation.

[4]  John B. Moore,et al.  On-line estimation of hidden Markov model parameters based on the Kullback-Leibler information measure , 1993, IEEE Trans. Signal Process..

[5]  山川 烈,et al.  Soft Computing , 2000, Soft Comput..

[6]  Alejandro Acero,et al.  Acoustical and environmental robustness in automatic speech recognition , 1991 .

[7]  YangQuan Chen,et al.  Fusion of soft computing and hard computing: computational structures and characteristic features , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[8]  David Pearce,et al.  The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions , 2000, INTERSPEECH.

[9]  Douglas D. O'Shaughnessy,et al.  Bayesian on-line spectral change point detection: a soft computing approach for on-line ASR , 2011, International Journal of Speech Technology.

[10]  I. Cohen,et al.  Noise estimation by minima controlled recursive averaging for robust speech enhancement , 2002, IEEE Signal Processing Letters.

[11]  Douglas D. O'Shaughnessy,et al.  A study on bias-based speech signal conditioning techniques for improving the robustness of automatic speech recognition , 2009, 2009 Canadian Conference on Electrical and Computer Engineering.

[12]  Jing J. Liang,et al.  Dynamic multi-swarm particle swarm optimizer with local search , 2005, 2005 IEEE Congress on Evolutionary Computation.

[13]  Madan M. Gupta,et al.  SOFT COMPUTING AND INTELLIGENT SYSTEMS:THEORY AND APPLICATIONS , 2008 .

[14]  Yifan Gong,et al.  A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions , 2009, Computer Speech and Language.

[15]  Russell C. Eberhart,et al.  A new optimizer using particle swarm theory , 1995, MHS'95. Proceedings of the Sixth International Symposium on Micro Machine and Human Science.

[16]  Konstantinos E. Parsopoulos,et al.  PARTICLE SWARM OPTIMIZER IN NOISY AND CONTINUOUSLY CHANGING ENVIRONMENTS , 2001 .

[17]  Irina Illina,et al.  On-line Stochastic Matching compensation for non-stationary noise , 2008, Comput. Speech Lang..

[18]  Lotfi A. Zadeh The roles of soft computing and fuzzy logic in the conception, design and deployment of intelligent systems , 1997, Proceedings of 6th International Fuzzy Systems Conference.

[19]  Jonathan Lawry,et al.  Online Learning for Fuzzy Bayesian Prediction , 2006, SMPS.

[20]  Alex Acero,et al.  Spoken Language Processing: A Guide to Theory, Algorithm and System Development , 2001 .

[21]  Ryan Turner Bayesian Change Point Detection for Satellite Fault Prediction , 2010 .

[22]  Ahmet Yardimci,et al.  Soft computing in medicine , 2009, Appl. Soft Comput..

[23]  Sid-Ahmed Selouani Speech Processing and Soft Computing , 2011, Springer Briefs in Electrical and Computer Engineering.

[24]  M. Geravanchizadeh,et al.  Asexual Reproduction-based Adaptive Quantum Particle Swarm Optimization algorithm for dual-channel speech enhancement , 2010, 2010 4th International Symposium on Communications, Control and Signal Processing (ISCCSP).

[25]  Laleh Badri Asl,et al.  Speech enhancement using sexual reproduction-based PSO , 2010, 10th International Conference on Information Science, Signal Processing and their Applications (ISSPA 2010).