论文信息 - A soft computing approach to improve the robustness of on-line ASR in previously unseen highly non-stationary acoustic environments

A soft computing approach to improve the robustness of on-line ASR in previously unseen highly non-stationary acoustic environments

This paper presents a soft noise compensation algorithm in the feature space to improve the noise robustness of HMM-based on-line automatic speech recognition (ASR) in unknown highly non-stationary acoustic environments. Current hard computing techniques fail to track and compensate the non-stationary noises properly in previously unseen acoustic environments. The proposed soft noise compensation algorithm is based on a joint additive background noises and channel distortions compensation (JAC) technique in feature space. In this novel soft JAC (SJAC), we use an evolutionary dynamic multi-swarm particle swarm optimization (DMS-PSO)-based soft computing (SC) technique in the front-end, and a frame synchronous bias compensation technique in the back-end of the ASR, respectively, for frame adaptive modeling and compensation of the background additive noises and channel distortions in feature space that are highly non-linear and non-Gaussian. From the experimental results, we find that the proposed evolutionary DMS-PSO-based SJAC technique achieves significant improvement in recognition performance of on-line ASR compared to our previously developed baseline Bayesian on-line spectral change point detection (BOSCPD)-based SJAC technique when evaluated over the Aurora 2 speech database.

Douglas D. O'Shaughnessy | Sid-Ahmed Selouani | Md Foezur Rahman Chowdhury

[1] Douglas D. O'Shaughnessy,et al. A Rapid Adaptation Algorithm for Tracking Highly Non-Stationary Noises based on Bayesian Inference for On-Line Spectral Change Point Detection , 2011, INTERSPEECH.

[2] Lotfi A. Zadeh,et al. Fuzzy logic, neural networks, and soft computing , 1993, CACM.

[3] Gianluigi Mongillo,et al. Online Learning with Hidden Markov Models , 2008, Neural Computation.

[4] John B. Moore,et al. On-line estimation of hidden Markov model parameters based on the Kullback-Leibler information measure , 1993, IEEE Trans. Signal Process..

[5] 山川烈,et al. Soft Computing , 2000, Soft Comput..

[6] Alejandro Acero,et al. Acoustical and environmental robustness in automatic speech recognition , 1991 .

[7] YangQuan Chen,et al. Fusion of soft computing and hard computing: computational structures and characteristic features , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[8] David Pearce,et al. The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions , 2000, INTERSPEECH.

[9] Douglas D. O'Shaughnessy,et al. Bayesian on-line spectral change point detection: a soft computing approach for on-line ASR , 2011, International Journal of Speech Technology.

[10] I. Cohen,et al. Noise estimation by minima controlled recursive averaging for robust speech enhancement , 2002, IEEE Signal Processing Letters.

[11] Douglas D. O'Shaughnessy,et al. A study on bias-based speech signal conditioning techniques for improving the robustness of automatic speech recognition , 2009, 2009 Canadian Conference on Electrical and Computer Engineering.

[12] Jing J. Liang,et al. Dynamic multi-swarm particle swarm optimizer with local search , 2005, 2005 IEEE Congress on Evolutionary Computation.

[13] Madan M. Gupta,et al. SOFT COMPUTING AND INTELLIGENT SYSTEMS:THEORY AND APPLICATIONS , 2008 .

[14] Yifan Gong,et al. A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions , 2009, Computer Speech and Language.

[15] Russell C. Eberhart,et al. A new optimizer using particle swarm theory , 1995, MHS'95. Proceedings of the Sixth International Symposium on Micro Machine and Human Science.

[16] Konstantinos E. Parsopoulos,et al. PARTICLE SWARM OPTIMIZER IN NOISY AND CONTINUOUSLY CHANGING ENVIRONMENTS , 2001 .

[17] Irina Illina,et al. On-line Stochastic Matching compensation for non-stationary noise , 2008, Comput. Speech Lang..

[18] Lotfi A. Zadeh. The roles of soft computing and fuzzy logic in the conception, design and deployment of intelligent systems , 1997, Proceedings of 6th International Fuzzy Systems Conference.

[19] Jonathan Lawry,et al. Online Learning for Fuzzy Bayesian Prediction , 2006, SMPS.

[20] Alex Acero,et al. Spoken Language Processing: A Guide to Theory, Algorithm and System Development , 2001 .

[21] Ryan Turner. Bayesian Change Point Detection for Satellite Fault Prediction , 2010 .

[22] Ahmet Yardimci,et al. Soft computing in medicine , 2009, Appl. Soft Comput..

[23] Sid-Ahmed Selouani. Speech Processing and Soft Computing , 2011, Springer Briefs in Electrical and Computer Engineering.

[24] M. Geravanchizadeh,et al. Asexual Reproduction-based Adaptive Quantum Particle Swarm Optimization algorithm for dual-channel speech enhancement , 2010, 2010 4th International Symposium on Communications, Control and Signal Processing (ISCCSP).

[25] Laleh Badri Asl,et al. Speech enhancement using sexual reproduction-based PSO , 2010, 10th International Conference on Information Science, Signal Processing and their Applications (ISSPA 2010).