Pseudoanechoic blind source separation with improved Wiener postfilter

The pseudoanechoic model was proposed recently to simplify the parameter estimation in blind source separation based on frequency-domain independent component analysis. In the method, after separation based in the pseudoanechoic model a time-frequency Wiener postfilter to improve the separation is applied. In this contribution, a deeper analysis of the working principles of the Wiener post- filter is presented. Furthermore, a variation of this postfilter to improve the performance using the information of previous frames is in- troduced. The improvements obtained through the new method are evaluated in an automatic speech recognition task and with the PESQ ob- jective quality measure. The results show an increased robustness and stability of the pro- posed method.

[1]  Shoko Araki,et al.  The fundamental limitation of frequency domain blind source separation for convolutive mixtures of speech , 2003, IEEE Trans. Speech Audio Process..

[2]  Yifan Gong,et al.  Speech recognition in noisy environments: A survey , 1995, Speech Commun..

[3]  P LippmannRichard Speech recognition by machines and humans , 1997 .

[4]  Richard Lippmann,et al.  Speech recognition by machines and humans , 1997, Speech Commun..

[5]  Hiroshi Sawada,et al.  A robust and precise method for solving the permutation problem of frequency-domain blind source separation , 2004, IEEE Transactions on Speech and Audio Processing.

[6]  Maurizio Omologo,et al.  Environmental conditions and acoustic transduction in hands-free speech recognition , 1998, Speech Commun..

[7]  José B. Mariño,et al.  Albayzin speech database: design of the phonetic corpus , 1993, EUROSPEECH.

[8]  Diego H. Milone,et al.  Objective quality evaluation in blind source separation for speech recognition in a real room , 2007, Signal Process..

[9]  Diego H. Milone,et al.  Perceptual evaluation of blind source separation for robust speech recognition , 2008, Signal Process..

[10]  Andrzej Cichocki,et al.  Adaptive Blind Signal and Image Processing - Learning Algorithms and Applications , 2002 .

[11]  Jacob Benesty,et al.  Speech Enhancement (Signals and Communication Technology) , 2005 .

[12]  Herman J. M. Steeneken,et al.  Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems , 1993, Speech Commun..

[13]  Diego H. Milone,et al.  Indeterminacy Free Frequency-Domain Blind Separation of Reverberant Audio Sources , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[14]  Lucas C. Parra,et al.  Convolutive blind separation of non-stationary sources , 2000, IEEE Trans. Speech Audio Process..

[15]  Jingdong Chen,et al.  Acoustic MIMO Signal Processing , 2006 .

[16]  Diego H. Milone,et al.  Introducing complexity measures in nonlinear physiological signals: application to robust speech recognition , 2004 .