ICA based single microphone Blind Speech Separation technique using non-linear estimation of speech

In this paper, a Blind Speech Separation (BSS) technique is introduced based on Independent Component Analysis (ICA) for underdetermined single microphone case. In general, ICA uses noisy speech from at least two microphones to separate speech and noise. But ICA fails to separate when only one stream of noisy speech is available. We use Log Spectral Magnitude Estimator based on Minimum Mean Square Error (LogMMSE) as a non-linear estimation technique to estimate the speech spectrum, which is used as the other input to ICA, with the noisy speech. The proposed method was tested for machinery, babble and traffic noise types mixed with speech at Signal to Noise Ratios (SNRs) of −5 dB, 0 dB and 5 dB. Objective and subjective results show high quality and intelligibility in the separated speech using the developed method.

[1]  A. Kraskov,et al.  Estimating mutual information. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[2]  R. Balan,et al.  Independent component analysis based single channel speech enhancement , 2003, Proceedings of the 3rd IEEE International Symposium on Signal Processing and Information Technology (IEEE Cat. No.03EX795).

[3]  Xiaofeng Gong,et al.  An improved BLUES with adaptive threshold of condition number for separating underdetermined speech mixtures , 2012, 2012 Third International Conference on Intelligent Control and Information Processing.

[4]  Christian Jutten,et al.  Real-time independent vector analysis with Student's t source prior for convolutive speech mixtures , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[5]  Terrence J. Sejnowski,et al.  Learning Nonlinear Overcomplete Representations for Efficient Coding , 1997, NIPS.

[6]  Yi Hu,et al.  A Comparative Intelligibility Study of Speech Enhancement Algorithms , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[7]  Tuomas Virtanen,et al.  Sound Source Separation Using Sparse Coding with Temporal Continuity Objective , 2003, ICMC.

[8]  Yi Hu,et al.  Subjective Comparison of Speech Enhancement Algorithms , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[9]  Issa M. S. Panahi,et al.  Two microphones spectral-coherence based speech enhancement for hearing aids using smartphone as an assistive device , 2016, 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[10]  Pierre Comon,et al.  How fast is FastICA? , 2006, 2006 14th European Signal Processing Conference.

[11]  Aapo Hyvärinen,et al.  Fast and robust fixed-point algorithms for independent component analysis , 1999, IEEE Trans. Neural Networks.

[12]  Philipos C. Loizou,et al.  Speech Enhancement: Theory and Practice , 2007 .

[13]  Hiroshi Sawada,et al.  Underdetermined blind speech separation with directivity pattern based continuous mask and ICA , 2004, 2004 12th European Signal Processing Conference.

[14]  T. Ens,et al.  Blind signal separation : statistical principles , 1998 .