Speech enhancement using degenerate unmixing estimation technique and adaptive noise cancellation technique as a post signal processing

Speech enhancement in degenerate mixtures is a challenging task for the signal processing engineers. Degenerate Unmixing Estimation Technique (DUET) is a commonly used algorithm in speech signal separation, however there are limitations in separating the speech babble as noise and/or competing speech as noise from speech signal. These noises are important for auditory related training and testing. In this research an enhanced approach has been adopted by pairing DUET with Adaptive Noise Cancellation (ANC) which acts as the post-processor. Results from the correlation analysis and spectrogram analysis show that DUET provides separation quality correlation value of minimum 0.57 when speech signals are separated from speech babble and maximum of 0.75 when speech signal is separated from competing speech. On the contrary, both of these minimum and maximum correlation values reach to 0.77 and 0.88 when ANC is used as a postprocessor. The pairing of DUET with ANC has provided a good separation in the tested speech signals.

[1]  Bernard Widrow,et al.  Least-Mean-Square Adaptive Filters: Haykin/Least-Mean-Square Adaptive Filters , 2005 .

[2]  Zied Lachiri,et al.  Speech separation using DUET and binary masking with temporal smoothing in cepstral domain , 2013, 2013 World Congress on Computer and Information Technology (WCCIT).

[3]  Özgür Yilmaz,et al.  On the approximate W-disjoint orthogonality of speech , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4]  M. V. Van Hulle,et al.  Clustering approach to square and non-square blind source separation , 1999, Neural Networks for Signal Processing IX: Proceedings of the 1999 IEEE Signal Processing Society Workshop (Cat. No.98TH8468).

[5]  Meir Feder,et al.  Multi-channel signal separation by decorrelation , 1993, IEEE Trans. Speech Audio Process..

[6]  Lucas C. Parra,et al.  On-line Convolutive Blind Source Separation of Non-Stationary Signals , 2000, J. VLSI Signal Process..

[7]  Jean-Francois Cardoso,et al.  Blind signal separation: statistical principles , 1998, Proc. IEEE.

[8]  Hiroshi Sawada,et al.  Underdetermined blind separation for speech in real environments with sparseness and ICA , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[9]  Abrar Hussain,et al.  Single channel speech enhancement using ideal binary mask technique based on computational auditory scene analysis , 2016 .

[10]  Özgür Yõlmaz,et al.  Blind Separation of Speech Mixtures via , 2004 .

[11]  Dorothea Kolossa,et al.  Nonlinear Postprocessing for Blind Speech Separation , 2004, ICA.

[12]  Pierre Comon,et al.  Blind channel identification and extraction of more sources than sensors , 1998, Optics & Photonics.

[13]  Lucas C. Parra,et al.  Convolutive blind separation of non-stationary sources , 2000, IEEE Trans. Speech Audio Process..

[14]  Petre Stoica,et al.  Source separation: A TITO system identification approach , 1999, Signal Process..

[15]  Terrence J. Sejnowski,et al.  An Information-Maximization Approach to Blind Separation and Blind Deconvolution , 1995, Neural Computation.