论文信息 - Target speech enhancement based on degenerate unmixing and estimation technique for real-world applications (Speech and audio processing and translation)

Target speech enhancement based on degenerate unmixing and estimation technique for real-world applications (Speech and audio processing and translation)

An algorithm for target speech enhancement based on degenerate unmixing and estimation technique (DUET) is described. Although the DUET can accomplish source separation only from two mixtures, the requirements of knowing the number of sources in advance and of estimating the attenuation and delay parameters for all sources prevent it from being used in real-world applications. Circumventing these requirements, the described algorithm is useful for speech enhancement where only one target speech should be extracted. Experimental results show that the algorithm provides much faster convergence of all the required parameters and noise suppression performances that are better than or comparable to the DUET with negligible distortion of the recovered speech.

Hyung-Min Park | Jin-Bum Kim

[1] H. Lane,et al. The Lombard Sign and the Role of Hearing in Speech , 1971 .

[2] Scott Rickard,et al. Blind separation of speech mixtures via time-frequency masking , 2004, IEEE Transactions on Signal Processing.

[3] Philipos C. Loizou,et al. Speech Enhancement: Theory and Practice , 2007 .