Time-frequency masking for convolutive and noisy mixtures

Time-frequency masking is a suitable tool for speech enhancement and source separation of speech signals. This paper presents an efficient method to determine the time-frequency masks for two speech signals based on two microphone signals in a noisy and reverberant environment. A two-stage processing is proposed. In the first stage two linear filters are adapted using the LMS algorithm. In the second stage the time-frequency masks are determined based on the transfer functions of the adaptive filters. The presented simulation results verify that the time-frequency masks are well approximated.