DOUBLETALK DETECTION USING REAL TIME RECURRENT LEARNING

In this paper we present a new system for doubletalk detection that uses multiple signal detectors/discriminators based on recurrent networks. The goal is to build a simple system that learns to combine information from different signal sources to make robust decisions even under changing noise conditions. In this paper we use three detectors - two of these are frequency domain signal detectors, one at the far-end and one at the microphone channel. The third detector determines the relative level of nearend speech vs. far-end echo in the microphone signal. The new doubletalk detector combines information from all these detectors to make its decision. An important part of this proposed design is that the features used by these detectors can be easily tracked online in the presence of noise. We compare our results with cross-correlation based doubletalk detectors to show its effectiveness.

[1]  Hua Ye,et al.  A new double-talk detection algorithm based on the orthogonality theorem , 1991, IEEE Trans. Commun..

[2]  Christopher M. Bishop,et al.  Neural networks for pattern recognition , 1995 .

[3]  Ronald J. Williams,et al.  Experimental Analysis of the Real-time Recurrent Learning Algorithm , 1989 .

[4]  Rainer Martin,et al.  Spectral Subtraction Based on Minimum Statistics , 2001 .

[5]  Jacob Benesty,et al.  An objective technique for evaluating doubletalk detectors in acoustic echo cancelers , 1999, IEEE Trans. Speech Audio Process..

[6]  Israel Cohen,et al.  Speech enhancement for non-stationary noise environments , 2001, Signal Process..

[7]  Somsak Sukittanon,et al.  Logistic discriminative speech detectors using posterior SNR , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[8]  S. Haykin,et al.  Adaptive Filter Theory , 1986 .

[9]  Jacob Benesty,et al.  A new class of doubletalk detectors based on cross-correlation , 2000, IEEE Trans. Speech Audio Process..

[10]  Ross Cutler The distributed meetings system , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..