A subband space constrained beamformer incorporating voice activity detection [speech enhancement applications]

This paper introduces a new subband adaptive space constrained beamforming structure for use in hands-free speech enhancement applications. The scheme incorporates a space constrained source model and voice activity information through the integration of a voice activity detector (VAD). The VAD information is used to estimate noise covariance information during non-speech periods and to optimally estimate the source power spectral density (PSD), which is used to provide a spectrally optimized constraint on the source. The proposed structure is evaluated in a real car environment, yielding results which compare well to the optimal Wiener solution where full knowledge of the source is known.

[1]  Sven Nordholm,et al.  A low complexity statistical voice activity detector with performance comparisons to ITU-T/ETSI voice activity detectors , 2003, Fourth International Conference on Information, Communications and Signal Processing, 2003 and the Fourth Pacific Rim Conference on Multimedia. Proceedings of the 2003 Joint.

[2]  Sven Nordholm,et al.  Subband generalized sidelobe canceller - a constrained region approach , 2003, 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (IEEE Cat. No.03TH8684).

[3]  Walter Kellermann A self-steering digital microphone array , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[4]  Akihiko Sugiyama,et al.  A robust adaptive beamformer for microphone arrays with a blocking matrix using constrained adaptive filters , 1999, IEEE Trans. Signal Process..

[5]  B.D. Van Veen,et al.  Beamforming: a versatile approach to spatial filtering , 1988, IEEE ASSP Magazine.

[6]  Sven Nordholm,et al.  Soft constrained subband beamforming for hands-free speech enhancement , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.