Beamforming for moving source speech enhancement

This paper presents a new constrained subband beamforming algorithm to enhance speech signals generated by a moving source in a noisy environment. The beamformer is based on the principle of a soft constraint defined for a specified region corresponding to a known source location. The soft constraint secures the spatial-temporal passage of the desired source signal in the adaptive update of the beamforming weights and guarantees the full rank property of the matrix inverted in the update. The source of interest is modelled as a cluster of stationary point sources and source motion is accommodated by revising the point source cluster. The source modelling and its direct exploitation in the beamformer through covariance estimates are presented. An algorithm for sound source localization is used for speaker movement tracking and this information is exploited to update the spatial distribution in the source model. Evaluation in a real environment with a moving speaker shows a significant noise and hands-free interference suppression within the conventional telephone bandwidth. This is achieved with a negligible impact on speech distortion.

[1]  Zhi-Quan Luo,et al.  Robust adaptive beamforming for general-rank signal models , 2003, IEEE Trans. Signal Process..

[2]  Sven Nordholm,et al.  Design of oversampled uniform DFT filter banks with delay specification using quadratic optimization , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[3]  Satoshi Nakamura,et al.  Speech enhancement based on the subspace method , 2000, IEEE Trans. Speech Audio Process..

[4]  Karl-Dirk Kammeyer,et al.  Theoretical noise reduction limits of the generalized sidelobe canceller (GSC) for speech enhancement , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[5]  Ingvar Claesson,et al.  A calibrated subband beamforming algorithm for speech enhancement , 2002, Sensor Array and Multichannel Signal Processing Workshop Proceedings, 2002.

[6]  Sven Nordholm,et al.  Speaker localisation using the far-field SRP-PHAT in conference telephony , 2002 .

[7]  Michael S. Brandstein,et al.  Microphone Arrays - Signal Processing Techniques and Applications , 2001, Microphone Arrays.