The effect of speech denoising algorithms on sound source localization for humanoid robots

In this paper we study the suitability and performance of four major speech denoising algorithms as weighting functions for the Generalized Cross-Correlation (GCC) and discuss the role of the microphones' frequency response on the localization estimates. An evaluation is made with the humanoid robot Nao. Moving away from simulated environments, we look at the performance of sound source localization under different in-system and ambient noise conditions. We also present a time delay of arrival (TDOA) / GCC based sound source localization method for real-world humanoid robots which was used for our experiments. This method addresses the robot's shape influence by utilizing a pre-measured set of TDOAs.

[1]  Jacob Benesty,et al.  Time Delay Estimation in Room Acoustic Environments: An Overview , 2006, EURASIP J. Adv. Signal Process..

[2]  Israel Cohen,et al.  Noise spectrum estimation in adverse environments: improved minima controlled recursive averaging , 2003, IEEE Trans. Speech Audio Process..

[3]  Michael S. Brandstein,et al.  Microphone Arrays - Signal Processing Techniques and Applications , 2001, Microphone Arrays.

[4]  Werner Verhelst,et al.  A Multi-sensor Speech Database with Applications towards Robust Speech Processing in hostile Environments , 2008, LREC.

[5]  Norbert Wiener,et al.  Extrapolation, Interpolation, and Smoothing of Stationary Time Series, with Engineering Applications , 1949 .

[6]  Ephraim Speech enhancement using a minimum mean square error short-time spectral amplitude estimator , 1984 .

[7]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .

[8]  Philipos C. Loizou,et al.  A noise-estimation algorithm for highly non-stationary environments , 2006, Speech Commun..

[9]  Daniele Salvati,et al.  IMPROVEMENT OF ACOUSTIC LOCALIZATION USING A SHORT TIME SPECTRAL ATTENUATION WITH A NOVEL SUPPRESSION RULE TEMPLATES FOR DAFX04, NAPLES, ITALY , 2009 .

[10]  I. Cohen,et al.  Noise estimation by minima controlled recursive averaging for robust speech enhancement , 2002, IEEE Signal Processing Letters.

[11]  Jean Rouat,et al.  Robust sound source localization using a microphone array on a mobile robot , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[12]  S. Boll,et al.  Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[13]  Swen Müller,et al.  Transfer-Function Measurement with Sweeps , 2001 .

[14]  Peter Vary,et al.  Noise reduction by joint maximum a posteriori spectral amplitude and phase estimation with super-Gaussian speech modelling , 2004, 2004 12th European Signal Processing Conference.