We propose a new method for estimating the directions of arrival (DOAs) of sound sources in both azimuth and elevation using two directional microphones. The method adopts the weighted Wiener gain (WWG) for DOA estimation. WWG is an estimate of the Wiener gain that we proposed for use in automatic gain control to enhance speech degraded by additive noise. The angular resolution of WWG arises from the spectral subtraction (SS)-based noise reduction involved in its calculation, which enhances the signal from the look direction while suppressing signals from other directions. Because WWG involves two-channel SS, which can deal with instantaneous noise, noise sources need not be stationary, as they must be with ordinary single-channel SS. We further propose using a pair of directional microphones whose front directions are arranged in rotational symmetry. The time difference and amplitude difference between the two channel signals provided by the microphones are used to obtain a two-dimensional resolution of DOA. We evaluated the proposed method through computer simulations and compared it with three DOA estimation methods: one based on the cross-correlation function, and two popular high-resolution methods, multiple signal classification (MUSIC) and the minimum variance method. Evaluation of the source detection rate and estimation accuracy shows that our method is clearly superior to the other methods in conditions where multiple speech sources exist.
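As a point of reference for the time-difference cue mentioned above, the following is a minimal sketch of the cross-correlation baseline, not of the WWG method itself. It assumes two equal-rate channel signals held in NumPy arrays; the function name `estimate_delay` and its parameters are hypothetical and chosen only for illustration.

```python
import numpy as np


def estimate_delay(x_left, x_right, fs):
    """Estimate the delay (in seconds) of x_right relative to x_left.

    A positive result means x_right lags x_left. This is a plain
    cross-correlation peak search (an illustrative baseline), with no
    weighting, interpolation, or noise handling.
    """
    x_left = np.asarray(x_left, dtype=float)
    x_right = np.asarray(x_right, dtype=float)
    # Full cross-correlation: cc[i] corresponds to lag lags[i] in samples.
    cc = np.correlate(x_right, x_left, mode="full")
    lags = np.arange(-(len(x_left) - 1), len(x_right))
    # The lag with the largest correlation is taken as the delay estimate.
    return lags[np.argmax(cc)] / fs


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    fs = 16000                      # assumed sampling rate in Hz
    source = rng.standard_normal(1024)
    shift = 5                       # true delay in samples
    delayed = np.concatenate([np.zeros(shift), source[:-shift]])
    print(estimate_delay(source, delayed, fs) * fs)
```

A single delay of this kind constrains only one angle, which is consistent with the abstract's point that the amplitude difference from the directional microphones is needed as a second cue to resolve both azimuth and elevation.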