Paper information - An effective DOA estimation by exploring the spatial sparse representation of the inter-sensor data ratio model

This paper investigates speaker direction-of-arrival (DOA) estimation using a single acoustic vector sensor (AVS). By defining the inter-sensor data ratio (ISDR) in the time-frequency (TF) domain and using only TF points with a high local signal-to-noise ratio (HLSNR), an effective ISDR data model is derived that relates the ISDR to the AVS manifold vector. Using a spatial sparse representation of the ISDR data, DOA estimation is formulated as recovering a sparse matrix and locating the peak of the power spectrum of the reconstructed sparse matrix. Preliminary experimental results from simulations and real AVS recordings show that the proposed method achieves high elevation and azimuth estimation accuracy at all angles when the SNR is above 10 dB, while avoiding the spatial aliasing problem and suppressing the adverse impact of room reverberation. The proposed method is expected to find wide application in portable devices owing to its compact physical size and superior performance.
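The grid-and-peak idea in the abstract can be illustrated with a minimal sketch. It is not the paper's algorithm: the sparse matrix recovery is replaced here by a plain matched-atom grid search over candidate directions, and the ISDR values are assumed to be real-valued noisy copies of the direction cosines at the HLSNR TF points. The function names `avs_direction` and `estimate_doa`, the 1-degree grid, and the `(3, K)` input layout are all hypothetical choices made for illustration.

```python
import numpy as np


def avs_direction(theta, phi):
    """Direction cosines [u_x, u_y, u_z] for elevation theta and azimuth phi (radians)."""
    return np.array([
        np.sin(theta) * np.cos(phi),
        np.sin(theta) * np.sin(phi),
        np.cos(theta),
    ])


def estimate_doa(isdr, step_deg=1.0):
    """Return (elevation, azimuth) in degrees from a (3, K) array of ISDR values.

    Assumption: each column of `isdr` is one HLSNR TF point and is roughly the
    direction-cosine vector of the source plus noise. The sparse recovery step
    is replaced by a simple correlation against a dictionary of candidate
    directions, followed by a peak search.
    """
    thetas = np.deg2rad(np.arange(0.0, 180.0 + step_deg, step_deg))
    phis = np.deg2rad(np.arange(0.0, 360.0, step_deg))
    tt, pp = np.meshgrid(thetas, phis, indexing="ij")

    # Dictionary of unit-norm atoms, one column per candidate (theta, phi) pair.
    dictionary = np.stack([
        np.sin(tt) * np.cos(pp),
        np.sin(tt) * np.sin(pp),
        np.cos(tt),
    ]).reshape(3, -1)

    # Correlation of every atom with every TF point, summed over TF points.
    spectrum = (dictionary.T @ isdr).sum(axis=1)

    best = int(np.argmax(spectrum))
    return np.rad2deg(tt.ravel()[best]), np.rad2deg(pp.ravel()[best])
```

Because every atom has unit norm, the peak of this correlation spectrum falls on the grid direction closest to the mean ISDR vector, which makes this a one-source stand-in for locating the peak of the reconstructed sparse spectrum. A real implementation would also need the TF-point selection and the sparse solver described in the paper.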

Jiangtao Xi | Christian Ritz | Yifan Guo | Weiqiao Zheng | Yue Xian Zou
