A weighted MVDR beamformer based on SVM learning for sound source localization

A WMVDR beamformer for sound source localization in a reverberant room is proposed.The weighted coefficients are modeled by a SVM classifier.The skewness measure of marginal distributions is proposed as input feature.Classify the narrowband power maps into constructively and disruptively contributing. A weighted minimum variance distortionless response (WMVDR) algorithm for near-field sound localization in a reverberant environment is presented. The steered response power computation of the WMVDR is based on a machine learning component which improves the incoherent frequency fusion of the narrowband power maps. A support vector machine (SVM) classifier is adopted to select the components of the fusion. The skewness measure of the narrowband power map marginal distribution is showed to be an effective feature for the supervised learning of the power map selection. Experiments with both simulated and real data demonstrate the improvement of the WMVDR beamformer localization accuracy with respect to other state-of-the-art techniques.

[1]  Andrzej Czyzewski,et al.  Automatic identification of sound source position employing neural networks and rough sets , 2003, Pattern Recognit. Lett..

[2]  Sharon Gannot,et al.  Relative transfer function modeling for supervised source localization , 2013, 2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[3]  Jörn Anemüller,et al.  A discriminative learning approach to probabilistic acoustic source localization , 2014, 2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC).

[4]  Henry G. Dietz,et al.  Performance of phase transform for detecting sound sources with microphone arrays in reverberant and noisy environments , 2007, Signal Process..

[5]  Ton Kalker,et al.  Voice activity detection and speaker localization using audiovisual cues , 2012, Pattern Recognit. Lett..

[6]  Marc Moonen,et al.  Reduced-Bandwidth and Distributed MWF-Based Noise Reduction Algorithms for Binaural Hearing Aids , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[7]  Hong Wang,et al.  Coherent signal-subspace processing for the detection and estimation of angles of arrival of multiple wide-band sources , 1985, IEEE Trans. Acoust. Speech Signal Process..

[8]  James A. Bucklew,et al.  Support vector machine techniques for nonlinear equalization , 2000, IEEE Trans. Signal Process..

[9]  J. Platt Sequential Minimal Optimization : A Fast Algorithm for Training Support Vector Machines , 1998 .

[10]  Ju-Hong Lee,et al.  Finite Data Performance Analysis of Mvdr Antenna Array Beamformers with Diagonal Loading , 2013 .

[11]  Hyunsoo Kim,et al.  Sound source localization for robot auditory systems , 2009, IEEE Transactions on Consumer Electronics.

[12]  Cha Zhang,et al.  Using Reverberation to Improve Range and Elevation Discrimination for Small Array Sound Source Localization , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[13]  Yun Li,et al.  A Microphone Array System for Automatic Fall Detection , 2012, IEEE Transactions on Biomedical Engineering.

[14]  M.R. Azimi-Sadjadi,et al.  Wideband DOA estimation algorithms for multiple moving sources using unattended acoustic sensors , 2008, IEEE Transactions on Aerospace and Electronic Systems.

[15]  Jacob Benesty,et al.  Performance Study of the MVDR Beamformer as a Function of the Source Incidence Angle , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[16]  Henry Cox,et al.  Robust adaptive beamforming , 2005, IEEE Trans. Acoust. Speech Signal Process..

[17]  Stefano Pantaleoni,et al.  Bone Mineral Density at Diagnosis of Celiac Disease and after 1 Year of Gluten-Free Diet , 2014, TheScientificWorldJournal.

[18]  E. Lehmann,et al.  Prediction of energy decay in room impulse responses simulated with an image-source model. , 2008, The Journal of the Acoustical Society of America.

[19]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[20]  Raffaele Parisi,et al.  WAVES: weighted average of signal subspaces for robust wideband direction finding , 2001, IEEE Trans. Signal Process..

[21]  Carlo Drioli,et al.  Frequency map selection using a RBFN-based classifier in the MVDR beamformer for speaker localization in reverberant rooms , 2015, INTERSPEECH.

[22]  Sergio Canazza,et al.  Adaptive Time Delay Estimation Using Filter Length Constraints for Source Localization in Reverberant Acoustic Environments , 2013, IEEE Signal Processing Letters.

[23]  Curt A. L. Szuberla,et al.  Three-dimensional volcano-acoustic source localization at Karymsky Volcano, Kamchatka, Russia , 2013 .

[24]  Daniele Salvati,et al.  Incident Signal Power Comparison for Localization of Concurrent Multiple Acoustic Sources , 2014, TheScientificWorldJournal.

[25]  Carlo Drioli,et al.  Incoherent Frequency Fusion for Broadband Steered Response Power Algorithms in Noisy Environments , 2014, IEEE Signal Processing Letters.

[26]  Bernhard Schölkopf,et al.  Comparing support vector machines with Gaussian kernels to radial basis function classifiers , 1997, IEEE Trans. Signal Process..

[27]  James H. McClellan,et al.  TOPS: new DOA estimator for wideband signals , 2006, IEEE Transactions on Signal Processing.

[28]  Kazuya Takeda,et al.  Binaural sound localization for untrained directions based on a Gaussian mixture model , 2008, 2008 16th European Signal Processing Conference.

[29]  J. Capon High-resolution frequency-wavenumber spectrum analysis , 1969 .

[30]  David R. Wilson,et al.  Field test of an affordable, portable, wireless microphone array for spatial monitoring of animal ecology and behaviour , 2012 .

[31]  E. Habets,et al.  Generating sensor signals in isotropic noise fields. , 2007, The Journal of the Acoustical Society of America.

[32]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .