TRAMP: Tracking by a Real-time AMbisonic-based Particle filter

This article presents a multiple sound source localization and tracking system, fed by the Eigenmike array. The First Order Ambisonics (FOA) format is used to build a pseudointensity-based spherical histogram, from which the source position estimates are deduced. These instantaneous estimates are processed by a well-known tracking system relying on a set of particle filters. While the novelty in localization and tracking is incremental, a fully functional, complete system that runs in real time on these algorithms is proposed here for the first time. As such, it could serve as an additional baseline method for the LOCATA challenge.
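To make the localization step concrete, the following is a minimal sketch of a pseudointensity-based directional histogram computed from FOA signals. It is not the authors' implementation. It assumes that the four FOA channels are available as complex STFT arrays `W`, `X`, `Y`, `Z`, that the encoding convention makes `X`, `Y`, `Z` proportional to the direction cosines of the source (so the vector Re{W* [X, Y, Z]} points toward the source), and that a plain equal-angle azimuth/elevation grid weighted by vector magnitude is an acceptable stand-in for a spherical histogram. The function names, the grid resolution and the weighting are all hypothetical choices made for illustration.

```python
import numpy as np

def pseudointensity_histogram(W, X, Y, Z, n_az=72, n_el=36):
    """Build an azimuth/elevation histogram from FOA STFT channels.

    W, X, Y, Z: complex arrays of identical shape (frames, bins).
    Assumption: X, Y, Z are proportional to the source direction cosines.
    """
    # Pseudointensity vector per time-frequency bin: Re{ conj(W) * [X, Y, Z] }
    vec = np.real(np.conj(W)[..., None] * np.stack([X, Y, Z], axis=-1))
    vec = vec.reshape(-1, 3)

    # Discard bins with zero energy, for which the direction is undefined
    norm = np.linalg.norm(vec, axis=1)
    keep = norm > 0
    vec, norm = vec[keep], norm[keep]

    # Convert each vector to azimuth and elevation (radians)
    az = np.arctan2(vec[:, 1], vec[:, 0])
    el = np.arcsin(np.clip(vec[:, 2] / norm, -1.0, 1.0))

    # Accumulate on an equal-angle grid, weighting each bin by its magnitude
    # (assumed weighting; an equal-angle grid is not uniform on the sphere)
    hist, az_edges, el_edges = np.histogram2d(
        az, el,
        bins=[n_az, n_el],
        range=[[-np.pi, np.pi], [-np.pi / 2, np.pi / 2]],
        weights=norm,
    )
    return hist, az_edges, el_edges

def histogram_peak(hist, az_edges, el_edges):
    """Return the (azimuth, elevation) at the center of the strongest cell."""
    i, j = np.unravel_index(np.argmax(hist), hist.shape)
    az = 0.5 * (az_edges[i] + az_edges[i + 1])
    el = 0.5 * (el_edges[j] + el_edges[j + 1])
    return az, el
```

With a single source, the strongest histogram cell gives one instantaneous direction estimate. A multi-source system would instead extract several peaks per frame and pass them to the tracker, which this sketch does not cover.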
