Joint Measurement of Localization and Detection of Sound Events

Sound event detection and sound localization or tracking have historically been two separate areas of research. Recent development of sound event detection methods approach also the localization side, but lack a consistent way of measuring the joint performance of the system; instead, they measure the separate abilities for detection and for localization. This paper proposes augmentation of the localization metrics with a condition related to the detection, and conversely, use of location information in calculating the true positives for detection. An extensive evaluation example is provided to illustrate the behavior of such joint metrics. The comparison to the detection only and localization only performance shows that the proposed joint metrics operate in a consistent and logical manner, and characterize adequately both aspects.

[1]  Climent Nadeu,et al.  Sound-model-based acoustic source localization using distributed microphone arrays , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[2]  Raffaele Parisi,et al.  Multi-Source Localization Strategies , 2001, Microphone Arrays.

[3]  José Bento,et al.  A metric for sets of trajectories that is practical and mathematically consistent , 2016, ArXiv.

[4]  Annamaria Mesaros,et al.  Metrics for Polyphonic Sound Event Detection , 2016 .

[5]  H. Kuhn The Hungarian method for the assignment problem , 1955 .

[6]  Archontis Politis,et al.  Localization, Detection and Tracking of Multiple Moving Sound Sources with a Convolutional Recurrent Neural Network , 2019, DCASE.

[7]  Daniel P. W. Ellis,et al.  A Discriminative Model for Polyphonic Piano Transcription , 2007, EURASIP J. Adv. Signal Process..

[8]  Archontis Politis,et al.  Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks , 2018, IEEE Journal of Selected Topics in Signal Processing.

[9]  Rainer Stiefelhagen,et al.  Evaluating Multiple Object Tracking Performance: The CLEAR MOT Metrics , 2008, EURASIP J. Image Video Process..

[10]  Gerhard P. Hancke,et al.  Sound based localization and identification in industrial environments , 2017, IECON 2017 - 43rd Annual Conference of the IEEE Industrial Electronics Society.

[11]  Sacha Krstulović,et al.  Audio Event Recognition in the Smart Home , 2018 .

[12]  Emmanuel Vincent,et al.  Sound Event Detection in the DCASE 2017 Challenge , 2019, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[13]  Taras Butko,et al.  Two-source acoustic event detection and localization: Online implementation in a Smart-room , 2011, 2011 19th European Signal Processing Conference.

[14]  Toni Hirvonen,et al.  Classification of Spatial Audio Location and Content Using Convolutional Neural Networks , 2015 .

[15]  Archontis Politis,et al.  A multi-room reverberant dataset for sound event localization and detection , 2019, DCASE.

[16]  Ba-Ngu Vo,et al.  A Consistent Metric for Performance Evaluation of Multi-Object Filters , 2008, IEEE Transactions on Signal Processing.

[17]  Andrzej Czyzewski,et al.  Detection, classification and localization of acoustic events in the presence of background noise for acoustic surveillance of hazardous situations , 2015, Multimedia Tools and Applications.