Learning New Acoustic Events in an HMM-Based System Using MAP Adaptation

In this paper, we present a system for recognizing acoustic events that is suited to a robotic application. Hidden Markov models (HMMs) are used to model the different acoustic event classes. We focus on the open-set case, in which a class of acoustic events occurs that was not included in the training phase. We evaluate how such newly occurring classes can be learned using maximum a posteriori (MAP) adaptation or conventional training methods. The experiments were performed on a small database of acoustic events recorded with a robotic platform.
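As a rough illustration of what MAP adaptation does, the sketch below adapts the means of a diagonal-covariance Gaussian mixture toward a small set of new feature frames, using the relevance-factor form of the update that is common for Gaussian mixture models. It is not taken from the paper: the function name `map_adapt_means`, the relevance factor of 16, and the choice to adapt only the means are assumptions made for this example. In an HMM-based system the same update would be applied to the output distribution of each state, with the frame-to-state occupancies coming from the forward-backward or Viterbi alignment.

```python
import numpy as np


def map_adapt_means(means, variances, weights, data, relevance=16.0):
    """MAP-adapt the means of a diagonal-covariance GMM (illustrative sketch).

    means, variances: arrays of shape (K, D); weights: shape (K,);
    data: adaptation frames of shape (T, D).
    relevance: assumed relevance factor controlling how strongly the
    prior means resist the new data.
    """
    # Log-density of every frame under every component, shape (T, K).
    diff = data[:, None, :] - means[None, :, :]
    log_gauss = -0.5 * (
        np.sum(diff ** 2 / variances, axis=2)
        + np.sum(np.log(2.0 * np.pi * variances), axis=1)
    )

    # Posterior probability of each component for each frame.
    log_post = np.log(weights) + log_gauss
    log_post -= log_post.max(axis=1, keepdims=True)  # numerical stability
    post = np.exp(log_post)
    post /= post.sum(axis=1, keepdims=True)

    # Sufficient statistics: soft counts and data means per component.
    counts = post.sum(axis=0)                         # shape (K,)
    data_means = (post.T @ data) / np.maximum(counts, 1e-10)[:, None]

    # Interpolate between the data mean and the prior mean.
    alpha = counts / (counts + relevance)             # shape (K,)
    return alpha[:, None] * data_means + (1.0 - alpha)[:, None] * means
```

Components that see many adaptation frames move close to the mean of the new data, while components that see few or none stay at their prior means. This is why this kind of adaptation can be attractive when only a handful of examples of a new class are available.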
