论文信息 - Individual Ship Detection Using Underwater Acoustics

Individual Ship Detection Using Underwater Acoustics

Individual ship detection from underwater audio is the task of deciding whether a specific ship is present, using sound captured by an underwater hydrophone. It is a task analogous to speaker identification (SID), in the sense that it is an open-class detection task; the ships present could be other irrelevant (“impostor”) ships, never encountered in the training data. We present two methodologies for tackling this problem, both motivated by our work in speech-related technologies: (i) one based on neural networks, which follows, to a large extent, the approach of [1], and (ii) one based on i-vectors and PLDA [2]. To the best of our knowledge, this is the first time that the topic of individual ship detection is approached as an open-class detection problem.

Jan Silovský | Richard M. Schwartz | John Makhoul | Damianos Karakos | William Hartmann

[1] Patrick Kenny,et al. Front-End Factor Analysis for Speaker Verification , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[2] Richard M. Schwartz,et al. Score normalization and system combination for improved keyword spotting , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.

[3] Richard M. Schwartz,et al. Normalizationofphonetic keyword search scores , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[4] J. Hildebrand,et al. Underwater radiated noise from modern commercial ships. , 2012, The Journal of the Acoustical Society of America.

[5] Richard M. Schwartz,et al. White Listing and Score Normalization for Keyword Spotting of Noisy Speech , 2012, INTERSPEECH.

[6] James H. Elder,et al. Probabilistic Linear Discriminant Analysis for Inferences About Identity , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[7] John Makhoul,et al. Applying speech technology to the ship-type classification problem , 2017, OCEANS 2017 – Anchorage.

[8] Geoffrey E. Hinton,et al. Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[9] José Seixas,et al. Class-modular multi-layer perceptron networks for supporting passive sonar signal classification , 2016 .

[10] Themos Stafylakis,et al. PLDA for speaker verification with utterances of arbitrary duration , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.