Semi-supervised target classification in multi-frequency echosounder data

Acoustic target classification in multi-frequency echosounder data is a major interest for the marine ecosystem and fishery management since it can potentially estimate the abundance or biomass of the species. A key problem of current methods is the heavy dependence on the manual categorization of data samples. As a solution, we propose a novel semi-supervised deep learning method leveraging a few annotated data samples together with vast amounts of unannotated data samples, all in a single model. Specifically, two inter-connected objectives, namely, a clustering objective and a classification objective, optimize one shared convolutional neural network in an alternating manner. The clustering objective exploits the underlying structure of all data, both annotated and unannotated; the classification objective enforces a certain consistency to given classes using the few annotated data samples. We evaluate our classification method using echosounder data from the sandeel case study in the North Sea. In the semi-supervised setting with only a tenth of the training data annotated, our method achieves 67.6% accuracy, outperforming a conventional semi-supervised method by 7.0 percentage points. When applying the proposed method in a fully supervised setup, we achieve 74.7% accuracy, surpassing the standard supervised deep learning method by 4.7 percentage points.

[1]  Michael Kampffmeyer,et al.  Deep Divergence-Based Approach to Clustering , 2019, Neural Networks.

[2]  Atsuto Maki,et al.  A systematic study of the class imbalance problem in convolutional neural networks , 2017, Neural Networks.

[3]  Dag Tjøstheim,et al.  The sampling volume of trawl and acoustics: estimating availability probabilities from observations of tracked individual fish , 2009 .

[4]  Bo Yang,et al.  Towards K-means-friendly Spaces: Simultaneous Deep Learning and Clustering , 2016, ICML.

[5]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[7]  Matthijs Douze,et al.  Deep Clustering for Unsupervised Learning of Visual Features , 2018, ECCV.

[8]  Rolf J. Korneliussen,et al.  Acoustic identification of marine species using a feature library , 2016 .

[9]  Luca Antiga,et al.  Automatic differentiation in PyTorch , 2017 .

[10]  E. Ona,et al.  Synthetic echograms generated from the relative frequency response , 2003 .

[11]  R. Furness,et al.  Regional variation in the role of bottom-up and top-down processes in controlling sandeel abundance in the North Sea , 2007 .

[12]  Paola Cappanera,et al.  Lagrangean-Based Combinatorial Optimization for Large-Scale S3VMs , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[13]  D. Raitt A Preliminary Account of the Sandeels of Scottish Waters , 1934 .

[14]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[15]  Alexandra Branzan Albu,et al.  A Deep Learning-based Framework for the Detection of Schools of Herring in Echograms , 2019, ArXiv.

[16]  Gérard Govaert,et al.  Assessing a Mixture Model for Clustering with the Integrated Completed Likelihood , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[18]  Philippe Thomas Review of Semi-supervised learning by O. Chapelle, B. Schölkopf, and A. Zien, Eds. London, UK, MIT Press, 2006 , 2009 .

[19]  Amar Mitiche,et al.  Deep Clustering: On the Link Between Discriminative Models and K-Means , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  David G. Reid Report on Echo Trace Classification , 2000 .

[21]  Arnt-Børre Salberg,et al.  Acoustic classification in multifrequency echosounder data using deep convolutional neural networks , 2020 .

[22]  E. Ona,et al.  Size-dependent frequency response of sandeel schools , 2009 .

[23]  Rudy J. Kloser,et al.  Species identification in deep water using multiple acoustic frequencies , 2002 .

[24]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[25]  J. Hislop,et al.  Ecology of North Sea fish , 1990 .

[26]  R. Furness Management implications of interactions between fisheries and sandeel-dependent seabirds and seals in the North Sea , 2002 .

[27]  Paul G. Fernandes,et al.  A consistent approach to definitions and symbols in fisheries acoustics , 2002 .

[28]  Paul Geladi,et al.  Principal Component Analysis , 1987, Comprehensive Chemometrics.

[29]  Lutz Prechelt,et al.  Early Stopping-But When? , 1996, Neural Networks: Tricks of the Trade.

[30]  O. Chapelle,et al.  Semi-Supervised Learning (Chapelle, O. et al., Eds.; 2006) [Book reviews] , 2009, IEEE Transactions on Neural Networks.

[31]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[32]  G. Rieucau,et al.  Collective structures anchor massive schools of lesser sandeel to the seabed, increasing vulnerability to fishery , 2017 .