Deep learning-based framework for expansion, recognition and classification of underwater acoustic signal

ABSTRACT Recently, deep learning has developed rapidly and contributed in many fields like the classification in radar and sonar applications. In some special fields like the underwater acoustic signals, the dataset for training may be scarce due to the reason of security or other restrictions, which affects the performance of the deep learning methods as those need a big dataset to ensure high accuracy. Furthermore, the original dataset is in some formats like audio, which makes those methods difficult to capture features, especially in insufficient sample case because of the interference. In this paper, we present a novel framework that applies the LOFAR spectrum for preprocessing to retain key features and utilises Generative Adversarial Networks (GAN) for the expansion of samples to improve the performance classification. Firstly, our framework selects proper preprocessing method based on the evaluation of the spectrum methods. Secondly, our framework revises a GAN to generate samples and built an independent classification network to ensure the quality of those. Finally, our framework applies the existing classification networks to evaluate the performance and selects the best one for real utilisation. The experimental results show that the generated samples have high quality, which can significantly improve the classification accuracy of the neural models.

[1]  Lilun Zhang,et al.  The Classification of Underwater Acoustic Targets Based on Deep Learning Methods , 2017 .

[2]  David Berthelot,et al.  BEGAN: Boundary Equilibrium Generative Adversarial Networks , 2017, ArXiv.

[3]  Jimeng Sun,et al.  Generating Multi-label Discrete Electronic Health Records using Generative Adversarial Networks , 2017, ArXiv.

[4]  Song Han,et al.  Deep Compression: Compressing Deep Neural Network with Pruning, Trained Quantization and Huffman Coding , 2015, ICLR.

[5]  Masatoshi Okutomi,et al.  Coupled convolution layer for convolutional neural network , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[6]  Jonathon Shlens,et al.  Conditional Image Synthesis with Auxiliary Classifier GANs , 2016, ICML.

[7]  Alex Pappachen James,et al.  Variability Analysis of Memristor-based Sigmoid Function , 2018, 2018 International Conference on Computing and Network Communications (CoCoNet).

[8]  S. Markoff,et al.  LOFAR - low frequency array , 2006 .

[9]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[10]  Bruce J. Schachter Automatic Target Recognition , 2016 .

[11]  Pieter Abbeel,et al.  InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets , 2016, NIPS.

[12]  Simon Osindero,et al.  Conditional Generative Adversarial Nets , 2014, ArXiv.

[13]  Soumith Chintala,et al.  Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[14]  Zhe Chen,et al.  Entropy based underwater acoustic signal detection , 2017, 2017 14th International Bhurban Conference on Applied Sciences and Technology (IBCAST).

[15]  Peter F. McGuire,et al.  Automatic Target Recognition in Synthetic Aperture Radar Imagery: A State-of-the-Art Review , 2016, IEEE Access.

[16]  Razvan C. Bunescu,et al.  Sentiment analyzer: extracting sentiments about a given topic using natural language processing techniques , 2003, Third IEEE International Conference on Data Mining.

[17]  Mohamed Moustafa,et al.  CIFAR-10: KNN-Based Ensemble of Classifiers , 2016, 2016 International Conference on Computational Science and Computational Intelligence (CSCI).

[18]  Xiu Li,et al.  Accelerating fish detection and recognition by sharing CNNs with objectness learning , 2016, OCEANS 2016 - Shanghai.

[19]  André van Schaik,et al.  A comparison of extreme learning machines and back-propagation trained feed-forward networks processing the mnist database , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[20]  C. Zhang,et al.  DNN speaker adaptation using parameterised sigmoid and ReLU hidden activation functions , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[21]  Dhruv Batra,et al.  LR-GAN: Layered Recursive Generative Adversarial Networks for Image Generation , 2016, ICLR.

[22]  Joachim Denzler,et al.  Visual fish tracking: Combining a two-stage graph approach with CNN-features , 2017, OCEANS 2017 - Aberdeen.

[23]  Bo Yuan,et al.  Efficient hardware architecture of softmax layer in deep neural network , 2016, 2016 29th IEEE International System-on-Chip Conference (SOCC).

[24]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[25]  Bo Fu,et al.  Deep learning feature extraction for target recognition and classification in underwater sonar images , 2017, 2017 IEEE 56th Annual Conference on Decision and Control (CDC).

[26]  Masatoshi Okutomi,et al.  Coupled convolution layer for convolutional neural network , 2018, Neural Networks.

[27]  Ikuo Matsuo,et al.  Real time detection and localization system for underwater acoustic signal with cable observatories in the west Pacific Ocean , 2016, 2016 Techno-Ocean (Techno-Ocean).

[28]  Geoffrey E. Hinton,et al.  Understanding how Deep Belief Networks perform acoustic modelling , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[29]  Matias Valdenegro-Toro,et al.  Object recognition in forward-looking sonar images with Convolutional Neural Networks , 2016, OCEANS 2016 MTS/IEEE Monterey.

[30]  Orhan Arikan,et al.  Short-time Fourier transform: two fundamental properties and an optimal implementation , 2003, IEEE Trans. Signal Process..

[31]  Gang Hu,et al.  Deep Learning Methods for Underwater Target Feature Extraction and Recognition , 2018, Comput. Intell. Neurosci..

[32]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[33]  Aaron C. Courville,et al.  Improved Training of Wasserstein GANs , 2017, NIPS.