Data Augmentation using Conditional Generative Adversarial Network for Underwater Target Recognition

In underwater target recognition tasks, a deep learning recognition model is more effective than traditional methods, but meanwhile requires sufficient labeled data at the same time. Data augmentation is an effective method to increase the size of training data and reduced the mismatch between training and test. Different from the traditional approaches by directly adding noise to the original waveform or perform waveform transformation, we explore a supervised conditional generative adversarial network framework for data augmentation by using few ship data. The experiment result shows that higher recognition accuracy can be obtained when generated data are added to the original data. We find that the recognition models do not always perform better when the added generated data increases. It is appropriate to add about 5 times of the amount of the original data. The experimental result shows that in 5-target recognition task, with only 1000 samples for each type of target provided, a relative 8.8% to 13.3% improvement can be obtained when adding the generated data.

[1]  Léon Bottou,et al.  Towards Principled Methods for Training Generative Adversarial Networks , 2017, ICLR.

[2]  Sanjeev Khudanpur,et al.  A study on data augmentation of reverberant speech for robust speech recognition , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[3]  Yongqiang Wang,et al.  Speaker and Noise Factorization for Robust Speech Recognition , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[4]  Mira Lilleholt Vik Speech Enhancement with a Generative Adversarial Network , 2019 .

[5]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[6]  Sanjeev Khudanpur,et al.  Parallel training of DNNs with Natural Gradient and Parameter Averaging , 2014 .

[7]  G. Celeux,et al.  An entropy criterion for assessing the number of clusters in a mixture model , 1996 .

[8]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[9]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[10]  Simon Osindero,et al.  Conditional Generative Adversarial Nets , 2014, ArXiv.

[11]  Anna Kolesárová,et al.  1-Lipschitz Aggregation Operators, Quasi-Copulas and Copulas with Given Diagonals , 2004 .

[12]  Léon Bottou,et al.  Wasserstein GAN , 2017, ArXiv.

[13]  Daniel Povey,et al.  The Kaldi Speech Recognition Toolkit , 2011 .

[14]  Alexei A. Efros,et al.  Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Shengchun Piao,et al.  The classification of underwater acoustic target signals based on wave structure and support vector machine , 2014 .

[16]  Yoshua Bengio,et al.  Deep Sparse Rectifier Neural Networks , 2011, AISTATS.

[17]  Tara N. Sainath,et al.  Generation of Large-Scale Simulated Utterances in Virtual Rooms to Train Deep-Neural Networks for Far-Field Speech Recognition in Google Home , 2017, INTERSPEECH.

[18]  Ping Yu,et al.  Generating Adversarial Examples With Conditional Generative Adversarial Net , 2018, 2018 24th International Conference on Pattern Recognition (ICPR).

[19]  M. Stevenson,et al.  Sonar signal detection and classification using artificial neural networks , 2000, 2000 Canadian Conference on Electrical and Computer Engineering. Conference Proceedings. Navigating to a New Era (Cat. No.00TH8492).

[20]  Yonghong Yan,et al.  Underwater target classification using deep learning , 2018, OCEANS 2018 MTS/IEEE Charleston.