An Features Extraction and Recognition Method for Underwater Acoustic Target Based on ATCNN

Facing the complex marine environment, it is extremely challenging to conduct underwater acoustic target recognition (UATR) using ship-radiated noise. Inspired by neural mechanism of auditory perception, this paper provides a new deep neural network trained by original underwater acoustic signals with depthwise separable convolution (DWS) and time-dilated convolution neural network, named auditory perception inspired time-dilated convolution neural network (ATCNN), and then implements detection and classification for underwater acoustic signals. The proposed ATCNN model consists of learnable features extractor and integration layer inspired by auditory perception, and time-dilated convolution inspired by language model. This paper decomposes original time-domain ship-radiated noise signals into different frequency components with depthwise separable convolution filter, and then extracts signal features based on auditory perception. The deep features are integrated on integration layer. The time-dilated convolution is used for long-term contextual modeling. As a result, like language model, intra-class and inter-class information can be fully used for UATR. For UATR task, the classification accuracy reaches 90.9%, which is the highest in contrast experiment. Experimental results show that ATCNN has great potential to improve the performance of UATR classification.

[1]  Yang Yu,et al.  Deep learning-based recognition of underwater target , 2016, 2016 IEEE International Conference on Digital Signal Processing (DSP).

[2]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[3]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[4]  Yanxiong Li,et al.  Sound Event Detection Via Dilated Convolutional Recurrent Neural Networks , 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[5]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[6]  V M Haughton,et al.  Functional MR of the primary auditory cortex: an analysis of pure tone activation and tone discrimination. , 1997, AJNR. American journal of neuroradiology.

[7]  Qingxin Meng,et al.  A wave structure based method for recognition of marine acoustic target signals , 2015 .

[8]  P. Ramakrishna Rao,et al.  Target classification in a passive sonar-an expert system approach , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[9]  M. Karimi,et al.  Ship noise classification using Probabilistic Neural Network and AR model coefficients , 2005, Europe Oceans 2005.

[10]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[11]  Lilun Zhang,et al.  The Classification of Underwater Acoustic Targets Based on Deep Learning Methods , 2017 .

[12]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[13]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[14]  Stanley A. Gelfand,et al.  Hearing: An Introduction to Psychological and Physiological Acoustics, Fourth Edition , 1998 .

[15]  Richard Kronland-Martinet,et al.  A real-time algorithm for signal analysis with the help of the wavelet transform , 1989 .

[16]  C. Schreiner,et al.  Modular organization of frequency integration in primary auditory cortex. , 2000, Annual review of neuroscience.

[17]  P. Tibbetts :Cognitive Neuroscience: The Biology of the Mind , 2009 .

[18]  Vladlen Koltun,et al.  Multi-Scale Context Aggregation by Dilated Convolutions , 2015, ICLR.

[19]  T. Robinson,et al.  Brain Plasticity and Behavior , 2003, Annual review of psychology.

[20]  Arun Kumar,et al.  Marine vessel classification based on passive sonar data: the cepstrum-based approach , 2013 .

[21]  Samy Bengio,et al.  Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks , 2015, NIPS.

[22]  Mark J. Shensa,et al.  The discrete wavelet transform: wedding the a trous and Mallat algorithms , 1992, IEEE Trans. Signal Process..

[23]  J. G. Lourens Classification of ships using underwater radiated noise , 1988 .

[24]  Gerald Langner,et al.  Laminar fine structure of frequency organization in auditory midbrain , 1997, Nature.

[25]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[26]  Paul Magron,et al.  Language Modelling for Sound Event Detection with Teacher Forcing and Scheduled Sampling , 2019, DCASE.

[27]  Yanxiong Li,et al.  Sound Event Detection with Depthwise Separable and Dilated Convolutions , 2020, 2020 International Joint Conference on Neural Networks (IJCNN).

[28]  Di Wu,et al.  Feature Extraction of Underwater Target Signal Using Mel Frequency Cepstrum Coefficients Based on Acoustic Vector Sensor , 2016, J. Sensors.

[29]  Daniel P. W. Ellis,et al.  Audio tagging with noisy labels and minimal supervision , 2019, DCASE.

[30]  François Chollet,et al.  Xception: Deep Learning with Depthwise Separable Convolutions , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[31]  H. Penny Nii Signal-to-symbol transformation: Reasoning in the HASP/SIAP program , 1984, ICASSP.

[32]  Geoffrey Zweig,et al.  Joint Language and Translation Modeling with Recurrent Neural Networks , 2013, EMNLP.

[33]  B. Moore Cochlear hearing loss : physiological, psychological and technical issues , 2014 .

[34]  Antonio Cardenal-Lopez,et al.  ShipsEar: An underwater vessel noise database , 2016 .

[35]  William Soares Filho,et al.  Preprocessing passive sonar signals for neural classification , 2011 .

[36]  Wang Zhi-qiang,et al.  Underwater Target Recognition Based on Wavelet Packet and Principal Component Analysis , 2011 .

[37]  Shengchun Piao,et al.  The classification of underwater acoustic target signals based on wave structure and support vector machine , 2014 .

[38]  N. Weinberger Learning-induced changes of auditory receptive fields , 1993, Current Opinion in Neurobiology.

[39]  Donald Robertson,et al.  Plasticity of frequency organization in auditory cortex of guinea pigs with partial unilateral deafness , 1989, The Journal of comparative neurology.

[40]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[41]  Yue Pan,et al.  Underwater acoustic target recognition using SVM ensemble via weighted sample and feature selection , 2016, 2016 13th International Bhurban Conference on Applied Sciences and Technology (IBCAST).

[42]  Norman M. Weinberger,et al.  Experience-Dependent Response Plasticity in the Auditory Cortex: Issues, Characteristics, Mechanisms, and Functions , 2004 .

[43]  Tara N. Sainath,et al.  Learning filter banks within a deep neural network framework , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.

[44]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[45]  Gang Hu,et al.  Deep Learning Methods for Underwater Target Feature Extraction and Recognition , 2018, Comput. Intell. Neurosci..

[46]  Bo Chen,et al.  MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications , 2017, ArXiv.

[47]  Barbara Canlon,et al.  Sound-induced motility of isolated cochlear outer hair cells is frequency-specific , 1989, Nature.

[48]  Jonathan Ashmore,et al.  The cochlea , 2000, Current Biology.

[49]  M. H. Supriya,et al.  Deep learning architectures for underwater target recognition , 2013, 2013 Ocean Electronics (SYMPOL).