论文信息 - Collaborative Generative Adversarial Network with Visual perception and memory reasoning

Collaborative Generative Adversarial Network with Visual perception and memory reasoning

Abstract In order to address such negative issues as GAN’s mediocre image quality, high-demand for training samples and computation resources, this paper proposes the collaborative Generative Adversarial Network with Visual perception and memory reasoning (ESA-CGAN). This not only makes use of the vision self-attention mechanism and objects salience model to analyze the global information and detailed features of objects, but also designs cross-correlation self-attention module so as to make a balance between the computation efficiency and computational on the one hand and the statistical efficiency and the ability to simulate remote dependencies on the other hand. Based on convolutional long-term and short-term memory network, this paper optimizes the attention feature map so as to highlight the features of objects themselves and improve their generative abilities. Meanwhile, a cooperative learning mechanism between generators is designed, which combines self-constructed generation model and pre-training generation model to form a generation model group. It not only improves effectively the generation ability of the model, improve the computing efficiency, but also restrains the collapse of the model from another angle. In fact, the model proposed here has completed numerical experiments on multiple common standard datasets and self-configuring datasets, and has made comparisons with several mainstream generation antagonistic network models in terms of the performance of image data augmentation. The experimental results demonstrate that the model has excellent simulation ability to enable itself to effectively realize the augmentation of data, thus making it highly applicable in the future.

[1] Dong Wang,et al. Real-Time Object Detection in Remote Sensing Images Based on Visual Perception and Memory Reasoning , 2019, Electronics.

[2] Shi-Min Hu,et al. Global contrast based salient region detection , 2011, CVPR 2011.

[3] Dhruv Batra,et al. LR-GAN: Layered Recursive Generative Adversarial Networks for Image Generation , 2016, ICLR.

[4] John K. Tsotsos,et al. Saliency, attention, and visual search: an information theoretic approach. , 2009, Journal of vision.

[5] Jonas Obleser,et al. Probing the limits of alpha power lateralisation as a neural marker of selective attention in middle‐aged and older listeners , 2018, The European journal of neuroscience.

[6] Christof Koch,et al. A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[7] Andreas Geiger,et al. Vision meets robotics: The KITTI dataset , 2013, Int. J. Robotics Res..

[8] Pietro Perona,et al. The Caltech-UCSD Birds-200-2011 Dataset , 2011 .

[9] Yoshua. Bengio,et al. Learning Deep Architectures for AI , 2007, Found. Trends Mach. Learn..

[10] Yueting Zhuang,et al. DeepSaliency: Multi-Task Deep Neural Network Model for Salient Object Detection , 2015, IEEE Transactions on Image Processing.

[11] Gang Wang,et al. Deep Level Sets for Salient Object Detection , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12] C. Schroeder,et al. The Spectrotemporal Filter Mechanism of Auditory Selective Attention , 2013, Neuron.

[13] L. Deng,et al. The MNIST Database of Handwritten Digit Images for Machine Learning Research [Best of the Web] , 2012, IEEE Signal Processing Magazine.

[14] Soumith Chintala,et al. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[15] Lin Yang,et al. Photographic Text-to-Image Synthesis with a Hierarchically-Nested Adversarial Network , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[16] Michael S. Bernstein,et al. ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[17] Huchuan Lu,et al. Saliency Detection with Recurrent Fully Convolutional Networks , 2016, ECCV.

[18] Lianwen Jin,et al. A Multi-Object Rectified Attention Network for Scene Text Recognition , 2019, Pattern Recognit..

[19] Geoffrey E. Hinton,et al. Deep Learning , 2015, Nature.

[20] Feng Xiao,et al. Multi-Object Detection in Traffic Scenes Based on Improved SSD , 2018, Electronics.

[21] Huchuan Lu,et al. A Stagewise Refinement Model for Detecting Salient Objects in Images , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).