Channel and spatial attention based deep object co-segmentation

Abstract. Object co-segmentation is a challenging task that aims to segment the common objects in multiple images simultaneously. Solving it generally requires finding information shared by instances of the same object. Across varied scenes, however, the common objects in different images often share only their semantic information. In this paper, we propose a deep object co-segmentation method based on channel and spatial attention, which combines the attention mechanism with a deep neural network to enhance the common semantic information. The method uses a Siamese encoder-decoder structure. First, the encoder network extracts low-level and high-level features from an image pair. Second, an improved attention mechanism in the channel and spatial domains enhances the multi-level semantic features of the common objects. Then, the decoder module takes the enhanced feature maps and generates the masks for both images. Finally, we evaluate our approach on commonly used co-segmentation datasets, and the experimental results show that it achieves competitive performance.
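
To make the idea of channel and spatial attention over an image pair concrete, the following is a minimal sketch in plain Python. It is not the paper's formulation, which the abstract does not specify. It assumes feature maps stored as nested lists of shape channels x height x width, channel weights obtained by global average pooling followed by a sigmoid, spatial weights obtained by averaging across channels followed by a sigmoid, and channel weights taken from the other image of the pair so that channels active in both images are emphasized. The names sigmoid, channel_weights, spatial_weights, enhance, and co_attend are hypothetical.

```python
import math


def sigmoid(x):
    """Squash a real value into the interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def channel_weights(feat):
    """One weight per channel: global average pooling, then a sigmoid."""
    weights = []
    for channel in feat:
        total = sum(sum(row) for row in channel)
        count = len(channel) * len(channel[0])
        weights.append(sigmoid(total / count))
    return weights


def spatial_weights(feat):
    """One weight per position: mean over channels, then a sigmoid."""
    c, h, w = len(feat), len(feat[0]), len(feat[0][0])
    return [
        [sigmoid(sum(feat[k][i][j] for k in range(c)) / c) for j in range(w)]
        for i in range(h)
    ]


def enhance(feat, ch_w, sp_w):
    """Scale every activation by its channel weight and its spatial weight."""
    return [
        [
            [value * ch_w[k] * sp_w[i][j] for j, value in enumerate(row)]
            for i, row in enumerate(channel)
        ]
        for k, channel in enumerate(feat)
    ]


def co_attend(feat_a, feat_b):
    """Assumed pairing: each image is reweighted by the channel weights of
    the other image and by its own spatial weights."""
    enhanced_a = enhance(feat_a, channel_weights(feat_b), spatial_weights(feat_a))
    enhanced_b = enhance(feat_b, channel_weights(feat_a), spatial_weights(feat_b))
    return enhanced_a, enhanced_b
```

In a Siamese encoder-decoder setup, a function like co_attend would sit between the shared encoder and the decoder, applied to the features of each level. A real implementation would use learned layers in a deep learning framework in place of the fixed pooling and sigmoid used here.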
