Key-Word-Aware Network for Referring Expression Image Segmentation
暂无分享,去创建一个
Hongliang Li | Hengcan Shi | Qingbo Wu | Fanman Meng | Q. Wu | Hongliang Li | Fanman Meng | Hengcan Shi
[1] Jianmin Zhao,et al. A Fast Simple Optical Flow Computation Approach Based on the 3-D Gradient , 2014, IEEE Transactions on Circuits and Systems for Video Technology.
[2] Shuicheng Yan,et al. Semantic Object Parsing with Graph LSTM , 2016, ECCV.
[3] Pietro Perona,et al. Microsoft COCO: Common Objects in Context , 2014, ECCV.
[4] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[5] Wei Liu,et al. ParseNet: Looking Wider to See Better , 2015, ArXiv.
[6] Tao Mei,et al. Multi-level Attention Networks for Visual Question Answering , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[7] Xiaogang Wang,et al. Learning Object Interactions and Descriptions for Semantic Image Segmentation , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[8] Liang Lin,et al. I2T: Image Parsing to Text Description , 2010, Proceedings of the IEEE.
[9] 한보형,et al. Learning Deconvolution Network for Semantic Segmentation , 2015 .
[10] Jian Sun,et al. Convolutional feature masking for joint object and stuff segmentation , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[11] Vicente Ordonez,et al. ReferItGame: Referring to Objects in Photographs of Natural Scenes , 2014, EMNLP.
[12] Iasonas Kokkinos,et al. DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[13] Hongliang Li,et al. Video Object Segmentation via Global Consistency Aware Query Strategy , 2017, IEEE Transactions on Multimedia.
[14] Jiasen Lu,et al. Hierarchical Question-Image Co-Attention for Visual Question Answering , 2016, NIPS.
[15] Ruimao Zhang,et al. Geometric Scene Parsing with Hierarchical LSTM , 2016, IJCAI.
[16] Licheng Yu,et al. Modeling Context in Referring Expressions , 2016, ECCV.
[17] George Papandreou,et al. Rethinking Atrous Convolution for Semantic Image Segmentation , 2017, ArXiv.
[18] Alexander J. Smola,et al. Stacked Attention Networks for Image Question Answering , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[19] Yoshua Bengio,et al. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention , 2015, ICML.
[20] Jitendra Malik,et al. Indoor Scene Understanding with RGB-D Images: Bottom-up Segmentation, Object Detection and Semantic Segmentation , 2015, International Journal of Computer Vision.
[21] Yuting Zhang,et al. Discriminative Bimodal Networks for Visual Localization and Detection with Natural Language Queries , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[22] Jitendra Malik,et al. Perceptual Organization and Recognition of Indoor Scenes from RGB-D Images , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.
[23] Xiaogang Wang,et al. Pyramid Scene Parsing Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[24] Hongliang Li,et al. Unsupervised Multiclass Region Cosegmentation via Ensemble Clustering and Energy Minimization , 2014, IEEE Transactions on Circuits and Systems for Video Technology.
[25] Alan L. Yuille,et al. Generation and Comprehension of Unambiguous Object Descriptions , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[26] Trevor Darrell,et al. Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.
[27] Larry S. Davis,et al. Modeling Context Between Objects for Referring Expression Understanding , 2016, ECCV.
[28] Michael S. Bernstein,et al. ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.
[29] Trevor Darrell,et al. Utilizing Large Scale Vision and Text Datasets for Image Segmentation from Referring Expressions , 2016, ArXiv.
[30] Trevor Darrell,et al. Natural Language Object Retrieval , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[31] Chenxi Liu,et al. Recurrent Multimodal Interaction for Referring Image Segmentation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[32] Zhen Li,et al. LSTM-CF: Unifying Context Modeling and Fusion with LSTMs for RGB-D Scene Labeling , 2016, ECCV.
[33] King Ngi Ngan,et al. Globally Measuring the Similarity of Superpixels by Binary Edge Maps for Superpixel Clustering , 2018, IEEE Transactions on Circuits and Systems for Video Technology.
[34] Vladlen Koltun,et al. Multi-Scale Context Aggregation by Dilated Convolutions , 2015, ICLR.
[35] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.
[36] Trevor Darrell,et al. Segmentation from Natural Language Expressions , 2016, ECCV.
[37] Jitendra Malik,et al. Learning Rich Features from RGB-D Images for Object Detection and Segmentation , 2014, ECCV.
[38] Richard Socher,et al. Knowing When to Look: Adaptive Attention via a Visual Sentinel for Image Captioning , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[39] M. Welling,et al. Region-Based Semantic Segmentation with End-to-End Training , 2016 .