Part-Aware Interactive Learning for Scene Graph Generation
暂无分享,去创建一个
Yongdong Zhang | Ning Xu | An-An Liu | Hongshuo Tian | Anan Liu | Yongdong Zhang | Ning Xu | Hongshuo Tian
[1] Tao Mei,et al. Exploring Visual Relationship for Image Captioning , 2018, ECCV.
[2] Wei Liu,et al. Learning to Compose Dynamic Tree Structures for Visual Contexts , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[3] Anton van den Hengel,et al. Graph-Structured Representations for Visual Question Answering , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[4] Luca Antiga,et al. Automatic differentiation in PyTorch , 2017 .
[5] Jianqiang Huang,et al. Unbiased Scene Graph Generation From Biased Training , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[6] Jianfei Cai,et al. Scene Graph Generation With External Knowledge and Image Reconstruction , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[7] Xiaogang Wang,et al. Factorizable Net: An Efficient Subgraph-based Framework for Scene Graph Generation , 2018, ECCV.
[8] Xiaogang Wang,et al. Scene Graph Generation from Objects, Phrases and Region Captions , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[9] Samy Bengio,et al. Learning semantic relationships for better action retrieval in images , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[10] Michael S. Bernstein,et al. Visual Relationship Detection with Language Priors , 2016, ECCV.
[11] Xiaogang Wang,et al. ViP-CNN: Visual Phrase Guided Convolutional Neural Network , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[12] Michael S. Bernstein,et al. Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations , 2016, International Journal of Computer Vision.
[13] Chong-Wah Ngo,et al. Name-Face Association in Web Videos: A Large-Scale Dataset, Baselines, and Open Issues , 2014, Journal of Computer Science and Technology.
[14] Ning Xu,et al. Scene graph captioner: Image captioning based on structural visual representation , 2019, J. Vis. Commun. Image Represent..
[15] Caiyan Jia,et al. Structure-Aware Deep Learning for Product Image Classification , 2019, ACM Trans. Multim. Comput. Commun. Appl..
[16] Mingmin Chi,et al. Relation Parsing Neural Network for Human-Object Interaction Detection , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[17] Ali Farhadi,et al. Recognition using visual phrases , 2011, CVPR 2011.
[18] Svetlana Lazebnik,et al. Learning Models for Actions and Person-Object Interactions with Transfer to Question Answering , 2016, ECCV.
[19] Bo Dai,et al. Detecting Visual Relationships with Deep Relational Networks , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[20] Jie Nie,et al. M-GCN: Multi-Branch Graph Convolution Network for 2D Image-based on 3D Model Retrieval , 2021, IEEE Transactions on Multimedia.
[21] Sarah Parisot,et al. Learning Conditioned Graph Structures for Interpretable Visual Question Answering , 2018, NeurIPS.
[22] Lei Zhang,et al. Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[23] Hai Wan,et al. Representation Learning for Scene Graph Completion via Jointly Structural and Visual Embedding , 2018, IJCAI.
[24] Kaiming He,et al. Mask R-CNN , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[25] Cewu Lu,et al. Pairwise Body-Part Attention for Recognizing Human-Object Interactions , 2018, ECCV.
[26] Danfei Xu,et al. Scene Graph Generation by Iterative Message Passing , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[27] Liang Lin,et al. Knowledge-Embedded Routing Network for Scene Graph Generation , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[28] Wei Zhang,et al. Pyrboxes: An efficient multi-scale scene text detector with feature pyramids , 2019, Pattern Recognit. Lett..
[29] Liu Wu,et al. Human Mesh Recovery From Monocular Images via a Skeleton-Disentangled Representation , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[30] Yejin Choi,et al. Neural Motifs: Scene Graph Parsing with Global Context , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[31] Zhenan Sun,et al. Foreground-Aware Pyramid Reconstruction for Alignment-Free Occluded Person Re-Identification , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[32] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.
[33] Li Fei-Fei,et al. Image Generation from Scene Graphs , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[34] Michael S. Bernstein,et al. Image retrieval using scene graphs , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[35] Shih-Fu Chang,et al. Visual Translation Embedding Network for Visual Relation Detection , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[36] Zhiyuan Liu,et al. CANE: Context-Aware Network Embedding for Relation Modeling , 2017, ACL.
[37] Long Chen,et al. Counterfactual Critic Multi-Agent Training for Scene Graph Generation , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[38] Yongdong Zhang,et al. Dual-Stream Recurrent Neural Network for Video Captioning , 2019, IEEE Transactions on Circuits and Systems for Video Technology.
[39] Kaiming He,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[40] Roger Zimmermann,et al. Towards Natural and Accurate Future Motion Prediction of Humans and Animals , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[41] Weijian Li,et al. Attentive Relational Networks for Mapping Images to Scene Graphs , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[42] Mohan S. Kankanhalli,et al. Scene Graph Inference via Multi-Scale Context Modeling , 2021, IEEE Transactions on Circuits and Systems for Video Technology.
[43] Stefan Lee,et al. Graph R-CNN for Scene Graph Generation , 2018, ECCV.
[44] Sicheng Zhao,et al. 3D Pose Estimation Based on Reinforce Learning for 2D Image-Based 3D Model Retrieval , 2021, IEEE transactions on multimedia.
[45] Larry S. Davis,et al. Visual Relationship Detection with Internal and External Linguistic Knowledge Distillation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[46] Sicheng Zhao,et al. Deep Correlated Joint Network for 2-D Image-Based 3-D Model Retrieval , 2020, IEEE Transactions on Cybernetics.
[47] Jia Deng,et al. Pixels to Graphs by Associative Embedding , 2017, NIPS.