Joint Learning of Object Graph and Relation Graph for Visual Question Answering
暂无分享,去创建一个
Hao Li | Mingming Sun | Belhal Karimi | Xu Li | Jie Chen
[1] Zhiyuan Liu,et al. Pre-Trained Models: Past, Present and Future , 2021, AI Open.
[2] Chenhui Chu,et al. Understanding the Role of Scene Graphs in Visual Question Answering , 2021, ArXiv.
[3] Liang Lin,et al. Interpretable Visual Question Answering by Reasoning on Dependency Trees , 2019, IEEE transactions on pattern analysis and machine intelligence.
[4] Zhuoqian Yang,et al. Prior Visual Relationship Reasoning For Visual Question Answering , 2020, 2020 IEEE International Conference on Image Processing (ICIP).
[5] Stephan Günnemann,et al. Scene Graph Reasoning for Visual Question Answering , 2020, ArXiv.
[6] Jianqiang Huang,et al. Unbiased Scene Graph Generation From Biased Training , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[7] Nan Duan,et al. Unicoder-VL: A Universal Encoder for Vision and Language by Cross-modal Pre-training , 2019, AAAI.
[8] Shashank Shekhar,et al. From Strings to Things: Knowledge-Enabled VQA Model That Can Read and Reason , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[9] Cheng Zhang,et al. An Empirical Study on Leveraging Scene Graphs for Visual Question Answering , 2019, BMVC.
[10] Christopher D. Manning,et al. Learning by Abstraction: The Neural State Machine , 2019, NeurIPS.
[11] Xiaodan Liang,et al. Spatial-Aware Graph Relation Network for Large-Scale Object Detection , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[12] Yu Cheng,et al. Relation-Aware Graph Attention Network for Visual Question Answering , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[13] Christopher D. Manning,et al. GQA: A New Dataset for Real-World Visual Reasoning and Compositional Question Answering , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[14] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.
[15] Nenghai Yu,et al. Zoom-Net: Mining Deep Feature Interactions for Visual Relationship Recognition , 2018, ECCV.
[16] Wanxiang Che,et al. A Neural Transition-Based Approach for Semantic Dependency Graph Parsing , 2018, AAAI.
[17] Vinay P. Namboodiri,et al. Differential Attention for Visual Question Answering , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[18] Christopher D. Manning,et al. Compositional Attention Networks for Machine Reasoning , 2018, ICLR.
[19] Pietro Liò,et al. Graph Attention Networks , 2017, ICLR.
[20] Wanxiang Che,et al. Transition-Based Chinese Semantic Dependency Graph Parsing , 2016, CCL.
[21] Jiasen Lu,et al. Hierarchical Question-Image Co-Attention for Visual Question Answering , 2016, NIPS.
[22] Michael S. Bernstein,et al. Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations , 2016, International Journal of Computer Vision.
[23] Richard S. Zemel,et al. Gated Graph Sequence Neural Networks , 2015, ICLR.
[24] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.