暂无分享,去创建一个
Bertrand Le Saux | Devis Tuia | Benjamin Kellenberger | Sylvain Lobry | Christel Chappuis | D. Tuia | B. L. Saux | B. Kellenberger | S. Lobry | Christel Chappuis | Benjamin Kellenberger
[1] Sanja Fidler,et al. Skip-Thought Vectors , 2015, NIPS.
[2] Li Fei-Fei,et al. ImageNet: A large-scale hierarchical image database , 2009, CVPR.
[3] Peng Wang,et al. Ask Me Anything: Free-Form Visual Question Answering Based on Knowledge from External Sources , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[4] L. Tucker,et al. Some mathematical notes on three-mode factor analysis , 1966, Psychometrika.
[5] Xiao Xiang Zhu,et al. Toward a Collective Agenda on AI for Earth Science Data Analysis , 2021, IEEE Geoscience and Remote Sensing Magazine.
[6] Begüm Demir,et al. Bigearthnet: A Large-Scale Benchmark Archive for Remote Sensing Image Understanding , 2019, IGARSS 2019 - 2019 IEEE International Geoscience and Remote Sensing Symposium.
[7] Begüm Demir,et al. RSVQA Meets Bigearthnet: A New, Large-Scale, Visual Question Answering Dataset for Remote Sensing , 2021, 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS.
[8] Sanja Fidler,et al. Aligning Books and Movies: Towards Story-Like Visual Explanations by Watching Movies and Reading Books , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[9] Matthieu Cord,et al. MUTAN: Multimodal Tucker Fusion for Visual Question Answering , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[10] Zhou Yu,et al. Beyond Bilinear: Generalized Multimodal Factorized High-Order Pooling for Visual Question Answering , 2017, IEEE Transactions on Neural Networks and Learning Systems.
[11] Margaret Mitchell,et al. VQA: Visual Question Answering , 2015, International Journal of Computer Vision.
[12] Chunhua Shen,et al. Explicit Knowledge-based Reasoning for Visual Question Answering , 2015, IJCAI.
[13] Zhou Yu,et al. Multi-modal Factorized Bilinear Pooling with Co-attention Learning for Visual Question Answering , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[14] Moses Charikar,et al. Finding frequent items in data streams , 2002, Theor. Comput. Sci..
[15] Trevor Darrell,et al. Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding , 2016, EMNLP.
[16] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[17] Wei Xu,et al. ABC-CNN: An Attention Based Convolutional Neural Network for Visual Question Answering , 2015, ArXiv.
[18] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[19] Dan Klein,et al. Learning to Compose Neural Networks for Question Answering , 2016, NAACL.
[20] Dan Klein,et al. Neural Module Networks , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[21] Guigang Zhang,et al. Deep Learning , 2016, Int. J. Semantic Comput..
[22] Xiaoqiang Lu,et al. Mutual Attention Inception Network for Remote Sensing Visual Question Answering , 2022, IEEE Transactions on Geoscience and Remote Sensing.
[23] Tamara G. Kolda,et al. Tensor Decompositions and Applications , 2009, SIAM Rev..
[24] Maryam Rahnemoonfar,et al. FloodNet: A High Resolution Aerial Imagery Dataset for Post Flood Scene Understanding , 2020, IEEE Access.
[25] Matthieu Cord,et al. BLOCK: Bilinear Superdiagonal Fusion for Visual Question Answering and Visual Relationship Detection , 2019, AAAI.
[26] Alexander J. Smola,et al. Stacked Attention Networks for Image Question Answering , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[27] Diego Marcos,et al. RSVQA: Visual Question Answering for Remote Sensing Data , 2020, IEEE Transactions on Geoscience and Remote Sensing.