Aiding Intra-Text Representations with Visual Context for Multimodal Named Entity Recognition
暂无分享,去创建一个
Ignazio Gallo | Shah Nawaz | Alessandro Calefati | Omer Arshad | I. Gallo | O. Arshad | S. Nawaz | Alessandro Calefati | Shah Nawaz
[1] Leonardo Neves,et al. Multimodal Named Entity Recognition for Short Social Media Posts , 2018, NAACL.
[2] Tao Shen,et al. DiSAN: Directional Self-Attention Network for RNN/CNN-free Language Understanding , 2017, AAAI.
[3] Jung-Woo Ha,et al. Dual Attention Networks for Multimodal Reasoning and Matching , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[4] Rada Mihalcea,et al. Going Beyond Text: A Hybrid Image-Text Approach for Measuring Word Relatedness , 2011, IJCNLP.
[5] Léon Bottou,et al. Learning Image Embeddings using Convolutional Neural Networks for Improved Multi-Modal Semantics , 2014, EMNLP.
[6] Oren Etzioni,et al. Named Entity Recognition in Tweets: An Experimental Study , 2011, EMNLP.
[7] Ignazio Gallo,et al. Multimodal Classification Fusion in Real-World Scenarios , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).
[8] Ignazio Gallo,et al. Learning Fused Representations for Large-Scale Multimodal Classification , 2019, IEEE Sensors Letters.
[9] Lei Zhang,et al. Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[10] Prakhar Gupta,et al. Learning Word Vectors for 157 Languages , 2018, LREC.
[11] Eduard H. Hovy,et al. End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF , 2016, ACL.
[12] Tomas Mikolov,et al. Efficient Large-Scale Multi-Modal Classification , 2018, AAAI.
[13] Thamar Solorio,et al. A Multi-task Approach for Named Entity Recognition in Social Media Data , 2017, NUT@EMNLP.
[14] Heng Ji,et al. Visual Attention Model for Name Tagging in Multimodal Social Media , 2018, ACL.
[15] Fabio A. González,et al. Gated Multimodal Units for Information Fusion , 2017, ICLR.
[16] Trevor Darrell,et al. Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding , 2016, EMNLP.
[17] Xuanjing Huang,et al. Adaptive Co-attention Network for Named Entity Recognition in Tweets , 2018, AAAI.
[18] Timothy Baldwin,et al. Shared Tasks of the 2015 Workshop on Noisy User-generated Text: Twitter Lexical Normalization and Named Entity Recognition , 2015, NUT@IJCNLP.
[19] Yin Li,et al. Learning Deep Structure-Preserving Image-Text Embeddings , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[20] Ignazio Gallo,et al. Revisiting Cross Modal Retrieval , 2018, ArXiv.
[21] Andrew McCallum,et al. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.
[22] Guillaume Lample,et al. Neural Architectures for Named Entity Recognition , 2016, NAACL.
[23] Yoshua Bengio,et al. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention , 2015, ICML.