See, Hear, Read: Leveraging Multimodality with Guided Attention for Abstractive Text Summarization
[1] Yanqing He,et al. Biomedical-domain pre-trained language model for extractive summarization , 2020, Knowl. Based Syst.
[2] Tanmoy Chakraborty,et al. Exercise? I thought you said 'Extra Fries': Leveraging Sentence Demarcations and Multi-hop Attention for Meme Affect Analysis , 2021, ICWSM.
[3] Mirella Lapata,et al. Ranking Sentences for Extractive Summarization with Reinforcement Learning , 2018, NAACL.
[4] Louis-Philippe Morency,et al. Computational Analysis of Persuasiveness in Social Multimedia: A Novel Dataset and Multimodal Prediction Approach , 2014, ICMI.
[5] Mirella Lapata,et al. Neural Summarization by Extracting Sentences and Words , 2016, ACL.
[6] Sheng Liu,et al. SibNet: Sibling Convolutional Encoder for Video Captioning , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[7] Fei Liu,et al. Adapting the Neural Encoder-Decoder Framework from Single to Multi-Document Summarization , 2018, EMNLP.
[8] Louis-Philippe Morency,et al. Factorized Multimodal Transformer for Multimodal Sequential Learning , 2019, ArXiv.
[9] Tiejun Zhao,et al. Neural Document Summarization by Jointly Learning to Score and Select Sentences , 2018, ACL.
[10] Esa Rahtu,et al. Multi-modal Dense Video Captioning , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).
[11] Bohyung Han,et al. Text-Guided Attention Model for Image Captioning , 2016, AAAI.
[12] Yutaka Satoh,et al. Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet? , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[13] Chenliang Xu,et al. Towards Automatic Learning of Procedures From Web Instructional Videos , 2017, AAAI.
[14] Jindrich Libovický,et al. Attention Strategies for Multi-Source Sequence-to-Sequence Learning , 2017, ACL.
[15] Haoran Li,et al. Multi-modal Sentence Summarization with Modality Attention and Image Filtering , 2018, IJCAI.
[16] Haoran Li,et al. Multi-modal Summarization for Asynchronous Collection of Text, Image, Audio and Video , 2017, EMNLP.
[17] Tanmoy Chakraborty,et al. Neural Abstractive Summarization with Structural Attention , 2020, IJCAI.
[18] Hai Zhuge,et al. Abstractive Text-Image Summarization Using Multi-Modal Attentional Hierarchical RNN , 2018, EMNLP.
[19] Lin Zhao,et al. Structure-Infused Copy Mechanisms for Abstractive Summarization , 2018, COLING.
[20] Phil Blunsom,et al. Language as a Latent Variable: Discrete Generative Models for Sentence Compression , 2016, EMNLP.
[21] Jianfei Cai,et al. Video Captioning with Boundary-aware Hierarchical Language Decoding and Joint Video Prediction , 2018, Neurocomputing.
[22] Bowen Zhou,et al. SummaRuNNer: A Recurrent Neural Network Based Sequence Model for Extractive Summarization of Documents , 2016, AAAI.
[23] Chin-Yew Lin,et al. ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.
[24] Jason Weston,et al. A Neural Attention Model for Abstractive Sentence Summarization , 2015, EMNLP.
[25] Sen Wang,et al. Multimodal sentiment analysis with word-level fusion and reinforcement learning , 2017, ICMI.
[26] Phil Blunsom,et al. Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics , 2009.
[27] Lucy Vanderwende,et al. Exploring Content Models for Multi-Document Summarization , 2009, NAACL.
[28] Yongbin Liu,et al. Monotonic alignments for summarization , 2020, Knowl. Based Syst.
[29] Florian Metze,et al. How2: A Large-scale Dataset for Multimodal Language Understanding , 2018, NIPS 2018.
[30] Dragomir R. Radev,et al. LexRank: Graph-based Lexical Centrality as Salience in Text Summarization , 2004, J. Artif. Intell. Res.
[31] Marcus Rohrbach,et al. A Dataset for Telling the Stories of Social Media Videos , 2018, EMNLP.
[32] Lukasz Kaiser,et al. Attention Is All You Need , 2017, NIPS.
[33] Yi Yu,et al. Leveraging multimodal information for event summarization and concept-level sentiment analysis , 2016, Knowl. Based Syst.
[34] Erich Elsen,et al. Deep Speech: Scaling up end-to-end speech recognition , 2014, ArXiv.
[35] Jiajun Zhang,et al. Read, Watch, Listen, and Summarize: Multi-Modal Summarization for Asynchronous Text, Image, Audio and Video , 2019, IEEE Transactions on Knowledge and Data Engineering.
[36] Carlos Busso,et al. IEMOCAP: interactive emotional dyadic motion capture database , 2008, Lang. Resour. Evaluation.
[37] Dragomir R. Radev,et al. Multi-News: A Large-Scale Multi-Document Summarization Dataset and Abstractive Hierarchical Model , 2019, ACL.
[38] Alec Radford,et al. Improving Language Understanding by Generative Pre-Training , 2018.
[39] Tanmoy Chakraborty,et al. Neural Abstractive Summarization with Structural Attention , 2020, IJCAI.
[40] Rada Mihalcea,et al. TextRank: Bringing Order into Text , 2004, EMNLP.
[41] Meng Wang,et al. Event Driven Web Video Summarization by Tag Localization and Key-Shot Identification , 2012, IEEE Transactions on Multimedia.
[42] Tanmoy Chakraborty,et al. Detecting Harmful Memes and Their Targets , 2021, FINDINGS.
[43] Dejun Mu,et al. Word-sentence co-ranking for automatic extractive text summarization , 2017, Expert Syst. Appl.
[44] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.
[45] Florian Metze,et al. Multimodal Abstractive Summarization for How2 Videos , 2019, ACL.
[46] Yvette Graham,et al. Re-evaluating Automatic Summarization with BLEU and 192 Shades of ROUGE , 2015, EMNLP.
[47] Alexander M. Rush,et al. Abstractive Sentence Summarization with Attentive Recurrent Neural Networks , 2016, NAACL.
[48] Verónica Pérez-Rosas,et al. Towards Multimodal Sarcasm Detection (An _Obviously_ Perfect Paper) , 2019, ACL.
[49] Mor Naaman,et al. Newsroom: A Dataset of 1.3 Million Summaries with Diverse Extractive Strategies , 2018, NAACL.
[50] Douwe Kiela,et al. Deep embodiment: grounding semantics in perceptual modalities , 2017.
[51] Jing Liu,et al. Multimedia News Summarization in Search , 2016, ACM Trans. Intell. Syst. Technol.
[52] Christopher D. Manning,et al. Get To The Point: Summarization with Pointer-Generator Networks , 2017, ACL.
[53] Yu Zhou,et al. MSMO: Multimodal Summarization with Multimodal Output , 2018, EMNLP.
[54] Marco Baroni,et al. Grounding Distributional Semantics in the Visual World , 2016, Lang. Linguistics Compass.
[55] David R. Traum,et al. "Yeah Right": Sarcasm Recognition for Spoken Dialogue Systems , 2006, INTERSPEECH.
[56] Alexander M. Rush,et al. Bottom-Up Abstractive Summarization , 2018, EMNLP.
[57] Bowen Zhou,et al. Abstractive Text Summarization using Sequence-to-sequence RNNs and Beyond , 2016, CoNLL.
[58] Ruslan Salakhutdinov,et al. Multimodal Transformer for Unaligned Multimodal Language Sequences , 2019, ACL.
[59] Bing Liu,et al. Remote sensing image captioning via Variational Autoencoder and Reinforcement Learning , 2020, Knowl. Based Syst.
[60] Juan Carlos Niebles,et al. Title Generation for User Generated Videos , 2016, ECCV.