Visual Summary Identification From Scientific Publications via Self-Supervised Learning

The exponential growth of scientific literature yields the need to support users to both effectively and efficiently analyze and understand the some body of research work. This exploratory process can be facilitated by providing graphical abstracts–a visual summary of a scientific publication. Accordingly, previous work recently presented an initial study on automatic identification of a central figure in a scientific publication, to be used as the publication’s visual summary. This study, however, have been limited only to a single (biomedical) domain. This is primarily because the current state-of-the-art relies on supervised machine learning, typically relying on the existence of large amounts of labeled data: the only existing annotated data set until now covered only the biomedical publications. In this work, we build a novel benchmark data set for visual summary identification from scientific publications, which consists of papers presented at conferences from several areas of computer science. We couple this contribution with a new self-supervised learning approach to learn a heuristic matching of in-text references to figures with figure captions. Our self-supervised pre-training, executed on a large unlabeled collection of publications, attenuates the need for large annotated data sets for visual summary identification and facilitates domain transfer for this task. We evaluate our self-supervised pretraining for visual summary identification on both the existing biomedical and our newly presented computer science data set. The experimental results suggest that the proposed method is able to outperform the previous state-of-the-art without any task-specific annotations.

[1]  ChengXiang Zhai,et al.  Figure Retrieval from Collections of Research Articles , 2019, ECIR.

[2]  Jacques Wainer,et al.  Relationship between high-quality journals and conferences in computer vision , 2012, Scientometrics.

[3]  Hong Yu,et al.  Automatic Figure Ranking and User Interfacing for Intelligent Figure Search , 2010, PloS one.

[4]  E. Lerma,et al.  A Picture Is Worth a Thousand Views: A Triple Crossover Trial of Visual Abstracts to Examine Their Impact on Research Dissemination , 2020, Journal of medical Internet research.

[5]  Jevin D. West,et al.  Identifying the Central Figure of a Scientific Paper , 2019, 2019 International Conference on Document Analysis and Recognition (ICDAR).

[6]  Nir Ailon,et al.  Deep Metric Learning Using Triplet Network , 2014, SIMBAD.

[7]  Iryna Gurevych,et al.  Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks , 2019, EMNLP.

[8]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[9]  Christopher D. Manning,et al.  Get To The Point: Summarization with Pointer-Generator Networks , 2017, ACL.

[10]  Nazli Goharian,et al.  Scientific Article Summarization Using Citation-Context and Article’s Discourse Structure , 2015, EMNLP.

[11]  Yu Zhou,et al.  MSMO: Multimodal Summarization with Multimodal Output , 2018, EMNLP.

[12]  Tiejun Zhao,et al.  Attention-Fused Deep Matching Network for Natural Language Inference , 2018, IJCAI.

[13]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[14]  V. S. Reed,et al.  Pictorial superiority effect. , 1976, Journal of experimental psychology. Human learning and memory.

[15]  Omer Levy,et al.  RoBERTa: A Robustly Optimized BERT Pretraining Approach , 2019, ArXiv.

[16]  Pengfei Liu,et al.  Extractive Summarization as Text Matching , 2020, ACL.

[17]  Zhi-Hua Zhou,et al.  Learning to Generate Posters of Scientific Papers , 2016, AAAI.

[18]  Jiacheng Xu,et al.  Neural Extractive Text Summarization with Syntactic Compression , 2019, EMNLP.

[19]  Lutz Bornmann,et al.  Growth rates of modern science: A bibliometric analysis based on the number of publications and cited references , 2014, J. Assoc. Inf. Sci. Technol..

[20]  Jungo Kasai,et al.  ScisummNet: A Large Annotated Corpus and Content-Impact Models for Scientific Paper Summarization with Citation Networks , 2019, AAAI.

[21]  Zhiguo Wang,et al.  Bilateral Multi-Perspective Matching for Natural Language Sentences , 2017, IJCAI.

[22]  Lu Wang,et al.  Argument Mining for Understanding Peer Reviews , 2019, NAACL.

[23]  Hong Yu,et al.  Learning to Rank Figures within a Biomedical Article , 2014, PloS one.

[24]  Bowen Zhou,et al.  SummaRuNNer: A Recurrent Neural Network Based Sequence Model for Extractive Summarization of Documents , 2016, AAAI.

[25]  Jevin D. West,et al.  Viziometrics: Analyzing Visual Information in the Scientific Literature , 2016, IEEE Transactions on Big Data.

[26]  Anna Rumshisky,et al.  Revealing the Dark Secrets of BERT , 2019, EMNLP.

[27]  Franck Dernoncourt,et al.  A Discourse-Aware Attention Model for Abstractive Summarization of Long Documents , 2018, NAACL.

[28]  Yu Zhou,et al.  Multimodal Summarization with Guidance of Multimodal Reference , 2020, AAAI.

[29]  Yufang Hou,et al.  D2S: Document-to-Slide Generation Via Query-Based Text Summarization , 2021, NAACL.

[30]  Waleed Ammar,et al.  Extracting Scientific Figures with Distantly Supervised Neural Networks , 2018, JCDL.

[31]  EunKyung Chung,et al.  An investigation on Graphical Abstracts use in scholarly articles , 2017, Int. J. Inf. Manag..

[32]  Richard Socher,et al.  Neural Text Summarization: A Critical Evaluation , 2019, EMNLP.

[33]  Barbara Plank,et al.  Neural Unsupervised Domain Adaptation in NLP—A Survey , 2020, COLING.

[34]  Dragomir R. Radev,et al.  Scientific Paper Summarization Using Citation Summary Networks , 2008, COLING.

[35]  Simone Paolo Ponzetto,et al.  Self-Supervised Learning for Visual Summary Identification in Scientific Publications , 2020, ArXiv.

[36]  ChengXiang Zhai,et al.  A Study of Distributed Representations for Figures of Research Articles , 2021, ECIR.

[37]  Goran Glavas,et al.  University of Mannheim @ CLSciSumm-17: Citation-Based Summarization of Scientific Articles Using Semantic Textual Similarity , 2017, BIRNDL@SIGIR.

[38]  Mirella Lapata,et al.  Neural Summarization by Extracting Sentences and Words , 2016, ACL.

[39]  Christopher Potts,et al.  A large annotated corpus for learning natural language inference , 2015, EMNLP.

[40]  Goran Glavas,et al.  Investigating the Role of Argumentation in the Rhetorical Analysis of Scientific Publications with Neural Multi-Task Learning Models , 2018, EMNLP.

[41]  ChengXiang Zhai,et al.  Generating Impact-Based Summaries for Scientific Literature , 2008, ACL.

[42]  Thomas Wolf,et al.  HuggingFace's Transformers: State-of-the-art Natural Language Processing , 2019, ArXiv.

[43]  Benjamin Bach,et al.  Picturing Science: Design Patterns in Graphical Abstracts , 2018, Diagrams.

[44]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[45]  Iz Beltagy,et al.  SciBERT: A Pretrained Language Model for Scientific Text , 2019, EMNLP.

[46]  Jinan Xu,et al.  Original Semantics-Oriented Attention and Deep Fusion Network for Sentence Matching , 2019, EMNLP/IJCNLP.