D2S: Document-to-Slide Generation Via Query-Based Text Summarization

Presentations are critical for communication in all areas of our lives, yet the creation of slide decks is often tedious and time-consuming. There has been limited research aiming to automate the document-to-slides generation process and all face a critical challenge: no publicly available dataset for training and benchmarking. In this work, we first contribute a new dataset, SciDuet, consisting of pairs of papers and their corresponding slides decks from recent years’ NLP and ML conferences (e.g., ACL). Secondly, we present D2S, a novel system that tackles the document-to-slides task with a two-step approach: 1) Use slide titles to retrieve relevant and engaging text, figures, and tables; 2) Summarize the retrieved context into bullet points with long-form question answering. Our evaluation suggests that long-form QA outperforms state-of-the-art summarization baselines on both automated ROUGE metrics and qualitative human evaluation.

[1]  Jeff Johnson,et al.  Billion-Scale Similarity Search with GPUs , 2017, IEEE Transactions on Big Data.

[2]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[3]  Ming-Wei Chang,et al.  Natural Questions: A Benchmark for Question Answering Research , 2019, TACL.

[4]  Mor Naaman,et al.  Newsroom: A Dataset of 1.3 Million Summaries with Diverse Extractive Strategies , 2018, NAACL.

[5]  Richard Socher,et al.  Neural Text Summarization: A Critical Evaluation , 2019, EMNLP.

[6]  Christopher D. Manning,et al.  Get To The Point: Summarization with Pointer-Generator Networks , 2017, ACL.

[7]  David Konopnicki,et al.  Unsupervised Query-Focused Multi-Document Summarization using the Cross Entropy Method , 2017, SIGIR.

[8]  Robert A. Bartsch,et al.  Effectiveness of PowerPoint presentations in lectures , 2003, Comput. Educ..

[9]  Mirella Lapata,et al.  Coarse-to-Fine Query Focused Multi-Document Summarization , 2020, EMNLP.

[10]  Michael Elhadad,et al.  Topic Concentration in Query Focused Summarization Datasets , 2016, AAAI.

[11]  Ming-Wei Chang,et al.  REALM: Retrieval-Augmented Language Model Pre-Training , 2020, ICML.

[12]  Mirella Lapata,et al.  Don’t Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization , 2018, EMNLP.

[13]  Xiaojun Wan,et al.  Phrase-Based Presentation Slides Generation for Academic Papers , 2017, AAAI.

[14]  Jian Wu,et al.  Automatic Slide Generation for Scientific Papers , 2019, SciKnow@K-CAP.

[15]  Alexander M. Rush,et al.  Bottom-Up Abstractive Summarization , 2018, EMNLP.

[16]  Yuanyuan Wang,et al.  A method for generating presentation slides based on expression styles using document structure , 2013, Int. J. Knowl. Web Intell..

[17]  Xiaojun Wan,et al.  PPSGen: Learning-Based Presentation Slides Generation for Academic Papers , 2015, IEEE Transactions on Knowledge and Data Engineering.

[18]  Sadao Kurohashi,et al.  Automatic Slide Generation Based on Discourse Structure Analysis , 2005, IJCNLP.

[19]  K. Muneeswaran,et al.  Automatic creation of well-organized slides from documents , 2016, 2016 International Conference on Wireless Communications, Signal Processing and Networking (WiSPNET).

[20]  Ming-Wei Chang,et al.  Well-Read Students Learn Better: On the Importance of Pre-training Compact Models , 2019 .

[21]  Jason Weston,et al.  ELI5: Long Form Question Answering , 2019, ACL.

[22]  Chetan J. Awati,et al.  Automatic era: Presentation slides from Academic paper , 2016, 2016 International Conference on Automatic Control and Dynamic Optimization Techniques (ICACDOT).

[23]  Danqi Chen,et al.  Dense Passage Retrieval for Open-Domain Question Answering , 2020, EMNLP.

[24]  Kory Wallace Mathewson,et al.  Automatically Generating Engaging Presentation Slide Decks , 2019, EvoMUSART.

[25]  Charles Jochim,et al.  Identification of Tasks, Datasets, Evaluation Metrics, and Numeric Scores for Scientific Leaderboards Construction , 2019, ACL.

[26]  Daniel S. Weld,et al.  TLDR: Extreme Summarization of Scientific Documents , 2020, FINDINGS.

[27]  David Konopnicki,et al.  A Summarization System for Scientific Documents , 2019, EMNLP.

[28]  Christopher Andreas Clark,et al.  PDFFigures 2.0: Mining figures from research papers , 2016, 2016 IEEE/ACM Joint Conference on Digital Libraries (JCDL).

[29]  S. Syamili,et al.  Presentation slides generation from scientific papers using support vector regression , 2017, 2017 International Conference on Inventive Communication and Computational Technologies (ICICCT).

[30]  Andy Field,et al.  Discovering statistics using SPSS: and sex and drugs and rock 'n' roll, 3rd Edition , 2009 .

[31]  Mitsuru Ishizuka,et al.  'Auto-Presentation': a multi-agent system for building automatic multi-modal presentation of a topic from World Wide Web information , 2005, IEEE/WIC/ACM International Conference on Intelligent Agent Technology.

[32]  Bowen Zhou,et al.  Abstractive Text Summarization using Sequence-to-sequence RNNs and Beyond , 2016, CoNLL.

[33]  Ming-Wei Chang,et al.  Latent Retrieval for Weakly Supervised Open Domain Question Answering , 2019, ACL.

[34]  Mirella Lapata,et al.  Text Summarization with Pretrained Encoders , 2019, EMNLP.

[35]  Sameer Singh,et al.  Beyond Accuracy: Behavioral Testing of NLP Models with CheckList , 2020, ACL.

[36]  Dakuo Wang How People Write Together Now: Exploring and Supporting Today's Computer-Supported Collaborative Writing , 2016, CSCW '16 Companion.

[37]  T. V. Geetha,et al.  Document Summarization and Information Extraction for Generation of Presentation Slides , 2009, 2009 International Conference on Advances in Recent Technologies in Communication and Computing.

[38]  Soya Park,et al.  How AI Developers Overcome Communication Challenges in a Multidisciplinary Team , 2021, Proc. ACM Hum. Comput. Interact..

[39]  Phil Blunsom,et al.  Teaching Machines to Read and Comprehend , 2015, NIPS.

[40]  Omer Levy,et al.  BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension , 2019, ACL.

[41]  Min-Yen Kan SlideSeer: a digital library of aligned document and presentation pairs , 2007, JCDL '07.

[42]  Shuai Ma,et al.  CASS: Towards Building a Social-Support Chatbot for Online Health Community , 2021, Proc. ACM Hum. Comput. Interact..