Phrase-Based Presentation Slides Generation for Academic Papers

Automatic generation of presentation slides for academic papers is a very challenging task. Previous methods for addressing this task are mainly based on document summarization techniques and they extract document sentences to form presentation slides, which are not well-structured and concise. In this study, we propose a phrase-based approach to generate well-structured and concise presentation slides for academic papers. Our approach first extracts phrases from the given paper, and then learns both the saliency of each phrase and the hierarchical relationship between a pair of phrases. Finally a greedy algorithm is used to select and align the salient phrases in order to form the well-structured presentation slides. Evaluation results on a real dataset verify the efficacy of our proposed approach.

[1]  Eduard H. Hovy,et al.  Automatic Evaluation of Summaries Using N-gram Co-occurrence Statistics , 2003, NAACL.

[2]  Xiaojun Wan,et al.  PPSGen: Learning to Generate Presentation Slides for Academic Papers , 2013, IJCAI.

[3]  Roxana Girju,et al.  Investigating Automatic Alignment Methods for Slide Generation from Academic Papers , 2009, CoNLL.

[4]  Jihoon Yang,et al.  Extracting sentence segments for text summarization: a machine learning approach , 2000, SIGIR '00.

[5]  Xiaojun Wan,et al.  Phrase-based Compressive Cross-Language Summarization , 2015, EMNLP.

[6]  Jun-ichi Fukumoto,et al.  Automated Summarization Evaluation with Basic Elements. , 2006, LREC.

[7]  Karel Jezek,et al.  Text Summarization and Singular Value Decomposition , 2004, ADVIS.

[8]  Dragomir R. Radev,et al.  LexRank: Graph-based Lexical Centrality as Salience in Text Summarization , 2004, J. Artif. Intell. Res..

[9]  P. Sreenivasa Kumar,et al.  SlidesGen: Automatic Generation of Presentation Slides for a Technical Paper Using Summarization , 2009, FLAIRS Conference.

[10]  H. Nanba,et al.  Alignment between a technical paper and presentation sheets using a hidden Markov model , 2005, Proceedings of the 2005 International Conference on Active Media Technology, 2005. (AMT 2005)..

[11]  Katsumi Nitta,et al.  A Support System for Making Presentation Slides , 2003 .

[12]  Piji Li,et al.  Abstractive Multi-Document Summarization via Phrase Selection and Merging , 2015, ACL.

[13]  George Karypis,et al.  A Comparison of Document Clustering Techniques , 2000 .

[14]  Kôiti Hasida,et al.  Automatic Slide Presentation from Semantically Annotated Documents , 1999, COREF@ACL.

[15]  Ani Nenkova,et al.  The Impact of Frequency on Summarization , 2005 .

[16]  Mitsuru Ishizuka,et al.  Making Topic-Specific Report and Multimodal Presentation Automatically by Mining the Web Resources , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings)(WI'06).

[17]  Min-Yen Kan SlideSeer: a digital library of aligned document and presentation pairs , 2007, JCDL '07.

[18]  Mitsuru Ishizuka,et al.  'Auto-Presentation': a multi-agent system for building automatic multi-modal presentation of a topic from World Wide Web information , 2005, IEEE/WIC/ACM International Conference on Intelligent Agent Technology.

[19]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[20]  Sadao Kurohashi,et al.  Automatic Slide Generation Based on Discourse Structure Analysis , 2006 .

[21]  Hans Peter Luhn,et al.  The Automatic Creation of Literature Abstracts , 1958, IBM J. Res. Dev..

[22]  Zhi-Hua Zhou,et al.  Learning to Generate Posters of Scientific Papers , 2016, AAAI.