PEAK: Pyramid Evaluation via Automated Knowledge Extraction

Evaluating the selection of content in a summary is important both for human-written summaries, which can be a useful pedagogical tool for reading and writing skills, and for machine-generated summaries, which are increasingly being deployed in information management. The pyramid method assesses a summary by aggregating content units from the summaries of a wise crowd (a form of crowdsourcing). It has proven highly reliable but has largely depended on manual annotation. We propose PEAK, the first method that both generates the pyramid content models automatically and uses them to assess summary content with the pyramid method. PEAK relies on open information extraction and graph algorithms. The resulting scores correlate well with manually derived pyramid scores on both human and machine summaries, opening up the possibility of widespread use in numerous applications.
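
To make the aggregation step concrete, the following is a minimal sketch of conventional pyramid scoring, not PEAK's implementation. It assumes content units have already been identified and are given as plain string labels (PEAK derives them with open information extraction, which is not shown here). The function names `build_pyramid` and `pyramid_score` are hypothetical. Each unit is weighted by the number of reference summaries that express it, and a summary's score is its total weight divided by the best total achievable with the same number of units.

```python
from collections import Counter


def build_pyramid(reference_summaries):
    """Weight each content unit by how many reference summaries express it."""
    weights = Counter()
    for units in reference_summaries:
        # set() so a unit repeated within one summary counts only once
        weights.update(set(units))
    return weights


def pyramid_score(summary_units, weights):
    """Observed weight divided by the maximum weight attainable
    with the same number of content units (assumed normalization)."""
    units = set(summary_units)
    observed = sum(weights.get(u, 0) for u in units)
    # Ideal summary of the same size: take the highest-weighted units.
    ideal = sum(sorted(weights.values(), reverse=True)[:len(units)])
    return observed / ideal if ideal else 0.0


# Toy data: unit labels are illustrative placeholders, not real annotations.
references = [["a", "b", "c"], ["a", "b"], ["a", "d"]]
weights = build_pyramid(references)
score = pyramid_score(["a", "c"], weights)
```

In this toy case unit `a` appears in all three references and `c` in one, so a summary containing both scores below a summary containing the two most widely shared units, `a` and `b`.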
