Automatic Summarization for Student Reflective Responses

Educational research has demonstrated that asking students to respond to reflection prompts can improve both teaching and learning. However, summarizing student responses to these prompts is an onerous task for humans and poses challenges for existing summarization methods. From the input perspective, there are three challenges. First, there is a lexical variety problem due to the fact that different students tend to use different expressions. Second, there is a length variety problem that student inputs range from single words to multiple sentences. Third, there is a redundancy issue since some content among student responses are not useful. From the output perspective, there are two additional challenges. First, the human summaries consist of a list of important phrases instead of sentences. Second, from an instructor's perspective, the number of students who have a particular problem or are interested in a particular topic is valuable. The goal of this research is to enhance student response summarization at multiple levels of granularity. At the sentence level, we propose a novel summarization algorithm by extending traditional ILP-based framework with a low-rank matrix approximation to address the challenge of lexical variety. At the phrase level, we propose a phrase summarization framework by a combination of phrase extraction, phrase clustering, and phrase ranking. Experimental results show the effectiveness on multiple student response data sets. Also at the phrase level, we propose a quantitative phrase summarization algorithm in order to estimate the number of students who semantically mention the phrases in a summary. We first introduce a new phrase-based highlighting scheme for automatic summarization. It highlights the phrases in the human summaries and also the corresponding semantically-equivalent phrases in student responses. Enabled by the highlighting scheme, we improve the previous phrase-based summarization framework by developing a supervised candidate phrase extraction, learning to estimate the phrase similarities, and experimenting with different clustering algorithms to group phrases into clusters. Experimental results show that our proposed methods not only yield better summarization performance evaluated using ROUGE, but also produce summaries that capture the pressing student needs.

[1]  Yang Liu,et al.  Using Supervised Bigram-based ILP for Extractive Summarization , 2013, ACL.

[2]  Yang Liu,et al.  Using Relevant Public Posts to Enhance News Article Summarization , 2016, COLING.

[3]  Brian D. Steele The One-Minute Paper , 1995 .

[4]  Fei Liu,et al.  An Improved Phrase-based Approach to Annotating and Summarizing Student Course Responses , 2016, COLING.

[5]  Wenting Xiong Helpfulness-Guided Review Summarization , 2013, HLT-NAACL.

[6]  Fei Liu,et al.  Automatic Summarization of Student Course Feedback , 2016, HLT-NAACL.

[7]  Stephen G. Pulman,et al.  What is my essay really saying? Using extractive summarization to motivate reflection and redrafting , 2013, AIED Workshops.

[8]  Roi Blanco,et al.  Online News Tracking for Ad-Hoc Information Needs , 2015, ICTIR.

[9]  Evangelos E. Milios,et al.  World Wide Web site summarization , 2004, Web Intell. Agent Syst..

[10]  Ani Nenkova,et al.  Automatic Summarization , 2011, ACL.

[11]  Chun Chen,et al.  Document Summarization Based on Data Reconstruction , 2012, AAAI.

[12]  André F. T. Martins,et al.  Fast and Robust Compressive Summarization with Dual Decomposition and Multi-Task Learning , 2013, ACL.

[13]  Vincent Aleven,et al.  An effective metacognitive strategy: learning by doing and explaining with a computer-based Cognitive Tutor , 2002, Cogn. Sci..

[14]  Hans van Halteren,et al.  Evaluating Information Content by Factoid Analysis: Human annotation and stability , 2004, EMNLP.

[15]  Ronan Collobert Deep Learning for Ecient Discriminative Parsing , 2011 .

[16]  Michalis Vazirgiannis,et al.  GrammAds: Keyword and Ad Creative Generator for Online Advertising Campaigns , 2013 .

[17]  Mirella Lapata,et al.  Multiple Aspect Summarization Using Integer Linear Programming , 2012, EMNLP.

[18]  Ani Nenkova,et al.  Evaluating Content Selection in Summarization: The Pyramid Method , 2004, NAACL.

[19]  Jugal K. Kalita,et al.  Summarization of Historical Articles Using Temporal Event Clustering , 2012, NAACL.

[20]  Ani Nenkova,et al.  Beyond SumBasic: Task-focused summarization with sentence simplification and lexical expansion , 2007, Information Processing & Management.

[21]  Alexander M. Rush,et al.  Abstractive Sentence Summarization with Attentive Recurrent Neural Networks , 2016, NAACL.

[22]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[23]  Giuseppe Carenini,et al.  Abstractive Meeting Summarization with Entailment and Fusion , 2013, ENLG.

[24]  Feifan Liu,et al.  Exploring Correlation Between ROUGE and Human Evaluation on Meeting Summaries , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[25]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[26]  Jason Weston,et al.  A Neural Attention Model for Abstractive Sentence Summarization , 2015, EMNLP.

[27]  Matt Jones,et al.  Using keyphrases as search result surrogates on small screen devices , 2004, Personal and Ubiquitous Computing.

[28]  Vasile Rus,et al.  SEMILAR: The Semantic Similarity Toolkit , 2013, ACL.

[29]  Yoshihiro Ueda,et al.  Toward the "At-a-glance" Summary: Phrase-representation Summarization Method , 2000, COLING.

[30]  Koji Yatani,et al.  Review spotlight: a user interface for summarizing user-generated reviews using adjective-noun word pairs , 2011, CHI.

[31]  Bing Liu,et al.  Opinion spam and analysis , 2008, WSDM '08.

[32]  Tao Li,et al.  A Participant-based Approach for Event Summarization Using Twitter Streams , 2013, NAACL.

[33]  D. Boud,et al.  Reflection, turning experience into learning , 1985 .

[34]  Mirella Lapata,et al.  Movie Script Summarization as Graph-based Scene Extraction , 2015, NAACL.

[35]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .

[36]  Using Word Clouds for Fast, Formative Assessment of Students’ Short Written Responses , 2014 .

[37]  Hans Peter Luhn,et al.  The Automatic Creation of Literature Abstracts , 1958, IBM J. Res. Dev..

[38]  Marc Moens,et al.  Articles Summarizing Scientific Articles: Experiments with Relevance and Rhetorical Status , 2002, CL.

[39]  Sujian Li,et al.  Query-focused Multi-Document Summarization: Combining a Topic Model with Graph-based Semi-supervised Learning , 2012, COLING.

[40]  Manabu Okumura,et al.  Towards Multi-paper Summarization Using Reference Information , 1999, IJCAI.

[41]  Jingtao Wang,et al.  CourseMIRROR: Enhancing Large Classroom Instructor-Student Interactions via Mobile Interfaces and Natural Language Processing , 2015, CHI Extended Abstracts.

[42]  Ian H. Witten,et al.  Human-competitive tagging using automatic keyphrase extraction , 2009, EMNLP.

[43]  Andreas Paepcke,et al.  Seeing the whole in parts: text summarization for web browsing on handheld devices , 2001, WWW '01.

[44]  Inderjeet Mani,et al.  SUMMAC: a text summarization evaluation , 2002, Natural Language Engineering.

[45]  Noah A. Smith,et al.  Toward Abstractive Summarization Using Semantic Representations , 2018, NAACL.

[46]  Noah A. Smith,et al.  Summarization with a Joint Model for Sentence Extraction and Compression , 2009, ILP 2009.

[47]  William M. Campbell,et al.  Content+Context=Classification: Examining the Roles of Social Interactions and Linguist Content in Twitter User Classification , 2014, SocialNLP@COLING.

[48]  Elena L. Glassman,et al.  Mudslide: A Spatially Anchored Census of Student Confusion for Online Lecture Videos , 2015, CHI.

[49]  Bowen Zhou,et al.  Sequence-to-Sequence RNNs for Text Summarization , 2016, ArXiv.

[50]  Naoaki Okazaki,et al.  Neural Headline Generation on Abstract Meaning Representation , 2016, EMNLP.

[51]  Yiannis Kompatsiaris,et al.  Visual Event Summarization on Social Media using Topic Modelling and Graph-based Ranking Algorithms , 2015, ICMR.

[52]  Dan Klein,et al.  Learning-Based Single-Document Summarization with Compression and Anaphoricity Constraints , 2016, ACL.

[53]  Vasile Rus,et al.  Latent Semantic Analysis Models on Wikipedia and TASA , 2014, LREC.

[54]  Fei Liu,et al.  Document Summarization via Guided Sentence Compression , 2013, EMNLP.

[55]  Yejin Choi,et al.  Globally Coherent Text Generation with Neural Checklist Models , 2016, EMNLP.

[56]  François Yvon,et al.  Practical Very Large Scale CRFs , 2010, ACL.

[57]  Benoit Favre,et al.  A Scalable Global Model for Summarization , 2009, ILP 2009.

[58]  David Evans,et al.  Tracking and summarizing news on a daily basis with Columbia's Newsblaster , 2002 .

[59]  Yvette Graham,et al.  Re-evaluating Automatic Summarization with BLEU and 192 Shades of ROUGE , 2015, EMNLP.

[60]  Robert Tibshirani,et al.  Spectral Regularization Algorithms for Learning Large Incomplete Matrices , 2010, J. Mach. Learn. Res..

[61]  Jade Goldstein-Stewart,et al.  The Use of MMR, Diversity-Based Reranking for Reordering Documents and Producing Summaries , 1998, SIGIR Forum.

[62]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[63]  Hugh E. Williams,et al.  Fast generation of result snippets in web search , 2007, SIGIR.

[64]  Ming Zhou,et al.  A Redundancy-Aware Sentence Regression Framework for Extractive Summarization , 2016, COLING.

[65]  Ion Androutsopoulos,et al.  Extractive Multi-Document Summarization with Integer Linear Programming and Support Vector Regression , 2012, COLING.

[66]  Dragomir R. Radev,et al.  NewsInEssence: summarizing online news topics , 2005, Commun. ACM.

[67]  Diane J. Litman,et al.  Empirical analysis of exploiting review helpfulness for extractive summarization of online reviews , 2014, COLING.

[68]  Shibamouli Lahiri,et al.  Building a Dataset for Summarization and Keyword Extraction from Emails , 2014, LREC.

[69]  Xiaojun Wan,et al.  Multi-document summarization using cluster-based link analysis , 2008, SIGIR '08.

[70]  Omer Levy,et al.  word2vec Explained: deriving Mikolov et al.'s negative-sampling word-embedding method , 2014, ArXiv.

[71]  Dragomir R. Radev,et al.  LexRank: Graph-based Lexical Centrality as Salience in Text Summarization , 2004, J. Artif. Intell. Res..

[72]  Dilek Z. Hakkani-Tür,et al.  The ICSI/UTD Summarization System at TAC 2009 , 2009, TAC.

[73]  Dragomir R. Radev,et al.  Learning From Collective Human Behavior to Introduce Diversity in Lexical Choice , 2011, ACL.

[74]  Hai Zhuge,et al.  Abstractive News Summarization based on Event Semantic Link Network , 2016, COLING.

[75]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[76]  Michalis Vazirgiannis,et al.  Clustering and Community Detection in Directed Networks: A Survey , 2013, ArXiv.

[77]  Hui Lin,et al.  Multi-document Summarization via Budgeted Maximization of Submodular Functions , 2010, NAACL.

[78]  David Jurgens,et al.  Word Sense Induction by Community Detection , 2011, Graph-based Methods for Natural Language Processing.

[79]  Hui Lin,et al.  A Repository of State of the Art and Competitive Baseline Summaries for Generic News Summarization , 2014, LREC.

[80]  Simone Teufel,et al.  Examining the consensus between human summaries: initial experiments with factoid analysis , 2003, HLT-NAACL 2003.

[81]  Xiaojin Zhu,et al.  Improving Diversity in Ranking using Absorbing Random Walks , 2007, NAACL.

[82]  Min-Yen Kan Keywords, phrases, clauses and sentences: topicality, indicativeness and informativeness at scales , 2015, ACL 2015.

[83]  Dan Klein,et al.  Jointly Learning to Extract and Compress , 2011, ACL.

[84]  Xun Wang,et al.  Exploring Text Links for Coherent Multi-Document Summarization , 2016, COLING.

[85]  Vincent Ng,et al.  Automatic Keyphrase Extraction: A Survey of the State of the Art , 2014, ACL.

[86]  Dragomir R. Radev,et al.  Centroid-based summarization of multiple documents , 2004, Inf. Process. Manag..

[87]  Lu Wang,et al.  Neural Network-Based Abstract Generation for Opinions and Arguments , 2016, NAACL.