Improving single document summarization in a multi-document environment

Most automatic document summarization tools produce summaries from single or multiple document environments. Recent works have shown that there are possibilities to combine both systems: when summarising a single document, its related documents can be fou

[1]  Mark Sanderson,et al.  Do user preferences and evaluation measures line up? , 2010, SIGIR.

[2]  Vagelis Hristidis,et al.  Structure-based query-specific document summarization , 2005, CIKM '05.

[3]  Edward Gibson,et al.  Paragraph-, Word-, and Coherence-based Approaches to Sentence Ranking: A Comparison of Algorithm and Human Performance , 2004, ACL.

[4]  Vimla L. Patel,et al.  Usability evaluation of an experimental text summarization system and three search engines: implications for the reengineering of health care interfaces , 2002, AMIA.

[5]  Ingemar J. Cox,et al.  On Aggregating Labels from Multiple Crowd Workers to Infer Relevance of Documents , 2012, ECIR.

[6]  Bin Pang,et al.  Analysis of Automated Evaluation for Multi-document Summarization Using Content-Based Similarity , 2008, Second International Conference on the Digital Society.

[7]  Leila Kosseim,et al.  Summarizing Blog Entries versus News Texts , 2009 .

[8]  Dragomir R. Radev,et al.  LexRank: Graph-based Lexical Centrality as Salience in Text Summarization , 2004, J. Artif. Intell. Res..

[9]  Matthew Lease,et al.  Crowdsourcing Document Relevance Assessment with Mechanical Turk , 2010, Mturk@HLT-NAACL.

[10]  Paulo Cesar Fernandes de Oliveira,et al.  How to evaluate the 'goodness' of summaries automatically , 2005 .

[11]  Elena Lloret,et al.  Analyzing the capabilities of crowdsourcing services for text summarization , 2013, Lang. Resour. Evaluation.

[12]  Karel Jezek,et al.  Evaluation Measures for Text Summarization , 2012, Comput. Informatics.

[13]  Xiaohua Hu,et al.  Integrating biomedical literature clustering and summarization approaches using biomedical ontology , 2006, TMBIO '06.

[14]  Eduard Hovy,et al.  Automated Text Summarization in SUMMARIST , 1997, ACL 1997.

[15]  Karen Spärck Jones What Might be in a Summary? , 1993, Information Retrieval.

[16]  Armelle Brun,et al.  Comparisons Instead of Ratings: Towards More Stable Preferences , 2011, 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology.

[17]  Michele Banko,et al.  Headline Generation Based on Statistical Translation , 2000, ACL.

[18]  Sanda M. Harabagiu,et al.  Satisfying information needs with multi-document summaries , 2007, Inf. Process. Manag..

[19]  Maksims Volkovs,et al.  New learning methods for supervised and unsupervised preference aggregation , 2014, J. Mach. Learn. Res..

[20]  A. Viera,et al.  Understanding interobserver agreement: the kappa statistic. , 2005, Family medicine.

[21]  Inderjeet Mani,et al.  Machine Learning of Generic and User-Focused Summarization , 1998, AAAI/IAAI.

[22]  Inderjeet Mani,et al.  Summarizing Similarities and Differences Among Related Documents , 1997, Information Retrieval.

[23]  Dragomir R. Radev,et al.  Generating summaries of multiple news articles , 1995, SIGIR '95.

[24]  Steven K. Feiner,et al.  PERSIVAL, a system for personalized search and summarization over multimedia healthcare information , 2001, JCDL '01.

[25]  Hua Li,et al.  Improving web search results using affinity graph , 2005, SIGIR '05.

[26]  ChengXiang Zhai,et al.  When documents are very long, BM25 fails! , 2011, SIGIR.

[27]  Bart Thomee,et al.  Automatic selection of social media responses to news , 2013, KDD.

[28]  Chuleerat Jaruskulchai,et al.  Generic text summarization using local and global properties of sentences , 2003, Proceedings IEEE/WIC International Conference on Web Intelligence (WI 2003).

[29]  Task-Based Evaluation of Summary Quality: Describing Relationships between Scientific Papers , 2001 .

[30]  Eduard H. Hovy,et al.  Automatic Evaluation of Summaries Using N-gram Co-occurrence Statistics , 2003, NAACL.

[31]  Rasim M. Alguliyev,et al.  An unsupervised approach to generating generic summaries of documents , 2015, Appl. Soft Comput..

[32]  Peng Li,et al.  Joint topic modeling for event summarization across news and social media streams , 2012, CIKM.

[33]  Douglas W. Oard,et al.  Extrinsic Evaluation of Automatic Metrics for Summarization , 2004 .

[34]  Hinrich Schütze,et al.  Automatic generation of short informative sentiment summaries , 2012, EACL.

[35]  John Atkinson,et al.  Rhetorics-based multi-document summarization , 2013, Expert Syst. Appl..

[36]  Naomie Salim,et al.  Multi document summarization based on news components using fuzzy cross-document relations , 2014, Appl. Soft Comput..

[37]  Oren Kurland,et al.  From "Identical" to "Similar": Fusing Retrieved Lists Based on Inter-document Similarities , 2009, ICTIR.

[38]  George A. Vouros,et al.  Summarization system evaluation revisited: N-gram graphs , 2008, TSLP.

[39]  Ani Nenkova,et al.  Evaluating Content Selection in Summarization: The Pyramid Method , 2004, NAACL.

[40]  Inderjeet Mani,et al.  Multi-Document Summarization by Graph Search and Matching , 1997, AAAI/IAAI.

[41]  Yong Zhang,et al.  Improving Document Summarization by Incorporating Social Contextual Information , 2011, AIRS.

[42]  Ee-Peng Lim,et al.  Comments-oriented document summarization: understanding documents with readers' feedback , 2008, SIGIR '08.

[43]  Brendan T. O'Connor,et al.  Cheap and Fast – But is it Good? Evaluating Non-Expert Annotations for Natural Language Tasks , 2008, EMNLP.

[44]  H. P. Edmundson,et al.  New Methods in Automatic Extracting , 1969, JACM.

[45]  Antonio Zamora,et al.  Automatic Abstracting Research at Chemical Abstracts Service , 1975, J. Chem. Inf. Comput. Sci..

[46]  Hongyuan Zha,et al.  Generic summarization and keyphrase extraction using mutual reinforcement principle and sentence clustering , 2002, SIGIR '02.

[47]  Barry Smyth,et al.  From social bookmarking to social summarization: an experiment in community-based summary generation , 2007, IUI '07.

[48]  Hugo Larochelle,et al.  Learning to rank by aggregating expert preferences , 2012, CIKM.

[49]  Stephen E. Robertson,et al.  Experimentation as a way of life: Okapi at TREC , 2000, Inf. Process. Manag..

[50]  Xuan Li,et al.  Update Summarization via Graph-Based Sentence Ranking , 2013, IEEE Transactions on Knowledge and Data Engineering.

[51]  Lucia Specia,et al.  BLEU Deconstructed: Designing a Better MT Evaluation Metric , 2013, Int. J. Comput. Linguistics Appl..

[52]  Longfei Wu,et al.  Social Summarization via Automatically Discovered Social Context , 2011, IJCNLP.

[53]  Wauter Bosma Query-Based Summarization using Rhetorical Structure Theory , 2004, CLIN.

[54]  Omar Alonso,et al.  Crowdsourcing for relevance evaluation , 2008, SIGF.

[55]  Gustave J. Rath,et al.  The formation of abstracts by the selection of sentences , 1961 .

[56]  Xiaojun Wan,et al.  Exploiting neighborhood knowledge for single document summarization and keyphrase extraction , 2010, TOIS.

[57]  Francine Chen,et al.  A trainable document summarizer , 1995, SIGIR '95.

[58]  W. Bruce Croft,et al.  Evaluating answer passages using summarization measures , 2014, SIGIR.

[59]  Inderjeet Mani,et al.  The Tipster Summac Text Summarization Evaluation , 1999, EACL.

[60]  Gerard Salton,et al.  Automatic Text Structuring and Summarization , 1997, Inf. Process. Manag..

[61]  Simone Teufel,et al.  Whose Idea Was This, and Why Does it Matter? Attributing Scientific Work to Citations , 2007, HLT-NAACL.

[62]  P. Lachenbruch Statistical Power Analysis for the Behavioral Sciences (2nd ed.) , 1989 .

[63]  Ani Nenkova,et al.  The Pyramid Method: Incorporating human content selection variation in summarization evaluation , 2007, TSLP.

[64]  William C. Mann,et al.  Rhetorical Structure Theory: Toward a functional theory of text organization , 1988 .

[65]  Mark Sanderson,et al.  Tweet‐biased summarization , 2016, J. Assoc. Inf. Sci. Technol..

[66]  Carol L. Barry User-defined relevance criteria: an exploratory study , 1994 .

[67]  Javier Parapar,et al.  Blog snippets: a comments-biased approach , 2010, SIGIR '10.

[68]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[69]  Kathleen R. McKeown,et al.  Generating natural language summaries from multiple on-line sources , 1998 .

[70]  Hans Peter Luhn,et al.  The Automatic Creation of Literature Abstracts , 1958, IBM J. Res. Dev..

[71]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[72]  Chin-Yew Lin,et al.  Looking for a Few Good Metrics: ROUGE and its Evaluation , 2004 .

[73]  Mark T. Maybury,et al.  Automatic Summarization , 2002, Computational Linguistics.

[74]  George M. Kasper,et al.  The Effects and Limitations of Automated Text Condensing on Reading Comprehension Performance , 1992, Inf. Syst. Res..

[75]  Xin Liu,et al.  Generic text summarization using relevance measure and latent semantic analysis , 2001, SIGIR '01.

[76]  Anton Leuski,et al.  iNeATS: Interactive Multi-Document Summarization , 2003, ACL.

[77]  Alistair Moffat,et al.  Statistical power in retrieval experimentation , 2008, CIKM '08.

[78]  Mor Naaman,et al.  Finding and assessing social media information sources in the context of journalism , 2012, CHI.

[79]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[80]  Feifan Liu,et al.  Exploring Correlation Between ROUGE and Human Evaluation on Meeting Summaries , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[81]  K. Baker Condorcet. From natural philosophy to social mathematics , 1974, Medical History.

[82]  T. Tideman,et al.  Independence of clones as a criterion for voting rules , 1987 .

[83]  Ben Hachey Multi-Document Summarisation Using Generic Relation Extraction , 2009, EMNLP.

[84]  Soe-Tsyr Yuan,et al.  Ontology-based structured cosine similarity in document summarization: with applications to mobile audio-based knowledge management , 2005, IEEE Trans. Syst. Man Cybern. Part B.

[85]  Aniket Kittur,et al.  Crowdsourcing user studies with Mechanical Turk , 2008, CHI.

[86]  David R. Thomas,et al.  A General Inductive Approach for Analyzing Qualitative Evaluation Data , 2006 .

[87]  W. Bruce Croft,et al.  Indri: A language-model based search engine for complex queries1 , 2005 .

[88]  M. B. Chandak,et al.  Graph-Based Algorithms for Text Summarization , 2010, 2010 3rd International Conference on Emerging Trends in Engineering and Technology.

[89]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[90]  Daniel Marcu,et al.  From discourse structures to text summaries , 1997 .

[91]  Ani Nenkova,et al.  Can you summarize this? Identifying correlates of input difficulty for generic multi-document summarization , 2008, ACL 2008.

[92]  Jarkko Kari,et al.  User-defined relevance criteria in web searching , 2006, J. Documentation.

[93]  Girish Keshav Palshikar,et al.  Combining Summaries Using Unsupervised Rank Aggregation , 2012, CICLing.

[94]  Omar Alonso,et al.  Implementing crowdsourcing-based relevance experimentation: an industrial perspective , 2013, Information Retrieval.

[95]  Simone Teufel,et al.  Argumentative zoning information extraction from scientific text , 1999 .

[96]  Craig MacDonald,et al.  On choosing an effective automatic evaluation metric for microblog summarisation , 2014, IIiX.

[97]  T. Martin McGinnity,et al.  A Context-Based Word Indexing Model for Document Summarization , 2013, IEEE Transactions on Knowledge and Data Engineering.

[98]  Jade Goldstein-Stewart,et al.  The use of MMR, diversity-based reranking for reordering documents and producing summaries , 1998, SIGIR '98.

[99]  Min-Yen Kan,et al.  Using librarian techniques in automatic text summarization for information retrieval , 2002, JCDL '02.

[100]  Ping Chen,et al.  A Query-Based Medical Information Summarization System Using Ontology Knowledge , 2006, 19th IEEE Symposium on Computer-Based Medical Systems (CBMS'06).

[101]  Vagelis Hristidis,et al.  A system for query-specific document summarization , 2006, CIKM '06.

[102]  Vibhu O. Mittal,et al.  Ultra-summarization (poster abstract): a statistical approach to generating highly condensed non-extractive summaries , 1999, SIGIR '99.

[103]  Yihong Gong,et al.  Integrating Document Clustering and Multidocument Summarization , 2011, TKDD.

[104]  Katharina Reinecke,et al.  Crowdsourcing performance evaluations of user interfaces , 2013, CHI.

[105]  Sanghee Oh,et al.  Best-answer selection criteria in a social Q&A site from the user-oriented relevance perspective , 2008, ASIST.

[106]  Daniel Marcu The rhetorical parsing of natural language texts , 1997 .

[107]  Diane H. Sonnenwald,et al.  User perspectives on relevance criteria: A comparison among relevant, partially relevant, and not-relevant judgments , 2002, J. Assoc. Inf. Sci. Technol..

[108]  Kathleen F. McCoy,et al.  Efficient text summarization using lexical chains , 2000, IUI '00.

[109]  Panagiotis Stamatopoulos,et al.  Summarization from Medical Documents: A Survey , 2005, Artif. Intell. Medicine.

[110]  Eduard H. Hovy,et al.  Identifying Topics by Position , 1997, ANLP.

[111]  Stefanie Nowak,et al.  How reliable are annotations via crowdsourcing: a study about inter-annotator agreement for multi-label image annotation , 2010, MIR '10.

[112]  Rada Mihalcea,et al.  Graph-based Ranking Algorithms for Sentence Extraction, Applied to Text Summarization , 2004, ACL.

[113]  Shamima Mithun,et al.  Exploiting Rhetorical Relations in Blog Summarization , 2010, Canadian Conference on AI.

[114]  Lisa F. Rau,et al.  Automatic Condensation of Electronic Publications by Sentence Selection , 1995, Inf. Process. Manag..

[115]  G. Meade Building a Discourse-Tagged Corpus in the Framework of Rhetorical Structure Theory , 2001 .

[116]  Tao Tao,et al.  Language Model Information Retrieval with Document Expansion , 2006, NAACL.

[117]  Edgar Erdfelder,et al.  G*Power 3: A flexible statistical power analysis program for the social, behavioral, and biomedical sciences , 2007, Behavior research methods.

[118]  Sadaoki Furui,et al.  Sentence extraction-based presentation summarization techniques and evaluation metrics , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[119]  J. Kalita,et al.  Automatic Summarization of Twitter Topics , 2010 .

[120]  Xiaojun Wan,et al.  CollabSum: exploiting multiple document clustering for collaborative single document summarizations , 2007, SIGIR.

[121]  Jugal K. Kalita,et al.  Comparing Twitter Summarization Algorithms for Multiple Post Summaries , 2011, 2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing.

[122]  Dragomir R. Radev,et al.  Centroid-based summarization of multiple documents , 2004, Inf. Process. Manag..

[123]  Daniel Marcu,et al.  Bayesian Query-Focused Summarization , 2006, ACL.

[124]  Alan Ritter,et al.  Unsupervised Modeling of Twitter Conversations , 2010, NAACL.

[125]  T. Saracevic,et al.  Relevance: A review of the literature and a framework for thinking on the notion in information science. Part II: nature and manifestations of relevance , 2007, J. Assoc. Inf. Sci. Technol..

[126]  Ee-Peng Lim,et al.  Comments-oriented blog summarization by sentence extraction , 2007, CIKM '07.

[127]  Ron Artstein,et al.  Crowdsourcing micro-level multimedia annotations: the challenges of evaluation and interface , 2012, CrowdMM '12.

[128]  Dragomir R. Radev,et al.  Introduction to the Special Issue on Summarization , 2002, CL.

[129]  Kam-Fai Wong,et al.  Ranking model selection and fusion for effective microblog search , 2014, SoMeRA@SIGIR.

[130]  Elena Lloret,et al.  Text summarisation in progress: a literature review , 2011, Artificial Intelligence Review.

[131]  Yun Chi,et al.  Summarization System by Identifying Influential Blogs , 2007, ICWSM.

[132]  Pablo Gervás,et al.  A semantic graph-based approach to biomedical summarisation , 2011, Artif. Intell. Medicine.

[133]  Sang-goo Lee,et al.  Web content summarization using social bookmarks: a new approach for social summarization , 2008, WIDM '08.

[134]  Vasudeva Varma,et al.  Capturing Sentence Prior for Query-Based Multi-Document Summarization , 2007, RIAO.

[135]  Diego Molla-Aliod A Corpus for Evidence Based Medicine Summarisation , 2010 .

[136]  Dianne P. O'Leary,et al.  Text summarization via hidden Markov models , 2001, SIGIR '01.

[137]  Rada Mihalcea,et al.  TextRank: Bringing Order into Text , 2004, EMNLP.

[138]  Walid Magdy,et al.  Detecting Comments on News Articles in Microblogs , 2013, ICWSM.

[139]  Mary S. Neff,et al.  Multi-document Summarization by Visualizing Topical Content , 2000 .

[140]  Eduard Hovy,et al.  Automated multi-document summarization in NeATS , 2002 .

[141]  Juan-Zi Li,et al.  Social context summarization , 2011, SIGIR.

[142]  Jeffrey Nichols,et al.  Summarizing sporting events using twitter , 2012, IUI '12.

[143]  Marc Moens,et al.  Articles Summarizing Scientific Articles: Experiments with Relevance and Rhetorical Status , 2002, CL.

[144]  Xiaojun Wan,et al.  Improved Affinity Graph Based Multi-Document Summarization , 2006, NAACL.

[145]  Philipp Koehn,et al.  Re-evaluating the Role of Bleu in Machine Translation Research , 2006, EACL.

[146]  Christopher J. C. Burges,et al.  A machine learning approach for improved BM25 retrieval , 2009, CIKM.

[147]  Kathleen R. McKeown,et al.  Summarization Evaluation Methods: Experiments and Analysis , 1998 .

[148]  Sanda M. Harabagiu,et al.  Generating Single and Multi-Document Summaries with GIST EXTER , 2002 .

[149]  Mark Dredze,et al.  Annotating Named Entities in Twitter Data with Crowdsourcing , 2010, Mturk@HLT-NAACL.

[150]  Gleb Sizov,et al.  Extraction-Based Automatic Summarization: Theoretical and Empirical Investigation of Summarization Techniques , 2010 .

[151]  Karen Spärck Jones Automatic summarising: The state of the art , 2007, Inf. Process. Manag..

[152]  Karel Jezek,et al.  Two uses of anaphora resolution in summarization , 2007, Inf. Process. Manag..