Information Preparation with the Human in the Loop

With the advent of the World Wide Web (WWW) and the rise of digital media consumption, abundant information is available nowadays for any topic. But these days users often suffer from information overload posing a great challenge for finding relevant and important information. To alleviate this information overload and provide significant value to the users, there is a need for automatic information preparation methods. Such methods need to support users by discovering and recommending important information while filtering redundant and irrelevant information. They need to ensure that the users do not drown in, but rather benefit from the prepared information. However, the definition of what is relevant and important is subjective and highly specific to the user’s information need and the task at hand. Therefore, a method must continually learn from the feedback of its users. In this thesis, we propose new approaches to put the human in the loop in order to interactively prepare information along the three major lines of research: information aggregation, condensation, and recommendation. For multiple well-studied tasks in natural language processing, we point out the limitation of existing methods and discuss how our approach can successfully close the gap to the human upper bound by considering user feedback and adapting to the user’s information need. We put a particular focus on applications in digital journalism and introduce the new task of live blog summarization. We show that the corpora we create for this task are highly heterogeneous as compared to the standard summarization datasets which pose new challenges to previously proposed non-interactive methods. One way to alleviate information overload is information aggregation. We focus on the corresponding task of multi-document summarization and argue that previously proposed methods are of limited usefulness in the real-world application as they do not take the users’ goal into account. To address these drawbacks, we propose an interactive summarization loop to iteratively create and refine multi-document summaries based on the users’ feedback. We investigate sampling strategies based on active machine learning and joint optimization to reduce the number of iterations and the amount of user feedback required. Our approach significantly improves the quality of the summaries and reaches a performance near the human upper bound. We present a system demonstration implementing the interactive summarization loop, study its scalability, and highlight its use cases in exploring document collections and creating focused summaries in journalism. For information condensation, we investigate a text compression setup. We address the problem of neural models requiring huge amounts of training data and propose a new interactive text compression method to reduce the need for large-scale annotated data. We employ state-of-the-art Seq2Seq text compression methods as our base models and propose an active learning setup with multiple sampling strategies to efficiently use minimal training data. We find that our method significantly reduces the amount of data needed to train and that it adapts well to new datasets and domains. We finally focus on information recommendation and discuss the need for explainable models in machine learning. We propose a new joint recommendation system of rating prediction and review summarization, which shows major improvements over state-of-the-art systems in both the rating prediction and the review summarization task. By solving this task jointly based on multi-task learning techniques, we furthermore obtain explanations for a rating by showing the generated review summary marked based on the model’s attention and a histogram of user preferences learned from the reviews of the users. We conclude the thesis with a summary of how human-in-the-loop approaches improve information preparation systems and envision the use of interactive machine learning methods also for other areas of natural language processing.

[1]  Odette Pollar,et al.  Surviving information overload how to find, filter, and focus on what's important , 2004 .

[2]  Lucy Vanderwende,et al.  Exploring Content Models for Multi-Document Summarization , 2009, NAACL.

[3]  Hiroya Takamura,et al.  Learning to generate summary as structured output , 2010, CIKM '10.

[4]  Ryan T. McDonald Discriminative Sentence Compression with Soft Syntactic Evidence , 2006, EACL.

[5]  Eduard Hovy,et al.  Manual and automatic evaluation of summaries , 2002, ACL 2002.

[6]  Iryna Gurevych,et al.  GermEval-2014: Nested Named Entity Recognition with Neural Networks , 2014 .

[7]  Ingrid Zukerman,et al.  Personalised rating prediction for new users using latent factor models , 2011, HT '11.

[8]  John Pavlopoulos,et al.  Improved Abusive Comment Moderation with User Embeddings , 2017, NLPmJ@EMNLP.

[9]  Inderjeet Mani,et al.  Multi-Document Summarization by Graph Search and Matching , 1997, AAAI/IAAI.

[10]  Christian Igel,et al.  Active learning with support vector machines , 2014, WIREs Data Mining Knowl. Discov..

[11]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[12]  Jiahui Liu,et al.  Personalized news recommendation based on click behavior , 2010, IUI '10.

[13]  Benoit Favre,et al.  A Scalable Global Model for Summarization , 2009, ILP 2009.

[14]  Daniel Marcu,et al.  Statistics-Based Summarization - Step One: Sentence Compression , 2000, AAAI/IAAI.

[15]  Sun Park,et al.  Automatic query-based personalized summarization that uses pseudo relevance feedback with NMF , 2010, ICUIMC '10.

[16]  H. Sebastian Seung,et al.  Algorithms for Non-negative Matrix Factorization , 2000, NIPS.

[17]  Andreas Hotho,et al.  Social Tagging Recommender Systems , 2011, Recommender Systems Handbook.

[18]  Juan-Manuel Torres-Moreno,et al.  A French Human Reference Corpus for Multi-Document Summarization and Sentence Compression , 2010, LREC.

[19]  Takehito Utsuro,et al.  A Web-based English Abstract Writing Tool Using a Tagged E-J Parallel Corpus , 2002, LREC.

[20]  Si Li,et al.  Guiding Generation for Abstractive Text Summarization Based on Key Information Guide Network , 2018, NAACL.

[21]  George Giannakopoulos,et al.  Multi-document multilingual summarization and evaluation tracks in ACL 2013 MultiLing Workshop , 2013 .

[22]  Eduard H. Hovy,et al.  Automated Text Summarization and the SUMMARIST System , 1998, TIPSTER.

[23]  Masaaki Nagata,et al.  Higher-Order Syntactic Attention Network for Longer Sentence Compression , 2018, NAACL-HLT.

[24]  Cane Wing-ki Leung,et al.  Integrating Collaborative Filtering and Sentiment Analysis: A Rating Inference Approach , 2006 .

[25]  Richard A. Harshman,et al.  Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..

[26]  William A. Gale,et al.  A sequential algorithm for training text classifiers , 1994, SIGIR '94.

[27]  Eugene Charniak,et al.  Supervised and Unsupervised Learning for Sentence Compression , 2005, ACL.

[28]  Markus Zopf,et al.  Estimating Summary Quality with Pairwise Preferences , 2018, NAACL.

[29]  Jimmy J. Lin,et al.  Overview of the TREC 2017 Real-Time Summarization Track , 2017, TREC.

[30]  Sergei Nirenburg,et al.  The Proper Place of Men and Machines in Language Translation , 2003 .

[31]  Yoon Kim,et al.  Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[32]  Lejian Liao,et al.  Can Syntax Help? Improving an LSTM-based Sentence Compression Model for New Domains , 2017, ACL.

[33]  Gordon V. Cormack,et al.  Email Spam Filtering: A Systematic Review , 2008, Found. Trends Inf. Retr..

[34]  Yong Yu,et al.  Enhancing diversity, coverage and balance for summarization through structure learning , 2009, WWW '09.

[35]  Kam-Fai Wong,et al.  Extractive Summarization Using Supervised and Semi-Supervised Learning , 2008, COLING.

[36]  Emilie M. Roth,et al.  Predicting Vulnerabilities in Computer-Supported Inferential Analysis under Data Overload , 2001, Cognition, Technology & Work.

[37]  Elena Lloret,et al.  Towards automatic tweet generation: A comparative study from the text summarization perspective in the journalism genre , 2013, Expert Syst. Appl..

[38]  Piji Li,et al.  Reader-Aware Multi-Document Summarization: An Enhanced Model and The First Dataset , 2017, NFiS@EMNLP.

[39]  Aishwarya Jadhav,et al.  Extractive Summarization with SWAP-NET: Sentences and Words from Alternating Pointer Networks , 2018, ACL.

[40]  Alok N. Choudhary,et al.  Voice of the Customers: Mining Online Customer Reviews for Product Feature-based Ranking , 2010, WOSN.

[41]  Tao Chen,et al.  TriRank: Review-aware Explainable Recommendation by Modeling Aspects , 2015, CIKM.

[42]  Xiaoyan Zhu,et al.  Product Review Summarization by Exploiting Phrase Properties , 2016, COLING.

[43]  Hoa Trang Dang,et al.  Overview of the TAC 2008 Update Summarization Task , 2008, TAC.

[44]  Minlie Huang,et al.  An Operation Network for Abstractive Sentence Compression , 2018, COLING.

[45]  Pablo Gervás,et al.  User-model based personalized summarization , 2007, Inf. Process. Manag..

[46]  W. Bruce Croft,et al.  Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , 2013 .

[47]  Dragomir R. Radev,et al.  The ACL Anthology Reference Corpus: A Reference Dataset for Bibliographic Research in Computational Linguistics , 2008, LREC.

[48]  Inderjeet Mani,et al.  Summarizing Similarities and Differences Among Related Documents , 1997, Information Retrieval.

[49]  Mirella Lapata,et al.  Sentence Compression as Tree Transduction , 2009, J. Artif. Intell. Res..

[50]  Philipp Koehn,et al.  Neural Interactive Translation Prediction , 2016, AMTA.

[51]  Marc'Aurelio Ranzato,et al.  Analyzing Uncertainty in Neural Machine Translation , 2018, ICML.

[52]  Yifan Hu,et al.  Collaborative Filtering for Implicit Feedback Datasets , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[53]  Xin Liu,et al.  Generic text summarization using relevance measure and latent semantic analysis , 2001, SIGIR '01.

[54]  Neil Thurman,et al.  LIVE BLOGGING–DIGITAL JOURNALISM’S PIVOTAL PLATFORM? , 2013 .

[55]  Ming Zhou,et al.  Ranking with Recursive Neural Networks and Its Application to Multi-Document Summarization , 2015, AAAI.

[56]  Xuanjing Huang,et al.  Adversarial Multi-task Learning for Text Classification , 2017, ACL.

[57]  Andrew McCallum,et al.  Reducing Labeling Effort for Structured Prediction Tasks , 2005, AAAI.

[58]  Yang Liu,et al.  Using Supervised Bigram-based ILP for Extractive Summarization , 2013, ACL.

[59]  Dong Wang,et al.  SocialFM: A Social Recommender System with Factorization Machines , 2016, WAIM.

[60]  Yuxiang Wu,et al.  Learning to Extract Coherent Summary via Deep Reinforcement Learning , 2018, AAAI.

[61]  Mirella Lapata,et al.  Neural Summarization by Extracting Sentences and Words , 2016, ACL.

[62]  Guokun Lai,et al.  Explicit factor models for explainable recommendation based on phrase-level sentiment analysis , 2014, SIGIR.

[63]  Udo Kruschwitz,et al.  MultiLing 2015: Multilingual Summarization of Single and Multi-Documents, On-line Fora, and Call-center Conversations , 2015, SIGDIAL Conference.

[64]  Jade Goldstein-Stewart,et al.  Creating and evaluating multi-document sentence extract summaries , 2000, CIKM '00.

[65]  N. Newman,et al.  The Future of Breaking News Online? , 2014 .

[66]  Thomas Arnold,et al.  Beyond Generic Summarization: A Multi-faceted Hierarchical Summarization Corpus of Large Heterogeneous Data , 2018, LREC.

[67]  J. Steinberger,et al.  Using Latent Semantic Analysis in Text Summarization and Summary Evaluation , 2004 .

[68]  George Kurian,et al.  Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation , 2016, ArXiv.

[69]  Jason Phang,et al.  Unsupervised Sentence Compression using Denoising Auto-Encoders , 2018, CoNLL.

[70]  Gustave J. Rath,et al.  The formation of abstracts by the selection of sentences , 1961 .

[71]  Elena Beisswanger,et al.  Active Learning-Based Corpus Annotation - The PathoJen Experience , 2012, AMIA.

[72]  Giuseppe Carenini,et al.  Summarizing email conversations with clue words , 2007, WWW '07.

[73]  Walter Bender,et al.  Network Plus , 1988, Photonics West - Lasers and Applications in Science and Engineering.

[74]  Gao Cong,et al.  SAR: A sentiment-aspect-region model for user preference analysis in geo-tagged reviews , 2015, 2015 IEEE 31st International Conference on Data Engineering.

[75]  Chenguang Wang,et al.  Active Learning for Black-Box Semantic Role Labeling with Neural Factors , 2017, IJCAI.

[76]  Marek Rei,et al.  Semi-supervised Multitask Learning for Sequence Labeling , 2017, ACL.

[77]  Hans Peter Luhn,et al.  The Automatic Creation of Literature Abstracts , 1958, IBM J. Res. Dev..

[78]  Markus Zopf,et al.  Auto-hMDS: Automatic Construction of a Large Heterogeneous Multilingual Multi-Document Summarization Corpus , 2018, LREC.

[79]  Maxine Eskénazi,et al.  Explainable Entity-based Recommendations with Knowledge Graphs , 2017, RecSys Posters.

[80]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[81]  Ani Nenkova,et al.  A compositional context sensitive multi-document summarizer: exploring the factors that influence summarization , 2006, SIGIR.

[82]  Mirella Lapata,et al.  Learning to Generate Product Reviews from Attributes , 2017, EACL.

[83]  Steven Bird,et al.  NLTK: The Natural Language Toolkit , 2002, ACL.

[84]  Xiaojun Wan,et al.  CTSUM: extracting more certain summaries for news articles , 2014, SIGIR.

[85]  Carsten Binnig,et al.  Sherlock: A System for Interactive Summarization of Large Text Collections , 2018, Proc. VLDB Endow..

[86]  Piji Li,et al.  Neural Rating Regression with Abstractive Tips Generation for Recommendation , 2017, SIGIR.

[87]  Piji Li,et al.  Persona-Aware Tips Generation? , 2019, WWW.

[88]  Iryna Gurevych,et al.  A Retrospective Analysis of the Fake News Challenge Stance-Detection Task , 2018, COLING.

[89]  Phil Blunsom,et al.  Teaching Machines to Read and Comprehend , 2015, NIPS.

[90]  Alexander M. Rush,et al.  Abstractive Sentence Summarization with Attentive Recurrent Neural Networks , 2016, NAACL.

[91]  Youngjoong Ko,et al.  An effective sentence-extraction technique using contextual information and statistical approaches for text summarization , 2008, Pattern Recognition Letters.

[92]  Frank Hopfgartner,et al.  The plista dataset , 2013, NRS '13.

[93]  Douglas B. Terry,et al.  Using collaborative filtering to weave an information tapestry , 1992, CACM.

[94]  Tatsunori Mori,et al.  Construction of Text Summarization Corpus for the Credibility of Information on the Web , 2010, LREC.

[95]  Kristina Toutanova,et al.  A Dataset and Evaluation Metrics for Abstractive Compression of Sentences and Short Paragraphs , 2016, EMNLP.

[96]  Simon McEnnis FOLLOWING THE ACTION , 2016 .

[97]  Jeffrey Chan,et al.  J3R: Joint Multi-task Learning of Ratings and Review Summaries for Explainable Recommendation , 2019, ECML/PKDD.

[98]  Tat-Seng Chua,et al.  Neural Factorization Machines for Sparse Predictive Analytics , 2017, SIGIR.

[99]  Iryna Gurevych,et al.  Finding Convincing Arguments Using Scalable Bayesian Preference Learning , 2018, TACL.

[100]  Patrick Gallinari,et al.  Extended Recommendation Framework: Generating the Text of a User Review as a Personalized Summary , 2015, CBRecSys@RecSys.

[101]  Min-Yen Kan,et al.  Product Review Summarization based on Facet Identification and Sentence Clustering , 2011, ArXiv.

[102]  David Allen,et al.  Information overload: context and causes , 2003 .

[103]  Johannes Fürnkranz,et al.  Interactive Data Analytics for the Humanities , 2017, CICLing.

[104]  Francine Chen,et al.  A trainable document summarizer , 1995, SIGIR '95.

[105]  Joydeep Ghosh,et al.  Review quality aware collaborative filtering , 2012, RecSys '12.

[106]  Benjamin Van Durme,et al.  Annotated Gigaword , 2012, AKBC-WEKEX@NAACL-HLT.

[107]  Lei Zheng,et al.  Joint Deep Modeling of Users and Items Using Reviews for Recommendation , 2017, WSDM.

[108]  Stephan Oepen,et al.  39th Annual Meeting and 10th Conference of the European Chapter , 2001 .

[109]  David Kempe,et al.  A General Framework for Robust Interactive Learning , 2017, NIPS.

[110]  Longbing Cao,et al.  Attention-Based Transactional Context Embedding for Next-Item Recommendation , 2018, AAAI.

[111]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[112]  Mirella Lapata,et al.  WikiSimple: Automatic Simplification of Wikipedia Articles , 2011, AAAI.

[113]  Maite Taboada,et al.  Using New York Times Picks to Identify Constructive Comments , 2017, NLPmJ@EMNLP.

[114]  Jure Leskovec,et al.  Hidden factors and hidden topics: understanding rating dimensions with review text , 2013, RecSys.

[115]  Yen-Chun Chen,et al.  Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting , 2018, ACL.

[116]  Sebastian Ruder,et al.  Universal Language Model Fine-tuning for Text Classification , 2018, ACL.

[117]  Iryna Gurevych,et al.  New Collection Announcement: Focused Retrieval Over the Web , 2016, SIGIR.

[118]  Seth Flaxman,et al.  European Union Regulations on Algorithmic Decision-Making and a "Right to Explanation" , 2016, AI Mag..

[119]  Houfeng Wang,et al.  Learning Summary Prior Representation for Extractive Summarization , 2015, ACL.

[120]  Germán Sanchis-Trilles,et al.  CASMACAT: A Computer-assisted Translation Workbench , 2014, EACL.

[121]  Silvia Bernardini,et al.  BootCaT: Bootstrapping Corpora and Terms from the Web , 2004, LREC.

[122]  George Karypis,et al.  Item-based top-N recommendation algorithms , 2004, TOIS.

[123]  David Cohn,et al.  Active Learning , 2010, Encyclopedia of Machine Learning.

[124]  Tat-Seng Chua,et al.  Neural Collaborative Filtering , 2017, WWW.

[125]  Wenpeng Yin,et al.  Optimizing Sentence Modeling and Selection for Document Summarization , 2015, IJCAI.

[126]  Simon Corston-Oliver,et al.  Text compaction for display on very small screens , 2001 .

[127]  Jon Atle Gulla,et al.  The Adressa dataset for news recommendation , 2017, WI.

[128]  Eugene Charniak,et al.  Immediate-Head Parsing for Language Models , 2001, ACL.

[129]  Shujian Huang,et al.  Deep Matrix Factorization Models for Recommender Systems , 2017, IJCAI.

[130]  Gökhan Tür,et al.  Statistical Sentence Extraction for Information Distillation , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[131]  Bowen Zhou,et al.  Abstractive Text Summarization using Sequence-to-sequence RNNs and Beyond , 2016, CoNLL.

[132]  M. de Rijke,et al.  Social Collaborative Viewpoint Regression with Explainable Recommendations , 2017, WSDM.

[133]  Ralf Steinberger Multilingual and Cross-Lingual News Analysis in the Europe Media Monitor (EMM) (Extended Abstract) , 2013, IRFC.

[134]  Judith Eckle-Kohler,et al.  Supervised Learning of Automatic Pyramid for Optimization-Based Multi-Document Summarization , 2017, ACL.

[135]  E. Thorsen,et al.  Seven Characteristics Defining Online News Formats , 2018, Digital Journalism.

[136]  Jingbo Zhu,et al.  Learning a Stopping Criterion for Active Learning for Word Sense Disambiguation and Text Classification , 2008, IJCNLP.

[137]  John Platt,et al.  Probabilistic Outputs for Support vector Machines and Comparisons to Regularized Likelihood Methods , 1999 .

[138]  Christian M. Meyer,et al.  Data-efficient Neural Text Compression with Interactive Learning , 2019, NAACL.

[139]  Jason Weston,et al.  A Neural Attention Model for Abstractive Sentence Summarization , 2015, EMNLP.

[140]  Yasemin Altun,et al.  Overcoming the Lack of Parallel Data in Sentence Compression , 2013, EMNLP.

[141]  Kathleen McKeown,et al.  Content Selection in Deep Learning Models of Summarization , 2018, EMNLP.

[142]  Naoaki Okazaki,et al.  Neural Headline Generation on Abstract Meaning Representation , 2016, EMNLP.

[143]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[144]  Zhang Zuping,et al.  A Hierarchical Structured Self-Attentive Model for Extractive Document Summarization (HSSAS) , 2018, IEEE Access.

[145]  Hsiang Iris Chyi,et al.  News and the Overloaded Consumer: Factors Influencing Information Overload Among News Consumers , 2012, Cyberpsychology Behav. Soc. Netw..

[146]  Marko Grobelnik,et al.  News Across Languages - Cross-Lingual Document Similarity and Event Tracking , 2015, J. Artif. Intell. Res..

[147]  Anders Søgaard,et al.  Deep multi-task learning with low level tasks supervised at lower layers , 2016, ACL.

[148]  Daniel Marcu,et al.  Summarization beyond sentence extraction: A probabilistic approach to sentence compression , 2002, Artif. Intell..

[149]  Rui Zhang,et al.  Graph-based Neural Multi-Document Summarization , 2017, CoNLL.

[150]  Mirella Lapata,et al.  Sentence Compression Beyond Word Deletion , 2008, COLING.

[151]  Paul Over,et al.  Intrinsic Evaluation of Generic News Text Summarization Systems , 2003 .

[152]  Christian M. Meyer,et al.  Live Blog Corpus for Summarization , 2018, LREC.

[153]  Christopher D. Manning,et al.  Get To The Point: Summarization with Pointer-Generator Networks , 2017, ACL.

[154]  Judith Eckle-Kohler,et al.  Optimizing an Approximation of ROUGE - a Problem-Reduction Approach to Extractive Multi-Document Summarization , 2016, ACL.

[155]  Tobias Falke,et al.  Automatic Structured Text Summarization with Concept Maps , 2019 .

[156]  David A. Cohn,et al.  Improving generalization with active learning , 1994, Machine Learning.

[157]  Barry Smyth,et al.  A multi-criteria evaluation of a user generated content based recommender system , 2011, RecSys 2011.

[158]  Bing Liu,et al.  Sentiment Analysis and Subjectivity , 2010, Handbook of Natural Language Processing.

[159]  Gregory Grefenstette Producing Intelligent Telegraphic Text Reduction to provide an Audio Scanning Service for the Blind , 1998 .

[160]  Neil Yorke-Smith,et al.  A Novel Bayesian Similarity Measure for Recommender Systems , 2013, IJCAI.

[161]  Boi Faltings,et al.  Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence Recommendation Using Textual Opinions , 2022 .

[162]  Mirella Lapata,et al.  Multiple Aspect Summarization Using Integer Linear Programming , 2012, EMNLP.

[163]  Dong-Hong Ji,et al.  Context-Enhanced Personalized Social Summarization , 2012, COLING.

[164]  Jiawei Han,et al.  Opinosis: A Graph Based Approach to Abstractive Summarization of Highly Redundant Opinions , 2010, COLING.

[165]  Silvio Savarese,et al.  Active Learning for Convolutional Neural Networks: A Core-Set Approach , 2017, ICLR.

[166]  Craig Boutilier,et al.  Optimal Bayesian Recommendation Sets and Myopically Optimal Choice Query Sets , 2010, NIPS.

[167]  Gerald DeJong Automatic Schema Acquisition in a Natural Language Environment , 1982, AAAI.

[168]  Dominik Benz,et al.  The social bookmark and publication management system bibsonomy , 2010, The VLDB Journal.

[169]  Jason Weston,et al.  A unified architecture for natural language processing: deep neural networks with multitask learning , 2008, ICML '08.

[170]  Einar Thorsen,et al.  Live Blogging and Social Media Curation: Challenges and Opportunities for Journalism , 2013 .

[171]  Anton van den Hengel,et al.  Image-Based Recommendations on Styles and Substitutes , 2015, SIGIR.

[172]  Udo Hahn,et al.  Proceedings of the ACL-02 Workshop on Automatic Summarization - Volume 4 , 2002 .

[173]  Ludovic Denoyer,et al.  Learning social network embeddings for predicting information diffusion , 2014, WSDM.

[174]  Bowen Zhou,et al.  SummaRuNNer: A Recurrent Neural Network Based Sequence Model for Extractive Summarization of Documents , 2016, AAAI.

[175]  Paul Over,et al.  DUC in context , 2007, Inf. Process. Manag..

[176]  Dan Roth,et al.  Incidental Supervision: Moving beyond Supervised Learning , 2017, AAAI.

[177]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[178]  Bipin Suresh Stanford Inclusion of large input corpora in Statistical Machine Translation , 2010 .

[179]  Ani Nenkova,et al.  A Survey of Text Summarization Techniques , 2012, Mining Text Data.

[180]  Mirella Lapata,et al.  Models for Sentence Compression: A Comparison across Domains, Training Requirements and Evaluation Measures , 2006, ACL.

[181]  Ruslan Salakhutdinov,et al.  Probabilistic Matrix Factorization , 2007, NIPS.

[182]  Helen Petrie,et al.  The Evaluation of Accessibility, Usability, and User Experience , 2009, The Universal Access Handbook.

[183]  Xiaoli Li,et al.  Cooperative Hybrid Semi-Supervised Learning for Text Sentiment Classification , 2019, Symmetry.

[184]  Min Yang,et al.  Generative Adversarial Network for Abstractive Text Summarization , 2017, AAAI.

[185]  Gerhard Weikum,et al.  Exploring Latent Semantic Factors to Find Useful Product Reviews , 2017, SDM.

[186]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[187]  Freddy Lécué,et al.  Explainable AI: The New 42? , 2018, CD-MAKE.

[188]  Walter Daelemans,et al.  On the Limits of Sentence Compression by Deletion , 2010, Empirical Methods in Natural Language Generation.

[189]  Li Chen,et al.  Sentiment-enhanced explanation of product recommendations , 2014, WWW.

[190]  Yehuda Koren,et al.  Factorization meets the neighborhood: a multifaceted collaborative filtering model , 2008, KDD.

[191]  Sean M. McNee,et al.  Improving recommendation lists through topic diversification , 2005, WWW '05.

[192]  Katja Filippova,et al.  Multi-Sentence Compression: Finding Shortest Paths in Word Graphs , 2010, COLING.

[193]  Ying Liu,et al.  Active Learning with Support Vector Machine Applied to Gene Expression Data for Cancer Classification , 2004, J. Chem. Inf. Model..

[194]  Chao Liu,et al.  Wisdom of the better few: cold start recommendation via representative based rating elicitation , 2011, RecSys '11.

[195]  Krys J. Kochut,et al.  Text Summarization Techniques: A Brief Survey , 2017, International Journal of Advanced Computer Science and Applications.

[196]  Constantin Orasan,et al.  Computer-aided summarisation – what the user really wants , 2006, LREC.

[197]  Furu Wei,et al.  Improving Multi-Document Summarization via Text Classification , 2016, AAAI.

[198]  Barry Smyth,et al.  On the real-time web as a source of recommendation knowledge , 2010, RecSys '10.

[199]  Klaus Zechner,et al.  Automatic Summarization of Open-Domain Multiparty Dialogues in Diverse Genres , 2002, CL.

[200]  Yi Zhang,et al.  Incorporating Diversity and Density in Active Learning for Relevance Feedback , 2007, ECIR.

[201]  Yang Zhao,et al.  A Language Model based Evaluator for Sentence Compression , 2018, ACL.

[202]  Horacio Saggion,et al.  Generating Indicative-Informative Summaries with SumUM , 2002, Computational Linguistics.

[203]  Emiel Krahmer,et al.  Abstractive Compression of Captions with Attentive Recurrent Neural Networks , 2016, INLG.

[204]  Ted Dunning,et al.  Accurate Methods for the Statistics of Surprise and Coincidence , 1993, CL.

[205]  Benjamin M. Marlin,et al.  Modeling User Rating Profiles For Collaborative Filtering , 2003, NIPS.

[206]  Hongyan Jing,et al.  Sentence Reduction for Automatic Text Summarization , 2000, ANLP.

[207]  G. Carenini,et al.  A Publicly Available Annotated Corpus for Supervised Email Summarization , 2008 .

[208]  Kai Hong,et al.  Improving the Estimation of Word Importance for News Multi-Document Summarization , 2014, EACL.

[209]  Benoît Favre,et al.  Concept-based Summarization using Integer Linear Programming: From Concept Pruning to Multiple Optimal Solutions , 2015, EMNLP.

[210]  Bowen Zhou,et al.  Pointing the Unknown Words , 2016, ACL.

[211]  Xiaoyu Du,et al.  Outer Product-based Neural Collaborative Filtering , 2018, IJCAI.

[212]  Phil Blunsom,et al.  Language as a Latent Variable: Discrete Generative Models for Sentence Compression , 2016, EMNLP.

[213]  Yoshua Bengio,et al.  Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.

[214]  Hui Lin,et al.  A Repository of State of the Art and Competitive Baseline Summaries for Generic News Summarization , 2014, LREC.

[215]  Ming Zhou,et al.  TGSum: Build Tweet Guided Multi-Document Summarization Dataset , 2015, AAAI.

[216]  Thomas Hofmann,et al.  Large Margin Methods for Structured and Interdependent Output Variables , 2005, J. Mach. Learn. Res..

[217]  David Kauchak,et al.  Simple English Wikipedia: A New Text Simplification Task , 2011, ACL.

[218]  Peter Auer,et al.  Finite-time Analysis of the Multiarmed Bandit Problem , 2002, Machine Learning.

[219]  Dan Klein,et al.  Jointly Learning to Extract and Compress , 2011, ACL.

[220]  Johannes Fürnkranz,et al.  What's Important in a Text? An Extensive Evaluation of Linguistic Annotations for Summarization , 2018, 2018 Fifth International Conference on Social Networks Analysis, Management and Security (SNAMS).

[221]  Ani Nenkova,et al.  Can You Summarize This? Identifying Correlates of Input Difficulty for Multi-Document Summarization , 2008, ACL.

[222]  Yi Zhang,et al.  Bayesian graphical models for adaptive filtering , 2005, SIGF.

[223]  Iryna Gurevych,et al.  Bridging the gap between extractive and abstractive summaries: Creation and evaluation of coherent extracts from heterogeneous sources , 2016, COLING.

[224]  Karen Spärck Jones A statistical interpretation of term specificity and its application in retrieval , 2021, J. Documentation.

[225]  Dragomir R. Radev,et al.  LexRank: Graph-based Lexical Centrality as Salience in Text Summarization , 2004, J. Artif. Intell. Res..

[226]  Dit-Yan Yeung,et al.  Collaborative Deep Learning for Recommender Systems , 2014, KDD.

[227]  Bhargav Srinivasa Desikan,et al.  Natural Language Processing and Computational Linguistics , 2018 .

[228]  J. Clarke,et al.  Global inference for sentence compression : an integer linear programming approach , 2008, J. Artif. Intell. Res..

[229]  Timothy C. Craven Abstracts produced using computer assistance , 2000, J. Am. Soc. Inf. Sci..

[230]  Mirella Lapata,et al.  Modelling Compression with Discourse Constraints , 2007, EMNLP.

[231]  John Gantz,et al.  The Digital Universe in 2020: Big Data, Bigger Digital Shadows, and Biggest Growth in the Far East , 2012 .

[232]  Filip Ginter,et al.  Sentence Compression For Automatic Subtitling , 2015, NODALIDA.

[233]  Christopher D. Manning,et al.  Effective Approaches to Attention-based Neural Machine Translation , 2015, EMNLP.

[234]  Aljosha Karim Schapals,et al.  Live blogs, sources, and objectivity: The contradictions of real-time online reporting , 2016 .

[235]  Gholamreza Haffari,et al.  Learning to Actively Learn Neural Machine Translation , 2018, CoNLL.

[236]  Francisco Casacuberta,et al.  Active Learning for Interactive Neural Machine Translation of Data Streams , 2018, CoNLL.

[237]  Shashi Narayan,et al.  Hybrid Simplification using Deep Semantics and Machine Translation , 2014, ACL.

[238]  Dan Klein,et al.  Accurate Unlexicalized Parsing , 2003, ACL.

[239]  Yi Pan,et al.  Sentence Compression for Automated Subtitling: A Hybrid Approach , 2004, ACL 2004.

[240]  Regina Barzilay,et al.  Sentence Fusion for Multidocument News Summarization , 2005, CL.

[241]  Wei-Ying Ma,et al.  A Study for Document Summarization Based on Personal Annotation , 2003, HLT-NAACL 2003.

[242]  Mirella Lapata,et al.  Neural Extractive Summarization with Side Information , 2017, ArXiv.

[243]  Hoa Trang Dang,et al.  Overview of DUC 2005 , 2005 .

[244]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[245]  M. de Rijke,et al.  Sentence Relations for Extractive Summarization with Deep Neural Networks , 2018, ACM Trans. Inf. Syst..

[246]  Hung-yi Lee,et al.  Learning to Encode Text as Human-Readable Summaries using Generative Adversarial Networks , 2018, EMNLP.

[247]  Pasquale Lops,et al.  Content-based Recommender Systems: State of the Art and Trends , 2011, Recommender Systems Handbook.

[248]  Mark Craven,et al.  An Analysis of Active Learning Strategies for Sequence Labeling Tasks , 2008, EMNLP.

[249]  Chance York,et al.  Overloaded By the News: Effects of News Exposure and Enjoyment on Reporting Information Overload , 2013 .

[250]  Naomie Salim,et al.  Clustered genetic semantic graph approach for multi-document abstractive summarization , 2016, 2016 International Conference on Intelligent Systems Engineering (ICISE).

[251]  Iryna Gurevych,et al.  APRIL: Interactively Learning to Summarise by Combining Active Preference Learning and Reinforcement Learning , 2018, EMNLP.

[252]  Yiqun Liu,et al.  Neural Attentional Rating Regression with Review-level Explanations , 2018, WWW.

[253]  George Giannakopoulos,et al.  TAC2011 MultiLing Pilot Overview , 2011, TAC.

[254]  Steven R. Edscorn The Routledge Companion to Digital Journalism Studies , 2017 .

[255]  George F. Foster,et al.  Adaptive Language and Translation Models for Interactive Machine Translation , 2004, EMNLP.

[256]  Gholamreza Haffari,et al.  Active Learning for Multilingual Statistical Machine Translation , 2009, ACL.

[257]  Stuart M. Shieber,et al.  Synchronous Tree-Adjoining Grammars , 1990, COLING.

[258]  Mikel L. Forcada,et al.  Black-box integration of heterogeneous bilingual resources into an interactive translation system , 2014, HaCaT@EACL.

[259]  Jenq-Neng Hwang,et al.  Uncertainty sampling based active learning with diversity constraint by sparse selection , 2017, 2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP).

[260]  Dragomir R. Radev,et al.  Introduction to the Special Issue on Summarization , 2002, CL.

[261]  Lu Wang,et al.  Neural Network-Based Abstract Generation for Opinions and Arguments , 2016, NAACL.

[262]  Lukasz Kaiser,et al.  Sentence Compression by Deletion with LSTMs , 2015, EMNLP.

[263]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[264]  Kathleen McKeown,et al.  Lexicalized Markov Grammars for Sentence Compression , 2007, NAACL.

[265]  Mirella Lapata,et al.  An abstractive approach to sentence compression , 2013, TIST.

[266]  John M. Conroy,et al.  An Assessment of the Accuracy of Automatic Evaluation in Summarization , 2012, EvalMetrics@NAACL-HLT.

[267]  Constantin Orasan,et al.  CAST: A computer-aided summarisation tool , 2003, EACL.

[268]  John K. Debenham,et al.  Informed Recommender: Basing Recommendations on Consumer Product Reviews , 2007, IEEE Intelligent Systems.

[269]  George Sylvie The Elements of Journalism: What Newspeople Should Know and the Public Should Expect , 2001 .

[270]  Heike Adel,et al.  Comparing Convolutional Neural Networks to Traditional Models for Slot Filling , 2016, NAACL.

[271]  Kazi Mostak Gausul Hoq Information Overload: Causes, Consequences and Remedies - A Study , 2016 .

[272]  Dianne P. O'Leary,et al.  Text summarization via hidden Markov models , 2001, SIGIR '01.

[273]  Dragomir R. Radev,et al.  Centroid-based summarization of multiple documents: sentence extraction, utility-based evaluation, and user studies , 2000, ArXiv.

[274]  Rada Mihalcea,et al.  TextRank: Bringing Order into Text , 2004, EMNLP.

[275]  Francisco Casacuberta,et al.  Active learning for interactive machine translation , 2012, EACL.

[276]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[277]  Jeffrey Heer,et al.  The Effects of Interactive Latency on Exploratory Visual Analysis , 2014, IEEE Transactions on Visualization and Computer Graphics.

[278]  Kenneth Y. Goldberg,et al.  Eigentaste: A Constant Time Collaborative Filtering Algorithm , 2001, Information Retrieval.

[279]  Ming Zhou,et al.  A Redundancy-Aware Sentence Regression Framework for Extractive Summarization , 2016, COLING.

[280]  Hang Li,et al.  “ Tony ” DNN Embedding for “ Tony ” Selective Read for “ Tony ” ( a ) Attention-based Encoder-Decoder ( RNNSearch ) ( c ) State Update s 4 SourceVocabulary Softmax Prob , 2016 .

[281]  Anne Schuth,et al.  Online Learning to Rank for Recommender Systems , 2017, RecSys.

[282]  Klaus Brinker,et al.  Incorporating Diversity in Active Learning with Support Vector Machines , 2003, ICML.

[283]  Russell Greiner,et al.  Optimistic Active-Learning Using Mutual Information , 2007, IJCAI.

[284]  Dragomir R. Radev,et al.  NewsInEssence: summarizing online news topics , 2005, Commun. ACM.

[285]  Jian Su,et al.  Multi-Criteria-based Active Learning for Named Entity Recognition , 2004, ACL.

[286]  Navdeep Jaitly,et al.  Pointer Networks , 2015, NIPS.

[287]  Daniel Marcu,et al.  Statistical Phrase-Based Translation , 2003, NAACL.

[288]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[289]  Milica Gasic,et al.  Phrase-Based Statistical Language Generation Using Graphical Models and Active Learning , 2010, ACL.

[290]  Judith Eckle-Kohler,et al.  The Next Step for Multi-Document Summarization: A Heterogeneous Multi-Genre Corpus Built with a Novel Construction Approach , 2016, COLING.

[291]  P. V. S. Avinesh,et al.  Joint Optimization of User-desired Content in Multi-document Summaries by Learning from User Feedback , 2017, ACL.

[292]  Ryan T. McDonald A Study of Global Inference Algorithms in Multi-document Summarization , 2007, ECIR.

[293]  Judith Masthoff,et al.  Designing and Evaluating Explanations for Recommender Systems , 2011, Recommender Systems Handbook.

[294]  Hoa Trang Dang,et al.  Overview of DUC 2006 , 2006 .

[295]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.