Storylines for structuring massive streams of news

Stories are the most natural ways for people to deal with information about the changing world. They provide an efficient schematic structure to order and relate events according to some explanation. We describe (1) a formal model for representing storylines to handle streams of news and (2) a first implementation of a system that automatically extracts the ingredients of a storyline from news articles according to the model. Our model mimics the basic notions from narratology by adding bridging relations to timelines of events in relation to a climax point. We provide a method for defining the climax score of each event and the bridging relations between them. We generate a JSON structure for any set of news articles to represent the different stories they contain and visualize these stories on a timeline with climax and bridging relations. This visualization helps inspecting the validity of the generated structures.

[1]  M. Bal,et al.  Narratology: Introduction to the Theory of Narrative , 1988 .

[2]  Antske Fokkens,et al.  NAF and GAF: Linking Linguistic Annotations , 2014 .

[3]  John B. Lowe,et al.  The Berkeley FrameNet Project , 1998, ACL.

[4]  J. Bruner Acts of meaning , 1990 .

[5]  Dekang Lin,et al.  Automatic Retrieval and Clustering of Similar Words , 1998, ACL.

[6]  Wilson L. Taylor,et al.  “Cloze Procedure”: A New Tool for Measuring Readability , 1953 .

[7]  Dragomir R. Radev,et al.  LexRank: Graph-based Lexical Centrality as Salience in Text Summarization , 2004, J. Artif. Intell. Res..

[8]  Mirella Lapata,et al.  Learning to Tell Tales: A Data-driven Approach to Story Generation , 2009, ACL.

[9]  Lora Aroyo,et al.  Digital hermeneutics: Agora and the online understanding of cultural heritage , 2011, WebSci '11.

[10]  Vladimir Propp,et al.  Morphology of the folktale , 1959 .

[11]  Véronique Malaisé,et al.  Design and use of the Simple Event Model (SEM) , 2011, J. Web Semant..

[12]  Gaye Tuchman Making News by Doing Work: Routinizing the Unexpected , 1973, American Journal of Sociology.

[13]  Xavier Tannier,et al.  Ranking Multidocument Event Descriptions for Building Thematic Timelines , 2014, COLING.

[14]  Yan Zhang,et al.  Summarizing Complex Events: a Cross-Modal Solution of Storylines Extraction and Reconstruction , 2013, EMNLP.

[15]  Nathanael Chambers,et al.  Unsupervised Learning of Narrative Schemas and their Participants , 2009, ACL.

[16]  Lifu Huang,et al.  Optimized Event Storyline Generation based on Mixture-Event-Aspect Model , 2013, EMNLP.

[17]  Eneko Agirre,et al.  SemEval-2015 Task 4: TimeLine: Cross-Document Event Ordering , 2015, *SEMEVAL.